Why You Should Try the Real Data for the Scene Text Recognition

Vladimir Loginov

2021-07-29Scene Text Recognition

Abstract

Recent works in the text recognition area have pushed forward the recognition results to the new horizons. But for a long time a lack of large human-labeled natural text recognition datasets has been forcing researchers to use synthetic data for training text recognition models. Even though synthetic datasets are very large (MJSynth and SynthTest, two most famous synthetic datasets, have several million images each), their diversity could be insufficient, compared to natural datasets like ICDAR and others. Fortunately, the recently released text-recognition annotation for OpenImages V5 dataset has comparable with synthetic dataset number of instances and more diverse examples. We have used this annotation with a Text Recognition head architecture from the Yet Another Mask Text Spotter and got comparable to the SOTA results. On some datasets we have even outperformed previous SOTA models. In this paper we also introduce a text recognition model. The model's code is available.

Results

Task	Dataset	Metric	Value	Model
Scene Parsing	SVT	Accuracy	94.7	Yet Another Text Recognizer
Scene Parsing	ICDAR2015	Accuracy	80.2	Yet Another Text Recognizer
Scene Parsing	ICDAR 2003	Accuracy	97.1	Yet Another Text Recognizer
Scene Parsing	ICDAR2013	Accuracy	96.8	Yet Another Text Recognizer
2D Semantic Segmentation	SVT	Accuracy	94.7	Yet Another Text Recognizer
2D Semantic Segmentation	ICDAR2015	Accuracy	80.2	Yet Another Text Recognizer
2D Semantic Segmentation	ICDAR 2003	Accuracy	97.1	Yet Another Text Recognizer
2D Semantic Segmentation	ICDAR2013	Accuracy	96.8	Yet Another Text Recognizer
Scene Text Recognition	SVT	Accuracy	94.7	Yet Another Text Recognizer
Scene Text Recognition	ICDAR2015	Accuracy	80.2	Yet Another Text Recognizer
Scene Text Recognition	ICDAR 2003	Accuracy	97.1	Yet Another Text Recognizer
Scene Text Recognition	ICDAR2013	Accuracy	96.8	Yet Another Text Recognizer

Why You Should Try the Real Data for the Scene Text Recognition

Abstract

Results

Related Papers

Why You Should Try the Real Data for the Scene Text Recognition

Abstract

Results

Related Papers