TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Why You Should Try the Real Data for the Scene Text Recogn...

Why You Should Try the Real Data for the Scene Text Recognition

Vladimir Loginov

2021-07-29Scene Text Recognition
PaperPDFCode(official)

Abstract

Recent works in the text recognition area have pushed forward the recognition results to the new horizons. But for a long time a lack of large human-labeled natural text recognition datasets has been forcing researchers to use synthetic data for training text recognition models. Even though synthetic datasets are very large (MJSynth and SynthTest, two most famous synthetic datasets, have several million images each), their diversity could be insufficient, compared to natural datasets like ICDAR and others. Fortunately, the recently released text-recognition annotation for OpenImages V5 dataset has comparable with synthetic dataset number of instances and more diverse examples. We have used this annotation with a Text Recognition head architecture from the Yet Another Mask Text Spotter and got comparable to the SOTA results. On some datasets we have even outperformed previous SOTA models. In this paper we also introduce a text recognition model. The model's code is available.

Results

TaskDatasetMetricValueModel
Scene ParsingSVTAccuracy94.7Yet Another Text Recognizer
Scene ParsingICDAR2015Accuracy80.2Yet Another Text Recognizer
Scene ParsingICDAR 2003Accuracy97.1Yet Another Text Recognizer
Scene ParsingICDAR2013Accuracy96.8Yet Another Text Recognizer
2D Semantic SegmentationSVTAccuracy94.7Yet Another Text Recognizer
2D Semantic SegmentationICDAR2015Accuracy80.2Yet Another Text Recognizer
2D Semantic SegmentationICDAR 2003Accuracy97.1Yet Another Text Recognizer
2D Semantic SegmentationICDAR2013Accuracy96.8Yet Another Text Recognizer
Scene Text RecognitionSVTAccuracy94.7Yet Another Text Recognizer
Scene Text RecognitionICDAR2015Accuracy80.2Yet Another Text Recognizer
Scene Text RecognitionICDAR 2003Accuracy97.1Yet Another Text Recognizer
Scene Text RecognitionICDAR2013Accuracy96.8Yet Another Text Recognizer

Related Papers

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition2025-03-24Efficient and Accurate Scene Text Recognition with Cascaded-Transformers2025-03-24Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation2025-03-20A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition2025-03-19EventSTR: A Benchmark Dataset and Baselines for Event Stream based Scene Text Recognition2025-02-13Billet Number Recognition Based on Test-Time Adaptation2025-02-13Ocean-OCR: Towards General OCR Application via a Vision-Language Model2025-01-26Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance2024-12-13