TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FOTS: Fast Oriented Text Spotting with a Unified Network

FOTS: Fast Oriented Text Spotting with a Unified Network

Xuebo Liu, Ding Liang, Shi Yan, Dagui Chen, Yu Qiao, Junjie Yan

2018-01-05CVPR 2018 6Scene Text RecognitionScene Text DetectionText SpottingText Detection
PaperPDFCodeCodeCodeCodeCodeCodeCode

Abstract

Incidental scene text spotting is considered one of the most difficult and valuable challenges in the document analysis community. Most existing methods treat text detection and recognition as separate tasks. In this work, we propose a unified end-to-end trainable Fast Oriented Text Spotting (FOTS) network for simultaneous detection and recognition, sharing computation and visual information among the two complementary tasks. Specially, RoIRotate is introduced to share convolutional features between detection and recognition. Benefiting from convolution sharing strategy, our FOTS has little computation overhead compared to baseline text detection network, and the joint training method learns more generic features to make our method perform better than these two-stage methods. Experiments on ICDAR 2015, ICDAR 2017 MLT, and ICDAR 2013 datasets demonstrate that the proposed method outperforms state-of-the-art methods significantly, which further allows us to develop the first real-time oriented text spotting system which surpasses all previous state-of-the-art results by more than 5% on ICDAR 2015 text spotting task while keeping 22.6 fps.

Results

TaskDatasetMetricValueModel
Text SpottingICDAR 2015F-measure (%) - Generic Lexicon62.2FOTS
Text SpottingICDAR 2015F-measure (%) - Strong Lexicon83.6FOTS
Text SpottingICDAR 2015F-measure (%) - Weak Lexicon74.5FOTS
Scene Text DetectionICDAR 2017 MLTPrecision81.86FOTS MS
Scene Text DetectionICDAR 2017 MLTRecall62.3FOTS MS
Scene Text DetectionICDAR 2017 MLTPrecision80.95FOTS
Scene Text DetectionICDAR 2017 MLTRecall57.51FOTS
Scene Text DetectionICDAR 2015F-Measure89.84FOTS MS
Scene Text DetectionICDAR 2015Precision91.85FOTS MS
Scene Text DetectionICDAR 2015Recall87.92FOTS MS
Scene Text DetectionICDAR 2015F-Measure87.99FOTS
Scene Text DetectionICDAR 2015Precision91FOTS
Scene Text DetectionICDAR 2015Recall85.17FOTS

Related Papers

AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models2025-07-07PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning2025-06-18Text-Aware Image Restoration with Diffusion Models2025-06-11Task-driven real-world super-resolution of document scans2025-06-08CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media2025-06-05Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors2025-05-30GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking2025-05-28The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection2025-05-21