TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Single Shot Text Detector with Regional Attention

Single Shot Text Detector with Regional Attention

Pan He, Weilin Huang, Tong He, Qile Zhu, Yu Qiao, Xiaolin Li

2017-09-01ICCV 2017 10Scene Text Detection
PaperPDFCode

Abstract

We present a novel single-shot text detector that directly outputs word-level bounding boxes in a natural image. We propose an attention mechanism which roughly identifies text regions via an automatically learned attentional map. This substantially suppresses background interference in the convolutional features, which is the key to producing accurate inference of words, particularly at extremely small sizes. This results in a single model that essentially works in a coarse-to-fine manner. It departs from recent FCN- based text detectors which cascade multiple FCN models to achieve an accurate prediction. Furthermore, we develop a hierarchical inception module which efficiently aggregates multi-scale inception features. This enhances local details, and also encodes strong context information, allow- ing the detector to work reliably on multi-scale and multi- orientation text with single-scale images. Our text detector achieves an F-measure of 77% on the ICDAR 2015 bench- mark, advancing the state-of-the-art results in [18, 28]. Demo is available at: http://sstd.whuang.org/.

Results

TaskDatasetMetricValueModel
Scene Text DetectionICDAR 2013Precision88SSTD
Scene Text DetectionICDAR 2013Recall86SSTD
Scene Text DetectionICDAR 2015F-Measure80.7EAST + PVANET2x RBOX (multi-scale)
Scene Text DetectionICDAR 2015Precision83.3EAST + PVANET2x RBOX (multi-scale)
Scene Text DetectionICDAR 2015Recall78.3EAST + PVANET2x RBOX (multi-scale)
Scene Text DetectionCOCO-TextF-Measure37SSTD
Scene Text DetectionCOCO-TextPrecision46SSTD
Scene Text DetectionCOCO-TextRecall31SSTD

Related Papers

The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection2025-05-21Explicit Relational Reasoning Network for Scene Text Detection2024-12-19KhmerST: A Low-Resource Khmer Scene Text Detection and Recognition Benchmark2024-10-23Spotlight Text Detector: Spotlight on Candidate Regions Like a Camera2024-09-25Region Prompt Tuning: Fine-grained Scene Text Detection Utilizing Region Text Prompt2024-09-20Revisiting Tampered Scene Text Detection in the Era of Generative AI2024-07-31Towards Unified Multi-granularity Text Detection with Interactive Attention2024-05-30Dataset and Benchmark for Urdu Natural Scenes Text Detection, Recognition and Visual Question Answering2024-05-21