TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-Oriented Scene Text Detection via Corner Localizatio...

Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Pengyuan Lyu, Cong Yao, Wenhao Wu, Shuicheng Yan, Xiang Bai

2018-02-25CVPR 2018 6Scene Text DetectionMulti-Oriented Scene Text Detectionobject-detectionObject DetectionText Detection
PaperPDFCode

Abstract

Previous deep learning based state-of-the-art scene text detection methods can be roughly classified into two categories. The first category treats scene text as a type of general objects and follows general object detection paradigm to localize scene text by regressing the text box locations, but troubled by the arbitrary-orientation and large aspect ratios of scene text. The second one segments text regions directly, but mostly needs complex post processing. In this paper, we present a method that combines the ideas of the two types of methods while avoiding their shortcomings. We propose to detect scene text by localizing corner points of text bounding boxes and segmenting text regions in relative positions. In inference stage, candidate boxes are generated by sampling and grouping corner points, which are further scored by segmentation maps and suppressed by NMS. Compared with previous methods, our method can handle long oriented text naturally and doesn't need complex post processing. The experiments on ICDAR2013, ICDAR2015, MSRA-TD500, MLT and COCO-Text demonstrate that the proposed algorithm achieves better or comparable results in both accuracy and efficiency. Based on VGG16, it achieves an F-measure of 84.3% on ICDAR2015 and 81.5% on MSRA-TD500.

Results

TaskDatasetMetricValueModel
Scene Text DetectionICDAR 2013Precision92Corner Localization (multi-scale)
Scene Text DetectionICDAR 2013Recall84.4Corner Localization (multi-scale)
Scene Text DetectionICDAR 2017 MLTPrecision83.8Corner Localization (single-scale)
Scene Text DetectionICDAR 2017 MLTRecall55.6Corner Localization (single-scale)
Scene Text DetectionICDAR 2017 MLTPrecision74.3Corner Localization (multi-scale)
Scene Text DetectionICDAR 2017 MLTRecall70.6Corner Localization (multi-scale)
Scene Text DetectionICDAR 2015F-Measure84.3Corner Localization (multi-scale)
Scene Text DetectionICDAR 2015Precision89.5Corner Localization (multi-scale)
Scene Text DetectionICDAR 2015Recall79.7Corner Localization (multi-scale)
Scene Text DetectionMSRA-TD500F-Measure81.5Corner Localization
Scene Text DetectionMSRA-TD500Precision87.6Corner Localization
Scene Text DetectionMSRA-TD500Recall76.2Corner Localization

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07