Detecting Multi-Oriented Text with Corner-based Region Proposals

Linjie Deng, Yanxiang Gong, Yi Lin, Jingwen Shuai, Xiaoguang Tu, Yuefei Zhang, Zheng Ma, Mei Xie

2018-04-08 | Scene Text Detection, Data Augmentation, Robust classification, Text Detection

Abstract

Previous approaches for scene text detection usually rely on manually defined sliding windows. This work presents an intuitive two-stage region-based method to detect multi-oriented text without any prior knowledge regarding the textual shape. In the first stage, we estimate the possible locations of text instances by detecting and linking corners instead of shifting a set of default anchors. The resulting quadrilateral proposals are geometry-adaptive, which allows our method to cope with various text aspect ratios and orientations. In the second stage, we design a new pooling layer named Dual-RoI Pooling, which embeds data augmentation inside the region-wise subnetwork for more robust classification and regression over these proposals. Experimental results on public benchmarks confirm that the proposed method achieves performance comparable to state-of-the-art methods. The code is publicly available at https://github.com/xhzdeng/crpn
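To make the first-stage idea concrete, the following is a minimal sketch of how typed corner candidates could be combined into quadrilateral proposals. It is not the authors' linking rule, which the abstract does not specify: the brute-force enumeration, the convexity filter, the `min_area` threshold, and the names `link_corners`, `is_convex`, and `signed_area` are all assumptions made for illustration.

```python
from itertools import product


def signed_area(quad):
    """Shoelace formula; the sign depends on the vertex winding order."""
    area = 0.0
    for i in range(4):
        x1, y1 = quad[i]
        x2, y2 = quad[(i + 1) % 4]
        area += x1 * y2 - x2 * y1
    return area / 2.0


def is_convex(quad):
    """True if every consecutive pair of edges turns in the same direction."""
    signs = []
    for i in range(4):
        ax, ay = quad[i]
        bx, by = quad[(i + 1) % 4]
        cx, cy = quad[(i + 2) % 4]
        cross = (bx - ax) * (cy - by) - (by - ay) * (cx - bx)
        signs.append(cross > 0)
    return all(signs) or not any(signs)


def link_corners(corners, min_area=1.0):
    """Hypothetical linking step (assumption, not the paper's procedure).

    corners: dict with keys 'tl', 'tr', 'br', 'bl', each a list of (x, y)
    candidates for that corner type. Every combination of one corner per
    type is tried, and only convex quadrilaterals with a non-trivial area
    are kept as proposals.
    """
    proposals = []
    for quad in product(corners["tl"], corners["tr"], corners["br"], corners["bl"]):
        if is_convex(quad) and abs(signed_area(quad)) >= min_area:
            proposals.append(quad)
    return proposals


# Two top-left candidates: the second one yields a non-convex shape and is dropped.
candidates = {
    "tl": [(0, 0), (20, 10)],
    "tr": [(10, 1)],
    "br": [(11, 5)],
    "bl": [(1, 4)],
}
proposals = link_corners(candidates)
```

Because each proposal is built from the detected corners themselves, its shape follows the text instance rather than a preset anchor box, which is the sense in which the abstract calls the proposals geometry-adaptive.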

Results

Task	Dataset	Metric	Value	Model
Scene Text Detection	ICDAR 2013	Precision	91.9	Corner-based Region Proposals
Scene Text Detection	ICDAR 2013	Recall	83.9	Corner-based Region Proposals
Scene Text Detection	ICDAR 2015	F-Measure	84.5	Corner-based Region Proposals
Scene Text Detection	ICDAR 2015	Precision	88.7	Corner-based Region Proposals
Scene Text Detection	ICDAR 2015	Recall	80.7	Corner-based Region Proposals
Scene Text Detection	COCO-Text	F-Measure	59.1	Corner-based Region Proposals
Scene Text Detection	COCO-Text	Precision	55.5	Corner-based Region Proposals
Scene Text Detection	COCO-Text	Recall	63.3	Corner-based Region Proposals
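
The F-Measure rows can be cross-checked against the Precision and Recall rows, assuming F-Measure here means the usual harmonic mean of the two (the table itself does not define it). The helper name `f_measure` is introduced only for this sketch.

```python
def f_measure(precision, recall):
    """Harmonic mean of precision and recall (the standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall values copied from the table above.
icdar2015 = round(f_measure(88.7, 80.7), 1)
coco_text = round(f_measure(55.5, 63.3), 1)

print(icdar2015)  # 84.5
print(coco_text)  # 59.1
```

Both values agree with the listed F-Measure entries to one decimal place. No F-Measure row is listed for ICDAR 2013, so none is derived here.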
