Multi-Oriented Text Detection with Fully Convolutional Networks

Zheng Zhang, Chengquan Zhang, Wei Shen, Cong Yao, Wenyu Liu, Xiang Bai

2016-04-14CVPR 2016 6Scene Text Detection Text Detection

Abstract

In this paper, we propose a novel approach for text detec- tion in natural images. Both local and global cues are taken into account for localizing text lines in a coarse-to-fine pro- cedure. First, a Fully Convolutional Network (FCN) model is trained to predict the salient map of text regions in a holistic manner. Then, text line hypotheses are estimated by combining the salient map and character components. Fi- nally, another FCN classifier is used to predict the centroid of each character, in order to remove the false hypotheses. The framework is general for handling text in multiple ori- entations, languages and fonts. The proposed method con- sistently achieves the state-of-the-art performance on three text detection benchmarks: MSRA-TD500, ICDAR2015 and ICDAR2013.

Results

Task	Dataset	Metric	Value	Model
Scene Text Detection	ICDAR 2015	F-Measure	75	SegLink
Scene Text Detection	ICDAR 2015	Precision	73.1	SegLink
Scene Text Detection	ICDAR 2015	Recall	76.8	SegLink

Related Papers

AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models2025-07-07 PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning2025-06-18 Task-driven real-world super-resolution of document scans2025-06-08 CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media2025-06-05 Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors2025-05-30 The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection2025-05-21 Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches2025-05-21 AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection2025-05-21