WordSup: Exploiting Word Annotations for Character based Text Detection

Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, Junyu Han, Errui Ding

2017-08-22ICCV 2017 10Math Scene Text Detection Text Detection

Abstract

Imagery texts are usually organized as a hierarchy of several visual elements, i.e. characters, words, text lines and text blocks. Among these elements, character is the most basic one for various languages such as Western, Chinese, Japanese, mathematical expression and etc. It is natural and convenient to construct a common text detection engine based on character detectors. However, training character detectors requires a vast of location annotated characters, which are expensive to obtain. Actually, the existing real text datasets are mostly annotated in word or line level. To remedy this dilemma, we propose a weakly supervised framework that can utilize word annotations, either in tight quadrangles or the more loose bounding boxes, for character detector training. When applied in scene text detection, we are thus able to train a robust character detector by exploiting word annotations in the rich large-scale real scene text datasets, e.g. ICDAR15 and COCO-text. The character detector acts as a key role in the pipeline of our text detection engine. It achieves the state-of-the-art performance on several challenging scene text detection benchmarks. We also demonstrate the flexibility of our pipeline by various scenarios, including deformed text detection and math expression recognition.

Results

Task	Dataset	Metric	Value	Model
Scene Text Detection	ICDAR 2013	Precision	93.34	WordSup (VGG16-synth-icdar)
Scene Text Detection	ICDAR 2013	Recall	87.53	WordSup (VGG16-synth-icdar)
Scene Text Detection	ICDAR 2015	F-Measure	77	SSTD
Scene Text Detection	ICDAR 2015	Precision	80	SSTD
Scene Text Detection	ICDAR 2015	Recall	73	SSTD
Scene Text Detection	COCO-Text	F-Measure	36.8	WordSup (VGG16-synth-coco)
Scene Text Detection	COCO-Text	Precision	45.2	WordSup (VGG16-synth-coco)
Scene Text Detection	COCO-Text	Recall	30.9	WordSup (VGG16-synth-coco)

WordSup: Exploiting Word Annotations for Character based Text Detection

Abstract

Results

Related Papers

WordSup: Exploiting Word Annotations for Character based Text Detection

Abstract

Results

Related Papers