TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Detecting Oriented Text in Natural Images by Linking Segme...

Detecting Oriented Text in Natural Images by Linking Segments

Baoguang Shi, Xiang Bai, Serge Belongie

2017-03-19CVPR 2017 7Curved Text DetectionScene Text DetectionText Detection
PaperPDFCodeCodeCodeCodeCodeCode

Abstract

Most state-of-the-art text detection methods are specific to horizontal Latin text and are not fast enough for real-time applications. We introduce Segment Linking (SegLink), an oriented text detection method. The main idea is to decompose text into two locally detectable elements, namely segments and links. A segment is an oriented box covering a part of a word or text line; A link connects two adjacent segments, indicating that they belong to the same word or text line. Both elements are detected densely at multiple scales by an end-to-end trained, fully-convolutional neural network. Final detections are produced by combining segments connected by links. Compared with previous methods, SegLink improves along the dimensions of accuracy, speed, and ease of training. It achieves an f-measure of 75.0% on the standard ICDAR 2015 Incidental (Challenge 4) benchmark, outperforming the previous best by a large margin. It runs at over 20 FPS on 512x512 images. Moreover, without modification, SegLink is able to detect long lines of non-Latin text, such as Chinese.

Results

TaskDatasetMetricValueModel
Scene Text DetectionICDAR 2013Precision87.7SegLink
Scene Text DetectionICDAR 2013Recall83SegLink
Scene Text DetectionICDAR 2015F-Measure78.2WordSup (VGG16-synth-icdar)
Scene Text DetectionICDAR 2015Precision79.3WordSup (VGG16-synth-icdar)
Scene Text DetectionICDAR 2015Recall77WordSup (VGG16-synth-icdar)
Scene Text DetectionMSRA-TD500F-Measure77SegLink
Scene Text DetectionMSRA-TD500Precision86SegLink
Scene Text DetectionMSRA-TD500Recall70SegLink

Related Papers

AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models2025-07-07PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning2025-06-18Task-driven real-world super-resolution of document scans2025-06-08CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media2025-06-05Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors2025-05-30The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection2025-05-21Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches2025-05-21AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection2025-05-21