TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Shape Robust Text Detection with Progressive Scale Expansi...

Shape Robust Text Detection with Progressive Scale Expansion Network

Xiang Li, Wenhai Wang, Wenbo Hou, Ruo-Ze Liu, Tong Lu, Jian Yang

2018-06-07Curved Text DetectionScene Text DetectionText Detection
PaperPDFCodeCodeCodeCodeCodeCode(official)CodeCodeCode

Abstract

The challenges of shape robust text detection lie in two aspects: 1) most existing quadrangular bounding box based detectors are difficult to locate texts with arbitrary shapes, which are hard to be enclosed perfectly in a rectangle; 2) most pixel-wise segmentation-based detectors may not separate the text instances that are very close to each other. To address these problems, we propose a novel Progressive Scale Expansion Network (PSENet), designed as a segmentation-based detector with multiple predictions for each text instance. These predictions correspond to different `kernels' produced by shrinking the original text instance into various scales. Consequently, the final detection can be conducted through our progressive scale expansion algorithm which gradually expands the kernels with minimal scales to the text instances with maximal and complete shapes. Due to the fact that there are large geometrical margins among these minimal kernels, our method is effective to distinguish the adjacent text instances and is robust to arbitrary shapes. The state-of-the-art results on ICDAR 2015 and ICDAR 2017 MLT benchmarks further confirm the great effectiveness of PSENet. Notably, PSENet outperforms the previous best record by absolute 6.37\% on the curve text dataset SCUT-CTW1500. Code will be available in https://github.com/whai362/PSENet.

Results

TaskDatasetMetricValueModel
Scene Text DetectionSCUT-CTW1500F-Measure81.17PSENet-1s
Scene Text DetectionSCUT-CTW1500Precision82.5PSENet-1s
Scene Text DetectionSCUT-CTW1500Recall79.89PSENet-1s
Scene Text DetectionICDAR 2017 MLTPrecision77.01PSENet-1s
Scene Text DetectionICDAR 2017 MLTRecall68.4PSENet-1s
Scene Text DetectionICDAR 2015F-Measure87.1PSENet-1s
Scene Text DetectionICDAR 2015Precision88.7PSENet-1s
Scene Text DetectionICDAR 2015Recall85.5PSENet-1s

Related Papers

AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models2025-07-07PhantomHunter: Detecting Unseen Privately-Tuned LLM-Generated Text via Family-Aware Learning2025-06-18Task-driven real-world super-resolution of document scans2025-06-08CL-ISR: A Contrastive Learning and Implicit Stance Reasoning Framework for Misleading Text Detection on Social Media2025-06-05Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors2025-05-30The Devil is in Fine-tuning and Long-tailed Problems:A New Benchmark for Scene Text Detection2025-05-21Trends and Challenges in Authorship Analysis: A Review of ML, DL, and LLM Approaches2025-05-21AGENT-X: Adaptive Guideline-based Expert Network for Threshold-free AI-generated teXt detection2025-05-21