TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/NCAP: Scene Text Image Super-Resolution with Non-CAtegoric...

NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior

Dongwoo Park, Suk Pil Ko

2025-04-01Super-ResolutionImage Super-Resolution
PaperPDFCode

Abstract

Scene text image super-resolution (STISR) enhances the resolution and quality of low-resolution images. Unlike previous studies that treated scene text images as natural images, recent methods using a text prior (TP), extracted from a pre-trained text recognizer, have shown strong performance. However, two major issues emerge: (1) Explicit categorical priors, like TP, can negatively impact STISR if incorrect. We reveal that these explicit priors are unstable and propose replacing them with Non-CAtegorical Prior (NCAP) using penultimate layer representations. (2) Pre-trained recognizers used to generate TP struggle with low-resolution images. To address this, most studies jointly train the recognizer with the STISR network to bridge the domain gap between low- and high-resolution images, but this can cause an overconfidence phenomenon in the prior modality. We highlight this issue and propose a method to mitigate it by mixing hard and soft labels. Experiments on the TextZoom dataset demonstrate an improvement by 3.5%, while our method significantly enhances generalization performance by 14.8\% across four text recognition datasets. Our method generalizes to all TP-guided STISR networks.

Results

TaskDatasetMetricValueModel
Super-ResolutionTextZoomASTER Overall Accuracy68.1NCAP
Super-ResolutionTextZoomAverage Accuracy63.7NCAP
Super-ResolutionTextZoomCRNN Overall Accuracy58.3NCAP
Super-ResolutionTextZoomMORAN Overall Accuracy64.6NCAP
Image Super-ResolutionTextZoomASTER Overall Accuracy68.1NCAP
Image Super-ResolutionTextZoomAverage Accuracy63.7NCAP
Image Super-ResolutionTextZoomCRNN Overall Accuracy58.3NCAP
Image Super-ResolutionTextZoomMORAN Overall Accuracy64.6NCAP
3D Object Super-ResolutionTextZoomASTER Overall Accuracy68.1NCAP
3D Object Super-ResolutionTextZoomAverage Accuracy63.7NCAP
3D Object Super-ResolutionTextZoomCRNN Overall Accuracy58.3NCAP
3D Object Super-ResolutionTextZoomMORAN Overall Accuracy64.6NCAP
16kTextZoomASTER Overall Accuracy68.1NCAP
16kTextZoomAverage Accuracy63.7NCAP
16kTextZoomCRNN Overall Accuracy58.3NCAP
16kTextZoomMORAN Overall Accuracy64.6NCAP

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-104KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models2025-06-25Unsupervised Image Super-Resolution Reconstruction Based on Real-World Degradation Patterns2025-06-20