TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/End-to-end Handwritten Paragraph Text Recognition Using a ...

End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

Denis Coquenet, Clément Chatelain, Thierry Paquet

2020-12-07Handwritten Text Recognition
PaperPDFCode(official)

Abstract

Unconstrained handwritten text recognition remains challenging for computer vision systems. Paragraph text recognition is traditionally achieved by two models: the first one for line segmentation and the second one for text line recognition. We propose a unified end-to-end model using hybrid attention to tackle this task. This model is designed to iteratively process a paragraph image line by line. It can be split into three modules. An encoder generates feature maps from the whole paragraph image. Then, an attention module recurrently generates a vertical weighted mask enabling to focus on the current text line features. This way, it performs a kind of implicit line segmentation. For each text line features, a decoder module recognizes the character sequence associated, leading to the recognition of a whole paragraph. We achieve state-of-the-art character error rate at paragraph level on three popular datasets: 1.91% for RIMES, 4.45% for IAM and 3.59% for READ 2016. Our code and trained model weights are available at https://github.com/FactoDeepLearning/VerticalAttentionOCR.

Results

TaskDatasetMetricValueModel
Optical Character Recognition (OCR)IAM(line-level)Test CER5VAN
Optical Character Recognition (OCR)IAM(line-level)Test WER16.3VAN
Optical Character Recognition (OCR)IAMCER4.32VAN
Optical Character Recognition (OCR)IAMWER16.24VAN
Optical Character Recognition (OCR)READ2016(line-level)Test CER4.1VAN
Optical Character Recognition (OCR)READ2016(line-level)Test WER16.3VAN
Handwritten Text RecognitionIAM(line-level)Test CER5VAN
Handwritten Text RecognitionIAM(line-level)Test WER16.3VAN
Handwritten Text RecognitionIAMCER4.32VAN
Handwritten Text RecognitionIAMWER16.24VAN
Handwritten Text RecognitionREAD2016(line-level)Test CER4.1VAN
Handwritten Text RecognitionREAD2016(line-level)Test WER16.3VAN

Related Papers

Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques2025-07-08Learning to Align: Addressing Character Frequency Distribution Shifts in Handwritten Text Recognition2025-06-11MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning2025-05-26Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition2025-04-11Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition2025-04-04TRIDIS: A Comprehensive Medieval and Early Modern Corpus for HTR and NER2025-03-25Benchmarking Large Language Models for Handwritten Text Recognition2025-03-19Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription2025-02-27