Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

StackMix and Blot Augmentations for Handwritten Text Recognition

Alex Shonenkov, Denis Karachev, Maxim Novopoltsev, Mark Potanin, Denis Dimitrov

2021-08-26 | Text Generation | Handwritten Text Recognition | Data Augmentation | HTR

Abstract

This paper proposes a handwritten text recognition (HTR) system that outperforms current state-of-the-art methods. The comparison was carried out on three of the datasets most frequently used for the HTR task, namely Bentham, IAM, and Saint Gall. In addition, results are provided on two recently presented datasets, Peter the Great's manuscripts and the HKR Dataset.

The paper describes the architecture of the neural network and two ways of increasing the volume of training data: an augmentation that simulates strikethrough text (HandWritten Blots) and a new text generation method (StackMix), which proved to be very effective in HTR tasks. StackMix can also be applied to the standalone task of generating handwritten text based on printed text.

Results

Task                                 Dataset        Metric  Value  Model
Optical Character Recognition (OCR)  Saint Gall     CER     3.65   StackMix+Blots
Optical Character Recognition (OCR)  Bentham        CER     1.73   StackMix+Blots
Optical Character Recognition (OCR)  HKR            CER     3.49   StackMix+Blots
Optical Character Recognition (OCR)  Digital Peter  CER     2.5    StackMix+Blots
Optical Character Recognition (OCR)  IAM-D          CER     3.01   StackMix+Blots
Optical Character Recognition (OCR)  IAM-B          CER     3.77   StackMix+Blots
Handwritten Text Recognition         Saint Gall     CER     3.65   StackMix+Blots
Handwritten Text Recognition         Bentham        CER     1.73   StackMix+Blots
Handwritten Text Recognition         HKR            CER     3.49   StackMix+Blots
Handwritten Text Recognition         Digital Peter  CER     2.5    StackMix+Blots
Handwritten Text Recognition         IAM-D          CER     3.01   StackMix+Blots
Handwritten Text Recognition         IAM-B          CER     3.77   StackMix+Blots
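
The results above are reported as CER (character error rate). As a rough illustration of how such a number is commonly computed, the sketch below divides the total character-level edit distance by the total number of reference characters. It is a minimal sketch, not the authors' evaluation code: the function names `levenshtein` and `cer` are hypothetical, and both the corpus-level aggregation and the scaling to a percentage are assumptions that this listing does not confirm.

```python
from typing import Sequence


def levenshtein(ref: str, hyp: str) -> int:
    """Character-level edit distance (insertions, deletions, substitutions)."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER in percent (assumed convention, see lead-in)."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(levenshtein(r, h) for r, h in zip(references, hypotheses))
    return 100.0 * total_edits / total_chars


if __name__ == "__main__":
    # One substituted character out of five reference characters.
    print(cer(["hello"], ["hallo"]))
```

Under this convention, lower values are better, and a system that makes one character error per hundred reference characters would score 1.0.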
