TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CoMER: Modeling Coverage for Transformer-based Handwritten...

CoMER: Modeling Coverage for Transformer-based Handwritten Mathematical Expression Recognition

Wenqi Zhao, Liangcai Gao

2022-07-10Handwritten Mathmatical Expression Recognition
PaperPDFCodeCode(official)

Abstract

The Transformer-based encoder-decoder architecture has recently made significant advances in recognizing handwritten mathematical expressions. However, the transformer model still suffers from the lack of coverage problem, making its expression recognition rate (ExpRate) inferior to its RNN counterpart. Coverage information, which records the alignment information of the past steps, has proven effective in the RNN models. In this paper, we propose CoMER, a model that adopts the coverage information in the transformer decoder. Specifically, we propose a novel Attention Refinement Module (ARM) to refine the attention weights with past alignment information without hurting its parallelism. Furthermore, we take coverage information to the extreme by proposing self-coverage and cross-coverage, which utilize the past alignment information from the current and previous layers. Experiments show that CoMER improves the ExpRate by 0.61%/2.09%/1.59% compared to the current state-of-the-art model, and reaches 59.33%/59.81%/62.97% on the CROHME 2014/2016/2019 test sets.

Results

TaskDatasetMetricValueModel
Handwritten Mathmatical Expression RecognitionCROHME 2016ExpRate56.98CoMER
Handwritten Mathmatical Expression RecognitionHME100KExpRate68.12CoMER
Handwritten Mathmatical Expression RecognitionCROHME 2019ExpRate59.12CoMER
Handwritten Mathmatical Expression RecognitionCROHME 2014ExpRate58.38CoMER

Related Papers

Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition2025-05-29TAMER: Tree-Aware Transformer for Handwritten Mathematical Expression Recognition2024-08-16NAMER: Non-Autoregressive Modeling for Handwritten Mathematical Expression Recognition2024-07-16PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer2024-07-10ICAL: Implicit Character-Aided Learning for Enhanced Handwritten Mathematical Expression Recognition2024-05-15When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition2022-07-23TDv2: A Novel Tree-Structured Decoder for Offline Mathematical Expression Recognition2022-06-280/1 Deep Neural Networks via Block Coordinate Descent2022-06-19