TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Efficient and Interpretable Grammatical Error Correction w...

Efficient and Interpretable Grammatical Error Correction with Mixture of Experts

Muhammad Reza Qorib, Alham Fikri Aji, Hwee Tou Ng

2024-10-30Grammatical Error CorrectionRe-Ranking
PaperPDFCode(official)

Abstract

Error type information has been widely used to improve the performance of grammatical error correction (GEC) models, whether for generating corrections, re-ranking them, or combining GEC models. Combining GEC models that have complementary strengths in correcting different error types is very effective in producing better corrections. However, system combination incurs a high computational cost due to the need to run inference on the base systems before running the combination method itself. Therefore, it would be more efficient to have a single model with multiple sub-networks that specialize in correcting different error types. In this paper, we propose a mixture-of-experts model, MoECE, for grammatical error correction. Our model successfully achieves the performance of T5-XL with three times fewer effective parameters. Additionally, our model produces interpretable corrections by also identifying the error type during inference.

Results

TaskDatasetMetricValueModel
Grammatical Error CorrectionCoNLL-2014 Shared TaskF0.567.79MoECE
Grammatical Error CorrectionCoNLL-2014 Shared TaskPrecision74.29MoECE
Grammatical Error CorrectionCoNLL-2014 Shared TaskRecall50.21MoECE
Grammatical Error CorrectionBEA-2019 (test)F0.574.07MoECE

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17CATVis: Context-Aware Thought Visualization2025-07-15SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification2025-06-26RAG-VisualRec: An Open Resource for Vision- and Text-Enhanced Retrieval-Augmented Generation in Recommendation2025-06-25IRanker: Towards Ranking Foundation Model2025-06-25End-to-End Spoken Grammatical Error Correction2025-06-23RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge2025-06-17