TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Jointly Learning to Label Sentences and Tokens

Jointly Learning to Label Sentences and Tokens

Marek Rei, Anders Søgaard

2018-11-14Sentence ClassificationGrammatical Error Detection
PaperPDFCodeCode

Abstract

Learning to construct text representations in end-to-end systems can be difficult, as natural languages are highly compositional and task-specific annotated datasets are often limited in size. Methods for directly supervising language composition can allow us to guide the models based on existing knowledge, regularizing them towards more robust and interpretable representations. In this paper, we investigate how objectives at different granularities can be used to learn better language representations and we propose an architecture for jointly learning to label sentences and tokens. The predictions at each level are combined together using an attention mechanism, with token-level labels also acting as explicit supervision for composing sentence-level representations. Our experiments show that by learning to perform these tasks jointly on multiple levels, the model achieves substantial improvements for both sentence classification and sequence labeling.

Results

TaskDatasetMetricValueModel
Grammatical Error CorrectionCoNLL-2014 A1F0.522.14BiLSTM-JOINT (trained on FCE)
Grammatical Error CorrectionCoNLL-2014 A2F0.529.65BiLSTM-JOINT (trained on FCE)
Grammatical Error CorrectionJFLEGF0.552.52BiLSTM-JOINT (trained on FCE)
Grammatical Error CorrectionFCEF0.552.07BiLSTM-JOINT

Related Papers

IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator2025-06-03A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations2025-05-20Detecting Spelling and Grammatical Anomalies in Russian Poetry Texts2025-05-07ARWI: Arabic Write and Improve2025-04-16Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks2025-01-05Consolidating and Developing Benchmarking Datasets for the Nepali Natural Language Understanding Tasks2024-11-28Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models2024-11-27Multi-label Sequential Sentence Classification via Large Language Model2024-11-23