Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Entity Projection via Machine Translation for Cross-Lingual NER

Alankar Jain, Bhargavi Paranjape, Zachary C. Lipton

2019-08-31 | IJCNLP 2019 11
Tasks: Machine Translation, named-entity-recognition, Named Entity Recognition, Translation, NER, Cross-Lingual NER, Named Entity Recognition (NER)

Abstract

Although over 100 languages are supported by strong off-the-shelf machine translation systems, only a subset of them possess large annotated corpora for named entity recognition. Motivated by this fact, we leverage machine translation to improve annotation-projection approaches to cross-lingual named entity recognition. We propose a system that improves over prior entity-projection methods by: (a) leveraging machine translation systems twice: first for translating sentences and subsequently for translating entities; (b) matching entities based on orthographic and phonetic similarity; and (c) identifying matches based on distributional statistics derived from the dataset. Our approach improves upon current state-of-the-art methods for cross-lingual named entity recognition on 5 diverse languages by an average of 4.1 points. Further, our method achieves state-of-the-art F_1 scores for Armenian, outperforming even a monolingual model trained on Armenian source data.
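To make step (b) of the abstract concrete, here is a minimal sketch of projecting one translated entity onto a target-language sentence by orthographic similarity. It is not the authors' implementation: `difflib.SequenceMatcher` stands in for whatever similarity measure the paper uses, phonetic matching and the distributional statistics of step (c) are omitted, and the function names, the `max_extra_tokens` window, and the `threshold` value are all hypothetical choices made for illustration.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Character-level similarity in [0, 1]; a stand-in for orthographic matching."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def project_entity(translated_entity, target_tokens,
                   max_extra_tokens=1, threshold=0.75):
    """Find the token span in target_tokens that best matches translated_entity.

    Returns ((start, end), score) with an end-exclusive span, or (None, score)
    when the best candidate scores below the (assumed) threshold.
    """
    n = len(translated_entity.split())
    best_span, best_score = None, 0.0
    for start in range(len(target_tokens)):
        # Try spans slightly shorter or longer than the translated entity,
        # since translation can change the number of tokens.
        for length in range(max(1, n - max_extra_tokens), n + max_extra_tokens + 1):
            end = start + length
            if end > len(target_tokens):
                break
            candidate = " ".join(target_tokens[start:end])
            score = similarity(translated_entity, candidate)
            if score > best_score:
                best_span, best_score = (start, end), score
    if best_score >= threshold:
        return best_span, best_score
    return None, best_score


def bio_tags(num_tokens, span, label):
    """Turn a projected span into BIO tags for the target sentence."""
    tags = ["O"] * num_tokens
    if span is not None:
        start, end = span
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


if __name__ == "__main__":
    tokens = ["Angela", "Merkel", "visitó", "Madrid", "ayer", "."]
    span, score = project_entity("Angela Merkel", tokens)
    print(span, score, bio_tags(len(tokens), span, "PER"))
```

In the pipeline the abstract describes, the entity string passed in would itself come from a second machine translation call on the source-language entity; here it is supplied by hand.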

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Cross-Lingual | CoNLL 2003 | Dutch | 69.9 | BiLSTM + CRF |
| Cross-Lingual | CoNLL 2003 | German | 61.5 | BiLSTM + CRF |
| Cross-Lingual | CoNLL 2003 | Spanish | 74.5 | BiLSTM + CRF |
| Cross-Lingual Transfer | CoNLL 2003 | Dutch | 69.9 | BiLSTM + CRF |
| Cross-Lingual Transfer | CoNLL 2003 | German | 61.5 | BiLSTM + CRF |
| Cross-Lingual Transfer | CoNLL 2003 | Spanish | 74.5 | BiLSTM + CRF |
