TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Zero-Resource Cross-Lingual Named Entity Recognition

Zero-Resource Cross-Lingual Named Entity Recognition

M Saiful Bari, Shafiq Joty, Prathyusha Jwalapuram

2019-11-22Low Resource Named Entity Recognitionnamed-entity-recognitionCross-Lingual TransferNamed Entity RecognitionNERCross-Lingual NERNamed Entity Recognition (NER)
PaperPDFCode(official)

Abstract

Recently, neural methods have achieved state-of-the-art (SOTA) results in Named Entity Recognition (NER) tasks for many languages without the need for manually crafted features. However, these models still require manually annotated training data, which is not available for many languages. In this paper, we propose an unsupervised cross-lingual NER model that can transfer NER knowledge from one language to another in a completely unsupervised way without relying on any bilingual dictionary or parallel data. Our model achieves this through word-level adversarial learning and augmented fine-tuning with parameter sharing and feature augmentation. Experiments on five different languages demonstrate the effectiveness of our approach, outperforming existing models by a good margin and setting a new SOTA for each language pair.

Results

TaskDatasetMetricValueModel
Information ExtractionConll 2003 SpanishF1 score75.93Zero-Resource Cross-lingual Transfer From CoNLL-2003 English dataset.
Information ExtractionCONLL 2003 GermanF1 score65.24Zero-Resource Transfer From CoNLL-2003 English dataset.
Information ExtractionCONLL 2003 DutchF1 score74.61Zero-Resource Transfer From CoNLL-2003 English dataset.

Related Papers

Enhancing Cross-task Transfer of Large Language Models via Activation Steering2025-07-17HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training2025-07-15Flippi: End To End GenAI Assistant for E-Commerce2025-07-08Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28Improving Named Entity Transcription with Contextual LLM-based Revision2025-06-12Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering2025-06-05Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective2025-06-05Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering2025-06-04