Zero-Resource Cross-Lingual Named Entity Recognition

M Saiful Bari, Shafiq Joty, Prathyusha Jwalapuram

2019-11-22Low Resource Named Entity Recognition named-entity-recognition Cross-Lingual Transfer Named Entity Recognition NER Cross-Lingual NER Named Entity Recognition (NER)

Paper PDF Code(official)

Abstract

Recently, neural methods have achieved state-of-the-art (SOTA) results in Named Entity Recognition (NER) tasks for many languages without the need for manually crafted features. However, these models still require manually annotated training data, which is not available for many languages. In this paper, we propose an unsupervised cross-lingual NER model that can transfer NER knowledge from one language to another in a completely unsupervised way without relying on any bilingual dictionary or parallel data. Our model achieves this through word-level adversarial learning and augmented fine-tuning with parameter sharing and feature augmentation. Experiments on five different languages demonstrate the effectiveness of our approach, outperforming existing models by a good margin and setting a new SOTA for each language pair.

Results

Task	Dataset	Metric	Value	Model
Information Extraction	Conll 2003 Spanish	F1 score	75.93	Zero-Resource Cross-lingual Transfer From CoNLL-2003 English dataset.
Information Extraction	CONLL 2003 German	F1 score	65.24	Zero-Resource Transfer From CoNLL-2003 English dataset.
Information Extraction	CONLL 2003 Dutch	F1 score	74.61	Zero-Resource Transfer From CoNLL-2003 English dataset.

Related Papers

Enhancing Cross-task Transfer of Large Language Models via Activation Steering2025-07-17 HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training2025-07-15 Flippi: End To End GenAI Assistant for E-Commerce2025-07-08 Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28 Improving Named Entity Transcription with Contextual LLM-based Revision2025-06-12 Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering2025-06-05 Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective2025-06-05 Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering2025-06-04