TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Single-/Multi-Source Cross-Lingual NER via Teacher-Student...

Single-/Multi-Source Cross-Lingual NER via Teacher-Student Learning on Unlabeled Data in Target Language

Qianhui Wu, Zijia Lin, Börje F. Karlsson, Jian-Guang Lou, Biqing Huang

2020-04-26ACL 2020 6named-entity-recognitionCross-Lingual TransferNamed Entity RecognitionNERCross-Lingual NERNamed Entity Recognition (NER)
PaperPDFCode(official)

Abstract

To better tackle the named entity recognition (NER) problem on languages with little/no labeled data, cross-lingual NER must effectively leverage knowledge learned from source languages with rich labeled data. Previous works on cross-lingual NER are mostly based on label projection with pairwise texts or direct model transfer. However, such methods either are not applicable if the labeled data in the source languages is unavailable, or do not leverage information contained in unlabeled data in the target language. In this paper, we propose a teacher-student learning method to address such limitations, where NER models in the source languages are used as teachers to train a student model on unlabeled data in the target language. The proposed method works for both single-source and multi-source cross-lingual NER. For the latter, we further propose a similarity measuring method to better weight the supervision from different teacher models. Extensive experiments for 3 target languages on benchmark datasets well demonstrate that our method outperforms existing state-of-the-art methods for both single-source and multi-source cross-lingual NER.

Results

TaskDatasetMetricValueModel
Cross-LingualCoNLL DutchF181.33SMTS Multi sim
Cross-LingualCoNLL DutchF180.89SMTS Single
Cross-LingualCoNLL DutchF180.7SMTS Multi avg
Cross-LingualCoNLL GermanF175.33SMTS Multi sim
Cross-LingualCoNLL GermanF174.97SMTS Multi avg
Cross-LingualCoNLL GermanF173.22SMTS Single
Cross-LingualCoNLL SpanishF178SMTS Multi sim
Cross-LingualCoNLL SpanishF177.75SMTS Multi avg
Cross-LingualCoNLL SpanishF176.94SMTS Single
Cross-Lingual TransferCoNLL DutchF181.33SMTS Multi sim
Cross-Lingual TransferCoNLL DutchF180.89SMTS Single
Cross-Lingual TransferCoNLL DutchF180.7SMTS Multi avg
Cross-Lingual TransferCoNLL GermanF175.33SMTS Multi sim
Cross-Lingual TransferCoNLL GermanF174.97SMTS Multi avg
Cross-Lingual TransferCoNLL GermanF173.22SMTS Single
Cross-Lingual TransferCoNLL SpanishF178SMTS Multi sim
Cross-Lingual TransferCoNLL SpanishF177.75SMTS Multi avg
Cross-Lingual TransferCoNLL SpanishF176.94SMTS Single

Related Papers

Enhancing Cross-task Transfer of Large Language Models via Activation Steering2025-07-17HanjaBridge: Resolving Semantic Ambiguity in Korean LLMs via Hanja-Augmented Pre-Training2025-07-15Flippi: End To End GenAI Assistant for E-Commerce2025-07-08Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28Improving Named Entity Transcription with Contextual LLM-based Revision2025-06-12Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering2025-06-05Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective2025-06-05Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering2025-06-04