TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CONTaiNER: Few-Shot Named Entity Recognition via Contrasti...

CONTaiNER: Few-Shot Named Entity Recognition via Contrastive Learning

Sarkar Snigdha Sarathi Das, Arzoo Katiyar, Rebecca J. Passonneau, Rui Zhang

2021-09-15ACL 2022 5Few-shot NERNamed Entity RecognitionContrastive LearningNamed Entity Recognition (NER)
PaperPDFCode(official)

Abstract

Named Entity Recognition (NER) in Few-Shot setting is imperative for entity tagging in low resource domains. Existing approaches only learn class-specific semantic features and intermediate representations from source domains. This affects generalizability to unseen target domains, resulting in suboptimal performances. To this end, we present CONTaiNER, a novel contrastive learning technique that optimizes the inter-token distribution distance for Few-Shot NER. Instead of optimizing class-specific attributes, CONTaiNER optimizes a generalized objective of differentiating between token categories based on their Gaussian-distributed embeddings. This effectively alleviates overfitting issues originating from training domains. Our experiments in several traditional test domains (OntoNotes, CoNLL'03, WNUT '17, GUM) and a new large scale Few-Shot NER dataset (Few-NERD) demonstrate that on average, CONTaiNER outperforms previous methods by 3%-13% absolute F1 points while showing consistent performance trends, even in challenging scenarios where previous approaches could not achieve appreciable performance.

Results

TaskDatasetMetricValueModel
Named Entity Recognition (NER)Few-NERD (INTRA)10 way 1~2 shot33.84CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTRA)10 way 5~10 shot47.49CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTRA)5 way 1~2 shot40.43CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTRA)5 way 5~10 shot53.7CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTER)10 way 1~2 shot48.35CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTER)10 way 5~10 shot57.12CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTER)5 way 1~2 shot55.95CONTaiNER
Named Entity Recognition (NER)Few-NERD (INTER)5 way 5~10 shot61.83CONTaiNER

Related Papers

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15Latent Space Consistency for Sparse-View CT Reconstruction2025-07-15Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding2025-07-13