TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Word Sense Disambiguation/Words in Context

Word Sense Disambiguation on Words in Context

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1COSINE + Transductive Learning85.3NoFine-Tuning Pre-trained Language Model with Weak...2020-10-15Code
2PaLM 540B (finetuned) 78.8NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code
3ST-MoE-32B 269B (fine-tuned)77.7NoST-MoE: Designing Stable and Transferable Sparse...2022-02-17Code
4DeBERTa-Ensemble77.5NoDeBERTa: Decoding-enhanced BERT with Disentangle...2020-06-05Code
5Vega v2 6B (fine-tuned)77.4NoToward Efficient Language Model Pretraining and ...2022-12-04-
6UL2 20B (fine-tuned)77.3NoUL2: Unifying Language Learning Paradigms2022-05-10Code
7Turing NLR v5 XXL 5.4B (fine-tuned)77.1NoToward Efficient Language Model Pretraining and ...2022-12-04-
8T5-XXL 11B76.9NoExploring the Limits of Transfer Learning with a...2019-10-23Code
9DeBERTa-1.5B76.4NoDeBERTa: Decoding-enhanced BERT with Disentangle...2020-06-05Code
10ST-MoE-L 4.1B (fine-tuned)74NoST-MoE: Designing Stable and Transferable Sparse...2022-02-17Code
11SenseBERT-large 340M72.1NoSenseBERT: Driving Some Sense into BERT2019-08-15-
12SenseBERT-base 110M70.3NoSenseBERT: Driving Some Sense into BERT2019-08-15-
13PaLM 2-L (one-shot)66.8NoPaLM 2 Technical Report2023-05-17Code
14BERT-large 340M65.5NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
15FLAN-T5-Large 783M64.7NoLaMini-LM: A Diverse Herd of Distilled Models fr...2023-04-27Code
16LaMini-F-T5 783M63.8NoLaMini-LM: A Diverse Herd of Distilled Models fr...2023-04-27Code
17Context2vec59.3NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
18DeConf58.7NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
19SW2V58.1NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
20ElMo57.7NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
21T0-3B (CoT fine-tuned)56.7NoThe CoT Collection: Improving Zero-shot and Few-...2023-05-23Code
22N-Grammer 343M56.1NoN-Grammer: Augmenting Transformers with latent n...2022-07-13Code
23AlexaTM 20B53.3NoAlexaTM 20B: Few-Shot Learning Using a Large-Sca...2022-08-02Code
24Sentence LSTM53.1NoWiC: the Word-in-Context Dataset for Evaluating ...2018-08-28-
25RoE-3B52.97NoExploring the Benefits of Training Expert Langua...2023-02-07Code
26LaMini-GPT 1.5B52.4NoLaMini-LM: A Diverse Herd of Distilled Models fr...2023-04-27Code
27KiC-770M52.4NoKnowledge-in-Context: Towards Knowledgeable Semi...2022-10-28-
28PaLM 2-M (one-shot)52NoPaLM 2 Technical Report2023-05-17Code
29Hybrid H3 125M (0-shot, logit scoring)51.4NoHungry Hungry Hippos: Towards Language Modeling ...2022-12-28Code
30Hybrid H3 125M (0-shot, rank classification)51.4NoHungry Hungry Hippos: Towards Language Modeling ...2022-12-28Code
31PaLM 2-S (one-shot)50.6NoPaLM 2 Technical Report2023-05-17Code
32LaMini-T5 738M50.5NoLaMini-LM: A Diverse Herd of Distilled Models fr...2023-04-27Code
33Flipped-3B50.42NoGuess the Instruction! Flipped Learning Makes La...2022-10-06Code
34GPT-2-XL 1.5B49.8NoLaMini-LM: A Diverse Herd of Distilled Models fr...2023-04-27Code
35UL2 20B (0-shot)49.8NoUL2: Unifying Language Learning Paradigms2022-05-10Code
36GPT-3 175B (few-shot, k=32)49.4NoLanguage Models are Few-Shot Learners2020-05-28Code
37Hybrid H3 125M (3-shot, logit scoring)49.1NoHungry Hungry Hippos: Towards Language Modeling ...2022-12-28Code