Wenlin Yao, Xiaoman Pan, Lifeng Jin, Jianshu Chen, Dian Yu, Dong Yu
Word Sense Disambiguation (WSD) aims to automatically identify the exact meaning of one word according to its context. Existing supervised models struggle to make correct predictions on rare word senses due to limited training data and can only select the best definition sentence from one predefined word sense inventory (e.g., WordNet). To address the data sparsity problem and generalize the model to be independent of one predefined inventory, we propose a gloss alignment algorithm that can align definition sentences (glosses) with the same meaning from different sense inventories to collect rich lexical knowledge. We then train a model to identify semantic equivalence between a target word in context and one of its glosses using these aligned inventories, which exhibits strong transfer capability to many WSD tasks. Experiments on benchmark datasets show that the proposed method improves predictions on both frequent and rare word senses, outperforming prior work by 1.2% on the All-Words WSD Task and 4.3% on the Low-Shot WSD Task. Evaluation on WiC Task also indicates that our method can better capture word meanings in context.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Word Sense Disambiguation | FEWS | F1 (Fewshot Test) | 80.1 | SEMeq base |
| Word Sense Disambiguation | FEWS | F1 (Zero shot test) | 70.2 | SEMeq base |
| Word Sense Disambiguation | FEWS | F1 (Zeroshot Dev) | 71.5 | SEMeq base |
| Word Sense Disambiguation | FEWS | F1(FewShot Dev) | 80.4 | SEMeq base |
| Word Sense Disambiguation | FEWS | F1 (Fewshot Test) | 82.3 | SEMEq Large |
| Word Sense Disambiguation | FEWS | F1 (Zero shot test) | 72.2 | SEMEq Large |
| Word Sense Disambiguation | FEWS | F1 (Zeroshot Dev) | 73.7 | SEMEq Large |
| Word Sense Disambiguation | FEWS | F1(FewShot Dev) | 81.8 | SEMEq Large |