TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sense Embedding Learning for Word Sense Induction

Sense Embedding Learning for Word Sense Induction

Linfeng Song, Zhiguo Wang, Haitao Mi, Daniel Gildea

2016-06-17SEMEVAL 2016 8Word Sense Induction
PaperPDF

Abstract

Conventional word sense induction (WSI) methods usually represent each instance with discrete linguistic features or cooccurrence features, and train a model for each polysemous word individually. In this work, we propose to learn sense embeddings for the WSI task. In the training stage, our method induces several sense centroids (embedding) for each polysemous word. In the testing stage, our method represents each instance as a contextual vector, and induces its sense by finding the nearest sense centroid in the embedding space. The advantages of our method are (1) distributed sense vectors are taken as the knowledge representations which are trained discriminatively, and usually have better performance than traditional count-based distributional models, and (2) a general model for the whole vocabulary is jointly trained to induce sense centroids under the mutlitask learning framework. Evaluated on SemEval-2010 WSI dataset, our method outperforms all participants and most of the recent state-of-the-art methods. We further verify the two advantages by comparing with carefully designed baselines.

Results

TaskDatasetMetricValueModel
Word Sense DisambiguationSemEval 2010 WSIAVG23.24SE-WSI-fix
Word Sense DisambiguationSemEval 2010 WSIF-Score55.1SE-WSI-fix
Word Sense DisambiguationSemEval 2010 WSIV-Measure9.8SE-WSI-fix
Word Sense InductionSemEval 2010 WSIAVG23.24SE-WSI-fix
Word Sense InductionSemEval 2010 WSIF-Score55.1SE-WSI-fix
Word Sense InductionSemEval 2010 WSIV-Measure9.8SE-WSI-fix

Related Papers

To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models2024-06-28Multilingual Substitution-based Word Sense Induction2024-05-17The LSCD Benchmark: a Testbed for Diachronic Word Meaning Tasks2024-03-29A Systematic Comparison of Contextualized Word Embeddings for Lexical Semantic Change2024-02-19Word Sense Induction with Knowledge Distillation from BERT2023-04-20Words as Gatekeepers: Measuring Discipline-specific Terms and Meanings in Scholarly Publications2022-12-19Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization2022-10-11RuDSI: graph-based word sense induction dataset for Russian2022-09-28