TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SANDWiCH: Semantical Analysis of Neighbours for Disambigua...

SANDWiCH: Semantical Analysis of Neighbours for Disambiguating Words in Context ad Hoc

Daniel Guzman-Olivares, Lara Quijano-Sanchez, Federico Liberatore

2025-03-07Word Sense Disambiguation
PaperPDFCode(official)

Abstract

The rise of generative chat-based Large Language Models (LLMs) over the past two years has spurred a race to develop systems that promise near-human conversational and reasoning experiences. However, recent studies indicate that the language understanding offered by these models remains limited and far from human-like performance, particularly in grasping the contextual meanings of words, an essential aspect of reasoning. In this paper, we present a simple yet computationally efficient framework for multilingual Word Sense Disambiguation (WSD). Our approach reframes the WSD task as a cluster discrimination analysis over a semantic network refined from BabelNet using group algebra. We validate our methodology across multiple WSD benchmarks, achieving a new state of the art for all languages and tasks, as well as in individual assessments by part of speech. Notably, our model significantly surpasses the performance of current alternatives, even in low-resource languages, while reducing the parameter count by 72%.

Results

TaskDatasetMetricValueModel
Word Sense DisambiguationSupervised:SemEval 200780.9SANDWiCH
Word Sense DisambiguationSupervised:SemEval 201392.6SANDWiCH
Word Sense DisambiguationSupervised:SemEval 201591.5SANDWiCH
Word Sense DisambiguationSupervised:Senseval 287.8SANDWiCH
Word Sense DisambiguationSupervised:Senseval 385.7SANDWiCH

Related Papers

Semantic similarity estimation for domain specific data using BERT and other techniques2025-06-23On Self-improving Token Embeddings2025-04-21GlossGPT: GPT for Word Sense Disambiguation using Few-shot Chain-of-Thought Prompting2025-03-01Probing Semantic Routing in Large Mixture-of-Expert Models2025-02-15TreeMatch: A Fully Unsupervised WSD System Using Dependency Knowledge on a Specific Domain2025-01-05Fietje: An open, efficient LLM for Dutch2024-12-19Word Sense Linking: Disambiguating Outside the Sandbox2024-12-12Can LLMs assist with Ambiguity? A Quantitative Evaluation of various Large Language Models on Word Sense Disambiguation2024-11-27