TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Unsupervised Deep Structured Semantic Models for Commonsen...

Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

Shuohang Wang, Sheng Zhang, Yelong Shen, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Jing Jiang

2019-04-03NAACL 2019 6Coreference ResolutionCommon Sense ReasoningNatural Language Understanding
PaperPDF

Abstract

Commonsense reasoning is fundamental to natural language understanding. While traditional methods rely heavily on human-crafted features and knowledge bases, we explore learning commonsense knowledge from a large amount of raw text via unsupervised learning. We propose two neural network models based on the Deep Structured Semantic Models (DSSM) framework to tackle two classic commonsense reasoning tasks, Winograd Schema challenges (WSC) and Pronoun Disambiguation (PDP). Evaluation shows that the proposed models effectively capture contextual information in the sentence and co-reference information between pronouns and nouns, and achieve significant improvement over previous state-of-the-art approaches.

Results

TaskDatasetMetricValueModel
Coreference ResolutionWinograd Schema ChallengeAccuracy63DSSM
Coreference ResolutionWinograd Schema ChallengeAccuracy62.4UDSSM-II (ensemble)
Coreference ResolutionWinograd Schema ChallengeAccuracy59.2UDSSM-II
Coreference ResolutionWinograd Schema ChallengeAccuracy57.1UDSSM-I (ensemble)
Coreference ResolutionWinograd Schema ChallengeAccuracy54.5UDSSM-I
Natural Language UnderstandingPDP60Accuracy78.3UDSSM-II (ensemble)
Natural Language UnderstandingPDP60Accuracy76.7UDSSM-I (ensemble)
Natural Language UnderstandingPDP60Accuracy75DSSM
Natural Language UnderstandingPDP60Accuracy75UDSSM-II

Related Papers

Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes2025-07-17Vision Language Action Models in Robotic Manipulation: A Systematic Review2025-07-14LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization2025-07-06A Survey on Vision-Language-Action Models for Autonomous Driving2025-06-30State and Memory is All You Need for Robust and Reliable AI Agents2025-06-30skLEP: A Slovak General Language Understanding Benchmark2025-06-26SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models2025-06-25Semantic similarity estimation for domain specific data using BERT and other techniques2025-06-23