Unsupervised Deep Structured Semantic Models for Commonsense Reasoning

Shuohang Wang, Sheng Zhang, Yelong Shen, Xiaodong Liu, Jingjing Liu, Jianfeng Gao, Jing Jiang

2019-04-03NAACL 2019 6Coreference Resolution Common Sense Reasoning Natural Language Understanding

Abstract

Commonsense reasoning is fundamental to natural language understanding. While traditional methods rely heavily on human-crafted features and knowledge bases, we explore learning commonsense knowledge from a large amount of raw text via unsupervised learning. We propose two neural network models based on the Deep Structured Semantic Models (DSSM) framework to tackle two classic commonsense reasoning tasks, Winograd Schema challenges (WSC) and Pronoun Disambiguation (PDP). Evaluation shows that the proposed models effectively capture contextual information in the sentence and co-reference information between pronouns and nouns, and achieve significant improvement over previous state-of-the-art approaches.

Results

Task	Dataset	Metric	Value	Model
Coreference Resolution	Winograd Schema Challenge	Accuracy	63	DSSM
Coreference Resolution	Winograd Schema Challenge	Accuracy	62.4	UDSSM-II (ensemble)
Coreference Resolution	Winograd Schema Challenge	Accuracy	59.2	UDSSM-II
Coreference Resolution	Winograd Schema Challenge	Accuracy	57.1	UDSSM-I (ensemble)
Coreference Resolution	Winograd Schema Challenge	Accuracy	54.5	UDSSM-I
Natural Language Understanding	PDP60	Accuracy	78.3	UDSSM-II (ensemble)
Natural Language Understanding	PDP60	Accuracy	76.7	UDSSM-I (ensemble)
Natural Language Understanding	PDP60	Accuracy	75	DSSM
Natural Language Understanding	PDP60	Accuracy	75	UDSSM-II

Related Papers

Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes2025-07-17 Vision Language Action Models in Robotic Manipulation: A Systematic Review2025-07-14 LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization2025-07-06 A Survey on Vision-Language-Action Models for Autonomous Driving2025-06-30 State and Memory is All You Need for Robust and Reliable AI Agents2025-06-30 skLEP: A Slovak General Language Understanding Benchmark2025-06-26 SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models2025-06-25 Semantic similarity estimation for domain specific data using BERT and other techniques2025-06-23