TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Attention Is (not) All You Need for Commonsense Reasoning

Attention Is (not) All You Need for Commonsense Reasoning

Tassilo Klein, Moin Nabi

2019-05-31ACL 2019 7Coreference ResolutionNatural Language UnderstandingAll
PaperPDFCodeCode

Abstract

The recently introduced BERT model exhibits strong performance on several language understanding benchmarks. In this paper, we describe a simple re-implementation of BERT for commonsense reasoning. We show that the attentions produced by BERT can be directly utilized for tasks such as the Pronoun Disambiguation Problem and Winograd Schema Challenge. Our proposed attention-guided commonsense reasoning method is conceptually simple yet empirically powerful. Experimental analysis on multiple datasets demonstrates that our proposed system performs remarkably well on all cases while outperforming the previously reported state of the art by a margin. While results suggest that BERT seems to implicitly learn to establish complex relationships between entities, solving commonsense reasoning tasks might require more than unsupervised models learned from huge text corpora.

Results

TaskDatasetMetricValueModel
Coreference ResolutionWinograd Schema ChallengeAccuracy60.3BERT-base 110M + MAS
Coreference ResolutionWinograd Schema ChallengeAccuracy52.8USSM + Supervised DeepNet + KB
Coreference ResolutionWinograd Schema ChallengeAccuracy52USSM + KB
Natural Language UnderstandingPDP60Accuracy68.3BERT-base 110M + MAS
Natural Language UnderstandingPDP60Accuracy66.7USSM + Supervised Deepnet + 3 Knowledge Bases
Natural Language UnderstandingPDP60Accuracy53.3USSM + Supervised Deepnet

Related Papers

Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15Vision Language Action Models in Robotic Manipulation: A Systematic Review2025-07-14Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT.2025-07-07Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models2025-07-03Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays2025-07-01A Survey on Vision-Language-Action Models for Autonomous Driving2025-06-30