TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fa...

"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection

William Yang Wang

2017-05-01ACL 2017 7Fact CheckingFake News DetectionDeception Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Automatic fake news detection is a challenging problem in deception detection, and it has tremendous real-world political and social impacts. However, statistical approaches to combating fake news has been dramatically limited by the lack of labeled benchmark datasets. In this paper, we present liar: a new, publicly available dataset for fake news detection. We collected a decade-long, 12.8K manually labeled short statements in various contexts from PolitiFact.com, which provides detailed analysis report and links to source documents for each case. This dataset can be used for fact-checking research as well. Notably, this new dataset is an order of magnitude larger than previously largest public fake news datasets of similar type. Empirically, we investigate automatic fake news detection based on surface-level linguistic patterns. We have designed a novel, hybrid convolutional neural network to integrate meta-data with text. We show that this hybrid approach can improve a text-only deep learning model.

Results

TaskDatasetMetricValueModel
Fake News DetectionLIARTest Accuracy0.274Hybrid CNNs (Text + All)
Fake News DetectionLIARValidation Accuracy0.247Hybrid CNNs (Text + All)
Fake News DetectionLIARTest Accuracy0.27CNNs
Fake News DetectionLIARValidation Accuracy0.26CNNs
Fake News DetectionLIARTest Accuracy0.248Hybrid CNNs (Text + Speaker)
Fake News DetectionLIARValidation Accuracy0.277Hybrid CNNs (Text + Speaker)
Fake News DetectionLIARTest Accuracy0.233Bi-LSTMs
Fake News DetectionLIARValidation Accuracy0.223Bi-LSTMs

Related Papers

PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants2025-07-21DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection2025-07-13DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification2025-07-08Recon, Answer, Verify: Agents in Search of Truth2025-07-04Deception Detection in Dyadic Exchanges Using Multimodal Machine Learning: A Study on a Swedish Cohort2025-06-26The Next Phase of Scientific Fact-Checking: Advanced Evidence Retrieval from Complex Structured Academic Papers2025-06-25Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine2025-06-25