"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection

William Yang Wang

2017-05-01ACL 2017 7Fact Checking Fake News Detection Deception Detection

Paper PDF Code Code Code Code Code Code Code Code Code Code Code

Abstract

Automatic fake news detection is a challenging problem in deception detection, and it has tremendous real-world political and social impacts. However, statistical approaches to combating fake news has been dramatically limited by the lack of labeled benchmark datasets. In this paper, we present liar: a new, publicly available dataset for fake news detection. We collected a decade-long, 12.8K manually labeled short statements in various contexts from PolitiFact.com, which provides detailed analysis report and links to source documents for each case. This dataset can be used for fact-checking research as well. Notably, this new dataset is an order of magnitude larger than previously largest public fake news datasets of similar type. Empirically, we investigate automatic fake news detection based on surface-level linguistic patterns. We have designed a novel, hybrid convolutional neural network to integrate meta-data with text. We show that this hybrid approach can improve a text-only deep learning model.

Results

Task	Dataset	Metric	Value	Model
Fake News Detection	LIAR	Test Accuracy	0.274	Hybrid CNNs (Text + All)
Fake News Detection	LIAR	Validation Accuracy	0.247	Hybrid CNNs (Text + All)
Fake News Detection	LIAR	Test Accuracy	0.27	CNNs
Fake News Detection	LIAR	Validation Accuracy	0.26	CNNs
Fake News Detection	LIAR	Test Accuracy	0.248	Hybrid CNNs (Text + Speaker)
Fake News Detection	LIAR	Validation Accuracy	0.277	Hybrid CNNs (Text + Speaker)
Fake News Detection	LIAR	Test Accuracy	0.233	Bi-LSTMs
Fake News Detection	LIAR	Validation Accuracy	0.223	Bi-LSTMs

Related Papers

PiMRef: Detecting and Explaining Ever-evolving Spear Phishing Emails with Knowledge Base Invariants2025-07-21 DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15 KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection2025-07-13 DS@GT at CheckThat! 2025: Evaluating Context and Tokenization Strategies for Numerical Fact Verification2025-07-08 Recon, Answer, Verify: Agents in Search of Truth2025-07-04 Deception Detection in Dyadic Exchanges Using Multimodal Machine Learning: A Study on a Swedish Cohort2025-06-26 The Next Phase of Scientific Fact-Checking: Advanced Evidence Retrieval from Complex Structured Academic Papers2025-06-25 Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicine2025-06-25