"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection

05/01/2017
by   William Yang Wang, et al.
0

Automatic fake news detection is a challenging problem in deception detection, and it has tremendous real-world political and social impacts. However, statistical approaches to combating fake news has been dramatically limited by the lack of labeled benchmark datasets. In this paper, we present liar: a new, publicly available dataset for fake news detection. We collected a decade-long, 12.8K manually labeled short statements in various contexts from PolitiFact.com, which provides detailed analysis report and links to source documents for each case. This dataset can be used for fact-checking research as well. Notably, this new dataset is an order of magnitude larger than previously largest public fake news datasets of similar type. Empirically, we investigate automatic fake news detection based on surface-level linguistic patterns. We have designed a novel, hybrid convolutional neural network to integrate meta-data with text. We show that this hybrid approach can improve a text-only deep learning model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/10/2019

r/Fakeddit: A New Multimodal Benchmark Dataset for Fine-grained Fake News Detection

Fake news has altered society in negative ways as evidenced in politics ...
research
09/28/2020

Similarity Detection Pipeline for Crawling a Topic Related Fake News Corpus

Fake news detection is a challenging task aiming to reduce human time an...
research
07/08/2019

XFake: Explainable Fake News Detector with Visualizations

In this demo paper, we present the XFake system, an explainable fake new...
research
07/13/2021

Rating Facts under Coarse-to-fine Regimes

The rise of manipulating fake news as a political weapon has become a gl...
research
08/20/2019

Sarcasm Detection using Hybrid Neural Network

Sarcasm Detection has enjoyed great interest from the research community...
research
02/03/2022

Unified Fake News Detection using Transfer Learning of Bidirectional Encoder Representation from Transformers model

Automatic detection of fake news is needed for the public as the accessi...
research
12/13/2022

FNDaaS: Content-agnostic Detection of Fake News sites

Automatic fake news detection is a challenging problem in misinformation...

Please sign up or login with your details

Forgot password? Click here to reset