Similarity Detection Pipeline for Crawling a Topic Related Fake News Corpus

09/28/2020
by   Inna Vogel, et al.
0

Fake news detection is a challenging task aiming to reduce human time and effort to check the truthfulness of news. Automated approaches to combat fake news, however, are limited by the lack of labeled benchmark datasets, especially in languages other than English. Moreover, many publicly available corpora have specific limitations that make them difficult to use. To address this problem, our contribution is threefold. First, we propose a new, publicly available German topic related corpus for fake news detection. To the best of our knowledge, this is the first corpus of its kind. In this regard, we developed a pipeline for crawling similar news articles. As our third contribution, we conduct different learning experiments to detect fake news. The best performance was achieved using sentence level embeddings from SBERT in combination with a Bi-LSTM (k=0.88).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/01/2017

"Liar, Liar Pants on Fire": A New Benchmark Dataset for Fake News Detection

Automatic fake news detection is a challenging problem in deception dete...
research
09/20/2021

Transforming Fake News: Robust Generalisable News Classification Using Transformers

As online news has become increasingly popular and fake news increasingl...
research
08/30/2018

Modeling Empathy and Distress in Reaction to News Stories

Computational detection and understanding of empathy is an important fac...
research
03/18/2022

Fake News Detection Using Majority Voting Technique

Due to the evolution of the Web and social network platforms it becomes ...
research
02/18/2017

A Stylometric Inquiry into Hyperpartisan and Fake News

This paper reports on a writing style analysis of hyperpartisan (i.e., e...
research
10/13/2021

Fake News Detection in Spanish Using Deep Learning Techniques

This paper addresses the problem of fake news detection in Spanish using...
research
01/25/2019

FaceForensics++: Learning to Detect Manipulated Facial Images

The rapid progress in synthetic image generation and manipulation has no...

Please sign up or login with your details

Forgot password? Click here to reset