Mitigation of Diachronic Bias in Fake News Detection Dataset

08/28/2021
by   Taichi Murayama, et al.
0

Fake news causes significant damage to society.To deal with these fake news, several studies on building detection models and arranging datasets have been conducted. Most of the fake news datasets depend on a specific time period. Consequently, the detection models trained on such a dataset have difficulty detecting novel fake news generated by political changes and social changes; they may possibly result in biased output from the input, including specific person names and organizational names. We refer to this problem as Diachronic Bias because it is caused by the creation date of news in each dataset. In this study, we confirm the bias, especially proper nouns including person names, from the deviation of phrase appearances in each dataset. Based on these findings, we propose masking methods using Wikidata to mitigate the influence of person names and validate whether they make fake news detection models robust through experiments with in-domain and out-of-domain data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2021

Dataset of Fake News Detection and Fact Verification: A Survey

The rapid increase in fake news, which causes significant damage to soci...
research
05/06/2022

Arabic Fake News Detection Based on Deep Contextualized Embedding Models

Social media is becoming a source of news for many people due to its eas...
research
10/01/2021

Users' ability to perceive misinformation: An information quality assessment approach

Digital information exchange enables quick creation and sharing of infor...
research
09/24/2022

On Gender Bias in Fake News

Data science research into fake news has gathered much momentum in recen...
research
04/20/2022

Generalizing to the Future: Mitigating Entity Bias in Fake News Detection

The wide dissemination of fake news is increasingly threatening both ind...
research
04/12/2021

On Unifying Misinformation Detection

In this paper, we introduce UnifiedM2, a general-purpose misinformation ...
research
09/21/2019

On the Importance of Delexicalization for Fact Verification

In this work we aim to understand and estimate the importance that a neu...

Please sign up or login with your details

Forgot password? Click here to reset