An Adversarial Benchmark for Fake News Detection Models

01/03/2022
by Lorenzo Jaime Yu Flores, et al.

With the proliferation of online misinformation, fake news detection has gained importance in the artificial intelligence community. In this paper, we propose an adversarial benchmark that tests the ability of fake news detectors to reason about real-world facts. We formulate adversarial attacks that target three aspects of "understanding": compositional semantics, lexical relations, and sensitivity to modifiers. We test our benchmark using BERT classifiers fine-tuned on the LIAR (arXiv:1705.00648) and Kaggle Fake-News datasets, and show that both models fail to respond to changes in compositional and lexical meaning. Our results reinforce the need for such models to be used in conjunction with other fact-checking methods.
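
To make the three attack categories concrete, here is a minimal Python sketch of what such perturbations might look like. It is not the authors' implementation: the word lists, the function names (`negate`, `swap_antonym`, `add_modifier`, `flip_rate`), and the regex-based rewriting are all assumptions chosen for illustration, and `classify` stands in for any text classifier (for example, a fine-tuned BERT model wrapped in a function).

```python
import re
from typing import Callable, List

# Hypothetical lookup tables, chosen only for illustration.
AUXILIARIES = r"\b(is|was|will|has)\b"
ANTONYMS = {"rose": "fell", "increase": "decrease", "supports": "opposes"}


def negate(text: str) -> str:
    """Compositional change: add 'not' after the first auxiliary in the text."""
    return re.sub(AUXILIARIES, lambda m: m.group(0) + " not", text, count=1)


def swap_antonym(text: str) -> str:
    """Lexical change: replace the first listed word with its antonym."""
    pattern = r"\b(" + "|".join(ANTONYMS) + r")\b"
    return re.sub(pattern, lambda m: ANTONYMS[m.group(0)], text, count=1)


def add_modifier(text: str, modifier: str = "allegedly") -> str:
    """Modifier change: insert a hedging adverb after the first auxiliary."""
    return re.sub(AUXILIARIES, lambda m: f"{m.group(0)} {modifier}", text, count=1)


def flip_rate(
    classify: Callable[[str], str],
    texts: List[str],
    attack: Callable[[str], str],
) -> float:
    """Fraction of successfully perturbed texts whose predicted label changes."""
    # Only count texts the attack actually modified.
    perturbed = [(t, attack(t)) for t in texts if attack(t) != t]
    if not perturbed:
        return 0.0
    flips = sum(classify(orig) != classify(adv) for orig, adv in perturbed)
    return flips / len(perturbed)
```

Under this sketch, a flip rate near zero for a meaning-changing attack such as `negate` would indicate that the classifier's prediction does not track the change in meaning, which is the kind of failure the abstract reports for compositional and lexical changes.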


research
01/05/2019

Fake News Detection via NLP is Vulnerable to Adversarial Attacks

News plays a significant role in shaping people's beliefs and opinions. ...
research
07/16/2021

How Vulnerable Are Automatic Fake News Detection Methods to Adversarial Attacks?

As the spread of false information on the internet has increased dramati...
research
02/01/2023

Exploring Semantic Perturbations on Grover

With news and information being as easy to access as they currently are,...
research
05/15/2022

Evaluating Generalizability of Fine-Tuned Models for Fake News Detection

The Covid-19 pandemic has caused a dramatic and parallel rise in dangero...
research
07/13/2021

Rating Facts under Coarse-to-fine Regimes

The rise of manipulating fake news as a political weapon has become a gl...
research
07/17/2019

Fake News Detection as Natural Language Inference

This report describes the entry by the Intelligent Knowledge Management ...
research
12/13/2022

AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs

Machine Learning-as-a-Service systems (MLaaS) have been largely develope...
