Artificial Text Detection via Examining the Topology of Attention Maps

09/10/2021
by   Laida Kushnareva, et al.
0

The impressive capabilities of recent generative models to create texts that are challenging to distinguish from the human-written ones can be misused for generating fake news, product reviews, and even abusive content. Despite the prominent performance of existing methods for artificial text detection, they still lack interpretability and robustness towards unseen models. To this end, we propose three novel types of interpretable topological features for this task based on Topological Data Analysis (TDA) which is currently understudied in the field of NLP. We empirically show that the features derived from the BERT model outperform count- and neural-based baselines up to 10% on three common datasets, and tend to be the most robust towards unseen GPT-style generation models as opposed to existing methods. The probing analysis of the features reveals their sensitivity to the surface and syntactic properties. The results demonstrate that TDA is a promising line with respect to NLP tasks, specifically the ones that incorporate surface and structural information.

READ FULL TEXT

page 4

page 9

research
12/10/2022

Artificial Text Detection with Multiple Training Strategies

As the deep learning rapidly promote, the artificial texts created by ge...
research
11/02/2020

Automatic Detection of Machine Generated Text: A Critical Survey

Text generative models (TGMs) excel in producing text that matches the s...
research
08/14/2020

Graph-based Modeling of Online Communities for Fake News Detection

Over the past few years, there has been substantial effort towards autom...
research
04/19/2023

TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection

Fake news detection aims to detect fake news widely spreading on social ...
research
09/04/2022

Interpretable Fake News Detection with Topic and Deep Variational Models

The growing societal dependence on social media and user generated conte...
research
11/30/2022

Topological Data Analysis for Speech Processing

We apply topological data analysis (TDA) to speech classification proble...

Please sign up or login with your details

Forgot password? Click here to reset