Automatic Detection of Machine Generated Text: A Critical Survey

11/02/2020
by   Ganesh Jawahar, et al.
0

Text generative models (TGMs) excel in producing text that matches the style of human language reasonably well. Such TGMs can be misused by adversaries, e.g., by automatically generating fake news and fake product reviews that can look authentic and fool humans. Detectors that can distinguish text generated by TGM from human written text play a vital role in mitigating such misuse of TGMs. Recently, there has been a flurry of works from both natural language processing (NLP) and machine learning (ML) communities to build accurate detectors for English. Despite the importance of this problem, there is currently no work that surveys this fast-growing literature and introduces newcomers to important research challenges. In this work, we fill this void by providing a critical survey and review of this literature to facilitate a comprehensive understanding of this problem. We conduct an in-depth error analysis of the state-of-the-art detector and discuss research directions to guide future work in this exciting area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/02/2018

A Survey on Natural Language Processing for Fake News Detection

Fake news detection is a critical yet challenging problem in Natural Lan...
research
10/13/2022

Machine Generated Text: A Comprehensive Survey of Threat Models and Detection Methods

Advances in natural language generation (NLG) have resulted in machine g...
research
08/11/2022

A Comprehensive Survey of Natural Language Generation Advances from the Perspective of Digital Deception

In recent years there has been substantial growth in the capabilities of...
research
09/10/2021

Artificial Text Detection via Examining the Topology of Attention Maps

The impressive capabilities of recent generative models to create texts ...
research
05/23/2023

Out-of-Distribution Generalization in Text Classification: Past, Present, and Future

Machine learning (ML) systems in natural language processing (NLP) face ...
research
05/17/2023

Smaller Language Models are Better Black-box Machine-Generated Text Detectors

With the advent of fluent generative language models that can produce co...
research
07/25/2020

Constructing a Testbed for Psychometric Natural Language Processing

Psychometric measures of ability, attitudes, perceptions, and beliefs ar...

Please sign up or login with your details

Forgot password? Click here to reset