DeepTextMark: Deep Learning based Text Watermarking for Detection of Large Language Model Generated Text

05/09/2023
by   Travis Munyer, et al.
0

The capabilities of text generators have grown with the rapid development of Large Language Models (LLM). To prevent potential misuse, the ability to detect whether texts are produced by LLM has become increasingly important. Several related works have attempted to solve this problem using binary classifiers that categorize input text as human-written or LLM-generated. However, these classifiers have been shown to be unreliable. As impactful decisions could be made based on the result of the classification, the text source detection needs to be high-quality. To this end, this paper presents DeepTextMark, a deep learning-based text watermarking method for text source detection. Applying Word2Vec and Sentence Encoding for watermark insertion and a transformer-based classifier for watermark detection, DeepTextMark achieves blindness, robustness, imperceptibility, and reliability simultaneously. As discussed further in the paper, these traits are indispensable for generic text source detection, and the application focus of this paper is on the text generated by LLM. DeepTextMark can be implemented as an "add-on" to existing text generation systems. That is, the method does not require access or modification to the text generation technique. Experiments have shown high imperceptibility, high detection accuracy, enhanced robustness, reliability, and fast running speed of DeepTextMark.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/15/2022

Classifiers are Better Experts for Controllable Text Generation

This paper proposes a simple method for controllable text generation bas...
research
06/30/2023

Provable Robust Watermarking for AI-Generated Text

As AI-generated text increasingly resembles human-written content, the a...
research
06/10/2019

GLTR: Statistical Detection and Visualization of Generated Text

The rapid improvement of language models has raised the specter of abuse...
research
07/29/2023

Towards Codable Text Watermarking for Large Language Models

As large language models (LLMs) generate texts with increasing fluency a...
research
10/22/2019

Automatic Extraction of Personality from Text: Challenges and Opportunities

In this study, we examined the possibility to extract personality traits...
research
10/26/2020

Dutch Humor Detection by Generating Negative Examples

Detecting if a text is humorous is a hard task to do computationally, as...
research
06/16/2022

DIALOG-22 RuATD Generated Text Detection

Text Generation Models (TGMs) succeed in creating text that matches huma...

Please sign up or login with your details

Forgot password? Click here to reset