Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy

07/25/2023
by   Yu Fu, et al.
0

To mitigate potential risks associated with language models, recent AI detection research proposes incorporating watermarks into machine-generated text through random vocabulary restrictions and utilizing this information for detection. While these watermarks only induce a slight deterioration in perplexity, our empirical investigation reveals a significant detriment to the performance of conditional text generation. To address this issue, we introduce a simple yet effective semantic-aware watermarking algorithm that considers the characteristics of conditional text generation and the input context. Experimental results demonstrate that our proposed method yields substantial improvements across various text generation models, including BART and Flan-T5, in tasks such as summarization and data-to-text generation while maintaining detection ability.

READ FULL TEXT
research
09/08/2019

c-TextGen: Conditional Text Generation for Harmonious Human-Machine Interaction

In recent years, with the development of deep learning technology, text ...
research
05/25/2022

R2D2: Robust Data-to-Text with Replacement Detection

Unfaithful text generation is a common problem for text generation syste...
research
09/20/2023

Speak While You Think: Streaming Speech Synthesis During Text Generation

Large Language Models (LLMs) demonstrate impressive capabilities, yet in...
research
09/25/2020

Weird AI Yankovic: Generating Parody Lyrics

Lyrics parody swaps one set of words that accompany a melody with a new ...
research
05/23/2023

Enhancing Generation through Summarization Duality and Explicit Outline Control

Automatically open-ended long text generation poses significant challeng...
research
08/15/2023

Teach LLMs to Personalize – An Approach inspired by Writing Education

Personalized text generation is an emerging research area that has attra...
research
09/13/2023

Cognitive Mirage: A Review of Hallucinations in Large Language Models

As large language models continue to develop in the field of AI, text ge...

Please sign up or login with your details

Forgot password? Click here to reset