Text Revealer: Private Text Reconstruction via Model Inversion Attacks against Transformers

09/21/2022
by   Ruisi Zhang, et al.
0

Text classification has become widely used in various natural language processing applications like sentiment analysis. Current applications often use large transformer-based language models to classify input texts. However, there is a lack of systematic study on how much private information can be inverted when publishing models. In this paper, we formulate Text Revealer – the first model inversion attack for text reconstruction against text classification with transformers. Our attacks faithfully reconstruct private texts included in training data with access to the target model. We leverage an external dataset and GPT-2 to generate the target domain-like fluent text, and then perturb its hidden state optimally with the feedback from the target model. Our extensive experiments demonstrate that our attacks are effective for datasets with different text lengths and can reconstruct private texts with accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/21/2021

Ad Text Classification with Transformer-Based Natural Language Processing Methods

In this study, a natural language processing-based (NLP-based) method is...
research
06/23/2023

Deconstructing Classifiers: Towards A Data Reconstruction Attack Against Text Classification Models

Natural language processing (NLP) models have become increasingly popula...
research
01/01/2020

Stacked DeBERT: All Attention in Incomplete Data for Text Classification

In this paper, we propose Stacked DeBERT, short for Stacked Denoising Bi...
research
03/03/2022

Label-Only Model Inversion Attacks via Boundary Repulsion

Recent studies show that the state-of-the-art deep neural networks are v...
research
04/26/2022

A Robust Contrastive Alignment Method For Multi-Domain Text Classification

Multi-domain text classification can automatically classify texts in var...
research
02/21/2019

Deep Short Text Classification with Knowledge Powered Attention

Short text classification is one of important tasks in Natural Language ...
research
06/25/2022

Protoformer: Embedding Prototypes for Transformers

Transformers have been widely applied in text classification. Unfortunat...

Please sign up or login with your details

Forgot password? Click here to reset