Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption

10/05/2022
by Garam Lee, et al.

Embeddings, which compress the information in raw text into semantics-preserving low-dimensional vectors, have been widely adopted for their efficacy. However, recent research has shown that embeddings can leak private information about sensitive attributes of the text and, in some cases, can be inverted to recover the original input text. To address these growing privacy challenges, we propose a privatization mechanism for embeddings based on homomorphic encryption, which prevents leakage of any piece of information during text classification. In particular, our method performs text classification on encrypted embeddings from state-of-the-art models like BERT, supported by an efficient GPU implementation of the CKKS encryption scheme. We show that our method keeps BERT embeddings protected under encryption while largely preserving their utility on downstream text classification tasks.
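To make the idea of classifying an encrypted embedding concrete, here is a minimal plaintext simulation, not the authors' implementation. It assumes a linear classifier whose weights are held in the clear by the server, and it uses a hypothetical `MockCiphertext` class that performs no encryption at all. The class only restricts the server to the operations a CKKS-style scheme supports (slot-wise addition and multiplication, plus a slot sum that real schemes build from rotations). All names, shapes, and values are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class MockCiphertext:
    """Hypothetical stand-in for an encrypted vector.

    It offers NO security. It only limits the caller to the arithmetic
    that a CKKS-style homomorphic scheme makes available.
    """
    _slots: Tuple[float, ...]

    def add_plain(self, other: Sequence[float]) -> "MockCiphertext":
        # Slot-wise addition with a plaintext vector.
        return MockCiphertext(tuple(a + b for a, b in zip(self._slots, other)))

    def mul_plain(self, other: Sequence[float]) -> "MockCiphertext":
        # Slot-wise multiplication with a plaintext vector.
        return MockCiphertext(tuple(a * b for a, b in zip(self._slots, other)))

    def sum_slots(self) -> "MockCiphertext":
        # Real schemes realise this with rotations and additions.
        return MockCiphertext((sum(self._slots),))


def encrypt(values: Sequence[float]) -> MockCiphertext:
    # Client side: in a real system this would use the client's key.
    return MockCiphertext(tuple(values))


def decrypt(ct: MockCiphertext) -> List[float]:
    # Client side: in a real system this would need the secret key.
    return list(ct._slots)


def encrypted_linear_scores(
    ct: MockCiphertext,
    weights: Sequence[Sequence[float]],
    biases: Sequence[float],
) -> List[MockCiphertext]:
    # Server side: one dot product per class, using only add and multiply.
    return [
        ct.mul_plain(w).sum_slots().add_plain([b])
        for w, b in zip(weights, biases)
    ]


def classify(
    embedding: Sequence[float],
    weights: Sequence[Sequence[float]],
    biases: Sequence[float],
) -> Tuple[int, List[float]]:
    ct = encrypt(embedding)                                    # client
    enc_scores = encrypted_linear_scores(ct, weights, biases)  # server
    scores = [decrypt(s)[0] for s in enc_scores]               # client
    label = max(range(len(scores)), key=scores.__getitem__)
    return label, scores


# Toy 3-dimensional "embedding" and a 2-class linear model (made-up values).
label, scores = classify(
    embedding=[1.0, 2.0, 0.5],
    weights=[[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]],
    biases=[0.0, -0.5],
)
```

The argmax is taken by the client after decryption because comparisons are not native operations in CKKS. A real deployment would replace `MockCiphertext` with ciphertexts from an actual CKKS library, and results would then be approximate rather than exact.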


research
08/19/2019

PrivFT: Private and Fast Text Classification with Homomorphic Encryption

Privacy and security have increasingly become a concern for computing se...
research
03/31/2020

Information Leakage in Embedding Models

Embeddings are functions that map raw input data to low-dimensional vect...
research
06/19/2018

Private Text Classification

Confidential text corpora exist in many forms, but do not allow arbitrar...
research
05/08/2020

Comparative Analysis of Text Classification Approaches in Electronic Health Records

Text classification tasks which aim at harvesting and/or organizing info...
research
06/24/2021

Evaluation of Representation Models for Text Classification with AutoML Tools

Automated Machine Learning (AutoML) has gained increasing success on tab...
research
06/05/2019

Privacy-Preserving Classification of Personal Text Messages with Secure Multi-Party Computation: An Application to Hate-Speech Detection

Classification of personal text messages has many useful applications in...
research
01/18/2021

Fast Privacy-Preserving Text Classification based on Secure Multiparty Computation

We propose a privacy-preserving Naive Bayes classifier and apply it to t...
