Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

10/13/2022
by   Sarina Meyer, et al.
0

In order to protect the privacy of speech data, speaker anonymization aims for hiding the identity of a speaker by changing the voice in speech recordings. This typically comes with a privacy-utility trade-off between protection of individuals and usability of the data for downstream applications. One of the challenges in this context is to create non-existent voices that sound as natural as possible. In this work, we propose to tackle this issue by generating speaker embeddings using a generative adversarial network with Wasserstein distance as cost function. By incorporating these artificial embeddings into a speech-to-text-to-speech pipeline, we outperform previous approaches in terms of privacy and utility. According to standard objective metrics and human evaluation, our approach generates intelligible and content-preserving yet privacy-protecting versions of the original recordings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2022

Generating gender-ambiguous voices for privacy-preserving speech recognition

Our voice encodes a uniquely identifiable pattern which can be used to i...
research
10/18/2021

Protecting Anonymous Speech: A Generative Adversarial Network Methodology for Removing Stylistic Indicators in Text

With Internet users constantly leaving a trail of text, whether through ...
research
08/30/2020

Speech Pseudonymisation Assessment Using Voice Similarity Matrices

The proliferation of speech technologies and rising privacy legislation ...
research
06/28/2023

Long-term Conversation Analysis: Exploring Utility and Privacy

The analysis of conversations recorded in everyday life requires privacy...
research
07/11/2022

Speaker Anonymization with Phonetic Intermediate Representations

In this work, we propose a speaker anonymization pipeline that leverages...
research
11/06/2022

Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling

Speech data on the Internet are proliferating exponentially because of t...
research
11/10/2022

Privacy-Utility Balanced Voice De-Identification Using Adversarial Examples

Faced with the threat of identity leakage during voice data publishing, ...

Please sign up or login with your details

Forgot password? Click here to reset