Protecting gender and identity with disentangled speech representations

04/22/2021
by   Dimitrios Stoidis, et al.
0

Besides its linguistic content, our speech is rich in biometric information that can be inferred by classifiers. Learning privacy-preserving representations for speech signals enables downstream tasks without sharing unnecessary, private information about an individual. In this paper, we show that protecting gender information in speech is more effective than modelling speaker-identity information only when generating a non-sensitive representation of speech. Our method relies on reconstructing speech by decoding linguistic content along with gender information using a variational autoencoder. Specifically, we exploit disentangled representation learning to encode information about different attributes into separate subspaces that can be factorised independently. We present a novel way to encode gender information and disentangle two sensitive biometric identifiers, namely gender and identity, in a privacy-protecting setting. Experiments on the LibriSpeech dataset show that gender recognition and speaker verification can be reduced to a random guess, protecting against classification-based attacks, while maintaining the utility of the signal for speech recognition.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2022

Generating gender-ambiguous voices for privacy-preserving speech recognition

Our voice encodes a uniquely identifiable pattern which can be used to i...
research
06/05/2021

An Attribute-Aligned Strategy for Learning Speech Representation

Advancement in speech technology has brought convenience to our life. Ho...
research
08/27/2023

Fairness and Privacy in Voice Biometrics:A Study of Gender Influences Using wav2vec 2.0

This study investigates the impact of gender information on utility, pri...
research
11/12/2019

Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

The use of photoplethysmogram signal (PPG) for heart and sleep monitorin...
research
03/15/2022

Privacy-Preserving Speech Representation Learning using Vector Quantization

With the popularity of virtual assistants (e.g., Siri, Alexa), the use o...
research
07/26/2021

Beyond Voice Identity Conversion: Manipulating Voice Attributes by Adversarial Learning of Structured Disentangled Representations

Voice conversion (VC) consists of digitally altering the voice of an ind...
research
06/30/2023

Beyond Neural-on-Neural Approaches to Speaker Gender Protection

Recent research has proposed approaches that modify speech to defend aga...

Please sign up or login with your details

Forgot password? Click here to reset