On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer

07/25/2023
by Md Asif Jalal, et al.

Smart devices serviced by large-scale AI models necessitate transferring user data to the cloud for inference. For speech applications, this means transferring private user information, e.g., speaker identity. Our paper proposes a privacy-enhancing framework that targets speaker identity anonymization while preserving accuracy on our downstream task, Automatic Speech Recognition (ASR). The proposed framework attaches flexible gradient-reversal-based speaker adversarial layers to target layers within an ASR model; speaker adversarial training then anonymizes the acoustic embeddings generated by the targeted layers by removing speaker identity. We propose on-device deployment in which the initial layers of the ASR model are executed on the device and the anonymized embeddings are transmitted to the cloud, where the rest of the model is executed while preserving privacy. Experimental results show that our method efficiently reduces speaker recognition relative accuracy by 33% and improves ASR performance by achieving a 6.2% relative Word Error Rate (WER) reduction.
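To make the gradient reversal idea in the abstract concrete, here is a minimal sketch in plain Python with hand-written gradients. It is not the authors' implementation: the function names, the scaling factor `lam`, and the toy gradient values are all assumptions chosen for illustration. The sketch shows only the core mechanism: the layer acts as the identity in the forward pass, and in the backward pass it flips the sign of the speaker-classifier gradient before it reaches the shared encoder layers.

```python
# Toy gradient reversal layer (GRL) with hand-written gradients, stdlib only.
# All names (grl_forward, grl_backward, encoder_gradient, lam) are
# hypothetical and chosen for illustration; they are not from the paper.

def grl_forward(h):
    """Forward pass: the GRL passes the embedding through unchanged."""
    return list(h)

def grl_backward(grad_out, lam):
    """Backward pass: negate the incoming gradient and scale it by lam."""
    return [-lam * g for g in grad_out]

def encoder_gradient(grad_asr, grad_spk, lam):
    """Combine the gradients that reach the targeted encoder layer.

    grad_asr: gradient of the ASR loss w.r.t. the embedding (unchanged).
    grad_spk: gradient of the speaker-classification loss w.r.t. the
              embedding, which first passes back through the GRL.
    """
    reversed_spk = grl_backward(grad_spk, lam)
    return [a + r for a, r in zip(grad_asr, reversed_spk)]

# Made-up values standing in for one acoustic embedding and its gradients.
embedding = [0.5, -1.0, 2.0]
grad_asr = [0.25, 0.5, -1.0]
grad_spk = [1.0, -2.0, 0.5]

forwarded = grl_forward(embedding)  # identical to the input embedding
total = encoder_gradient(grad_asr, grad_spk, lam=0.5)
print(total)
```

Because the speaker gradient arrives with its sign flipped, a gradient-descent step on the encoder lowers the ASR loss while raising the speaker-classification loss, which is what pushes the embeddings toward carrying less speaker information. In an autograd framework the same behaviour would normally be written as a custom backward function rather than by hand.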

research
11/12/2019

Privacy-Preserving Adversarial Representation Learning in ASR: Reality or Illusion?

Automatic speech recognition (ASR) is a key technology in many services ...
research
08/13/2019

End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning

This paper presents our latest investigation on end-to-end automatic spe...
research
02/12/2021

Content-Aware Speaker Embeddings for Speaker Diarisation

Recent speaker diarisation systems often convert variable length speech ...
research
06/06/2023

Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering

The challenge of fairness arises when Automatic Speech Recognition (ASR)...
research
09/09/2019

Prεεch: A System for Privacy-Preserving Speech Transcription

New Advances in machine learning and the abundance of speech datasets ha...
research
02/23/2022

Differentially Private Speaker Anonymization

Sharing real-world speech utterances is key to the training and deployme...
