Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling

11/06/2022
by   Jixun Yao, et al.
0

Speech data on the Internet are proliferating exponentially because of the emergence of social media, and the sharing of such personal data raises obvious security and privacy concerns. One solution to mitigate these concerns involves concealing speaker identities before sharing speech data, also referred to as speaker anonymization. In our previous work, we have developed an automatic speaker verification (ASV)-model-free anonymization framework to protect speaker privacy while preserving speech intelligibility. Although the framework ranked first place in VoicePrivacy 2022 challenge, the anonymization was imperfect, since the speaker distinguishability of the anonymized speech was deteriorated. To address this issue, in this paper, we directly model the formant distribution and fundamental frequency (F0) to represent speaker identity and anonymize the source speech by the uniformly scaling formant and F0. By directly scaling the formant and F0, the speaker distinguishability degradation of the anonymized speech caused by the introduction of other speakers is prevented. The experimental results demonstrate that our proposed framework can improve the speaker distinguishability and significantly outperforms our previous framework in voice distinctiveness. Furthermore, our proposed method also can trade off the privacy-utility by using different scaling factors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2019

Speaker Anonymization Using X-vector and Neural Waveform Models

The social media revolution has produced a plethora of web services to w...
research
09/15/2023

Improving Voice Conversion for Dissimilar Speakers Using Perceptual Losses

The rising trend of using voice as a means of interacting with smart dev...
research
09/12/2023

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset

The success of deep learning in speaker recognition relies heavily on th...
research
07/15/2021

Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method

Speaker anonymization aims to suppress speaker individuality to protect ...
research
10/26/2022

Privacy-preserving Automatic Speaker Diarization

Automatic Speaker Diarization (ASD) is an enabling technology with numer...
research
04/12/2022

Enhancement of Pitch Controllability using Timbre-Preserving Pitch Augmentation in FastPitch

The recently developed pitch-controllable text-to-speech (TTS) model, i....
research
10/13/2022

Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy

In order to protect the privacy of speech data, speaker anonymization ai...

Please sign up or login with your details

Forgot password? Click here to reset