Estimating Uniqueness of Human Voice UsingI-Vector Representation

08/27/2020
by   Erkam Sinan Tandogan, et al.
0

We study the individuality of human voice with re-spect to a widely used feature representation of speech utterances,namely, the i-vector model. As a first step toward this goal, wecompare and contrast uniqueness measures proposed consideringdifferent biometric modalities. Then, we introduce a more appro-priate uniqueness measure that evaluates the entropy of i-vectorswhile taking into account speaker level variations. Estimates areobtained on two newly generated datasets designed to capturevariabilities between and within speakers. The first dataset speechsamples of more than 20 thousand speakers obtained fromTEDx Talks videos. The second one includes samples of morethan one and a half thousand actors that are extracted frommovie dialogues. By using this data, we analyzed how severalfactors, such as the number of speakers, number of samples perspeakers, and different levels of within-speaker variation affectestimates. Most notably, we determined that the discretizationof i-vector elements does not necessarily cause a reduction inspeaker recognition performance. Our results show that thedegree of uniqueness offered by i-vector based representationmay reach 43-52 bits in a confined setting; however, under lessconstrained variations estimates reduce significantly to 13-20 bitlevel, depending on coarseness of quantization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/08/2021

CUHK-EE Voice Cloning System for ICASSP 2021 M2VoC Challenge

This paper presents the CUHK-EE voice cloning system for ICASSP 2021 M2V...
research
05/05/2022

Speaker Recognition in the Wild

In this paper, we propose a pipeline to find the number of speakers, as ...
research
04/10/2020

Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data

We present progress towards bilingual Text-to-Speech which is able to tr...
research
06/23/2021

Enrollment-less training for personalized voice activity detection

We present a novel personalized voice activity detection (PVAD) learning...
research
04/24/2018

Perceptual Evaluation of the Effectiveness of Voice Disguise by Age Modification

Voice disguise, purposeful modification of one's speaker identity with t...
research
10/23/2013

Can Facial Uniqueness be Inferred from Impostor Scores?

In Biometrics, facial uniqueness is commonly inferred from impostor simi...
research
07/24/2018

Speakers account for asymmetries in visual perspective so listeners don't have to

Debates over adults' theory of mind use have been fueled by surprising f...

Please sign up or login with your details

Forgot password? Click here to reset