I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification

04/01/2018
by   Jiacen Zhang, et al.
0

I-vector based text-independent speaker verification (SV) systems often have poor performance with short utterances, as the biased phonetic distribution in a short utterance makes the extracted i-vector unreliable. This paper proposes an i-vector compensation method using a generative adversarial network (GAN), where its generator network is trained to generate a compensated i-vector from a short-utterance i-vector and its discriminator network is trained to determine whether an i-vector is generated by the generator or the one extracted from a long utterance. Additionally, we assign two other learning tasks to the GAN to stabilize its training and to make the generated ivector more speaker-specific. Speaker verification experiments on the NIST SRE 2008 "10sec-10sec" condition show that our method reduced the equal error rate by 11.3

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2018

Short utterance compensation in speaker verification via cosine-based teacher-student learning of speaker embeddings

Input utterance with short duration is one of the most critical threats ...
research
03/24/2018

MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks

In this paper, we propose an enhanced triplet method that improves the e...
research
07/13/2018

Parametric generation of conditional geological realizations using generative neural networks

We introduce a method for parametric generation of conditional geologica...
research
08/06/2019

An End-to-End Text-independent Speaker Verification Framework with a Keyword Adversarial Network

This paper presents an end-to-end text-independent speaker verification ...
research
04/06/2020

Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs

In realistic settings, a speaker recognition system needs to identify a ...
research
03/31/2016

System Combination for Short Utterance Speaker Recognition

For text-independent short-utterance speaker recognition (SUSR), the per...
research
12/06/2018

Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder

Neural networks based vocoders, typically the WaveNet, have achieved spe...

Please sign up or login with your details

Forgot password? Click here to reset