VAE-based Domain Adaptation for Speaker Verification

08/27/2019
by Xueyi Wang, et al.

Deep speaker embedding has achieved satisfactory performance in speaker verification. By training a neural model to discriminate the speakers in the training set, deep speaker embeddings (called `x-vectors`) can be derived from its hidden layers. Despite this good performance, current embedding models are highly domain-sensitive: they often work well in domains whose acoustic conditions match those of the training data (in-domain), but degrade in mismatched domains (out-of-domain). In this paper, we present a domain adaptation approach based on the Variational Auto-Encoder (VAE). The model transforms x-vectors into a regularized latent space; within this latent space, a small amount of data from the target domain is sufficient to accomplish the adaptation. Our experiments demonstrate that, with this VAE-adaptation approach, speaker embeddings can be easily transformed to the target domain, leading to a noticeable performance improvement.
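
The abstract does not spell out how the latent space is regularized. As a rough sketch only, assuming the common VAE formulation with a diagonal Gaussian posterior and a standard normal prior (an assumption, not a detail stated above), the regularizer is a KL term and latent codes are drawn with the reparameterization trick. The function names `kl_to_standard_normal` and `reparameterize`, and all numeric values, are hypothetical and for illustration only.

```python
import math
import random


def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a single latent code."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var)
    )


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps, with eps drawn from N(0, 1) per dimension."""
    return [
        m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
        for m, lv in zip(mu, log_var)
    ]


# Hypothetical encoder outputs for one x-vector (illustrative values only).
mu = [0.5, -1.0, 2.0]
log_var = [-1.0, 0.0, 0.5]

kl = kl_to_standard_normal(mu, log_var)          # penalty pulling codes toward N(0, I)
z = reparameterize(mu, log_var, random.Random(0))  # one sampled latent code
```

Under this assumed formulation, the KL term is zero exactly when the posterior already equals the standard normal prior, which is what keeps the latent codes in a regular, roughly Gaussian region.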


