Speaker Adaptive Training using Model Agnostic Meta-Learning

10/23/2019
by   Ondřej Klejch, et al.
0

Speaker adaptive training (SAT) of neural network acoustic models learns models in a way that makes them more suitable for adaptation to test conditions. Conventionally, model-based speaker adaptive training is performed by having a set of speaker dependent parameters that are jointly optimised with speaker independent parameters in order to remove speaker variation. However, this does not scale well if all neural network weights are to be adapted to the speaker. In this paper we formulate speaker adaptive training as a meta-learning task, in which an adaptation process using gradient descent is encoded directly into the training of the model. We compare our approach with test-only adaptation of a standard baseline model and a SAT-LHUC model with a learned speaker adaptation schedule and demonstrate that the meta-learning approach achieves comparable results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2018

Learning to adapt: a meta-learning approach for speaker adaptation

The performance of automatic speech recognition systems can be improved ...
research
11/07/2021

Meta-TTS: Meta-Learning for Few-Shot Speaker Adaptive Text-to-Speech

Personalizing a speech synthesis system is a highly desired application,...
research
12/27/2018

Tied Hidden Factors in Neural Networks for End-to-End Speaker Recognition

In this paper we propose a method to model speaker and session variabili...
research
03/29/2021

Improved Meta-learning training for Speaker Verification

Meta-learning (ML) has recently become a research hotspot in speaker ver...
research
01/12/2016

Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation

This work presents a broad study on the adaptation of neural network aco...
research
10/17/2017

Embedding-Based Speaker Adaptive Training of Deep Neural Networks

An embedding-based speaker adaptive training (SAT) approach is proposed ...
research
10/24/2019

Meta-learning for robust child-adult classification from speech

Computational modeling of naturalistic conversations in clinical applica...

Please sign up or login with your details

Forgot password? Click here to reset