Teach an all-rounder with experts in different domains

07/09/2019
by Zhao You, et al.

In many automatic speech recognition (ASR) tasks, an ideal model has to be applicable across multiple domains. In this paper, we propose to teach an all-rounder with experts in different domains. Concretely, we build a multi-domain acoustic model by applying the teacher-student training framework. First, for each domain, a teacher model (domain-dependent model) is trained by fine-tuning a multi-condition model on a domain-specific subset of the data. Then all of these teacher models are used to teach a single student model simultaneously. We perform experiments on two predefined domain setups: one consists of domains with different speaking styles; the other consists of near-field, far-field, and far-field with noise. Moreover, two types of models are examined: the deep feedforward sequential memory network (DFSMN) and long short-term memory (LSTM). Experimental results show that the model trained with this framework outperforms not only the multi-condition model but also the domain-dependent models. In particular, our training method provides up to 10.4% relative character error rate improvement over the baseline model (the multi-condition model).
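
The abstract does not spell out the training objective, so the following is only a minimal sketch of one way several domain teachers could supervise a single student. It assumes each training example carries a domain label, that the soft target for an example comes from the teacher of its own domain, and that the student is trained with cross-entropy against those soft targets. The function names (`multi_teacher_soft_targets`, `distillation_loss`) and array shapes are hypothetical and not taken from the paper.

```python
import numpy as np

def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))

def multi_teacher_soft_targets(teacher_logits, domain_ids):
    """Pick, for every example, the posterior of its own domain's teacher.

    teacher_logits: (num_domains, batch, num_classes), one slice per teacher.
    domain_ids:     (batch,), integer domain label of each example.
    Assumption: one teacher per domain, selected by the example's label.
    """
    batch_index = np.arange(domain_ids.shape[0])
    selected = teacher_logits[domain_ids, batch_index]  # (batch, num_classes)
    return np.exp(log_softmax(selected))

def distillation_loss(student_logits, soft_targets):
    """Mean cross-entropy between teacher soft targets and student output."""
    per_example = -(soft_targets * log_softmax(student_logits)).sum(axis=-1)
    return float(per_example.mean())
```

In this sketch a mixed-domain batch is handled in one pass: every example is scored against its own expert, so the single student receives supervision from all teachers at once.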

research
02/20/2018

Distilling Knowledge Using Parallel Data for Far-field Speech Recognition

In order to improve the performance for far-field speech recognition, th...
research
03/09/2020

Toward Cross-Domain Speech Recognition with End-to-End Models

In the area of multi-domain speech recognition, research in the past foc...
research
02/01/2020

Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning

In this work, we investigated the teacher-student training paradigm to t...
research
07/29/2022

Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge

This paper presents our efforts to build a robust ASR model for the shar...
research
10/22/2022

Understanding Domain Learning in Language Models Through Subpopulation Analysis

We investigate how different domains are encoded in modern neural networ...
research
07/06/2021

Generalizing Nucleus Recognition Model in Multi-source Images via Pruning

Ki67 is a significant biomarker in the diagnosis and prognosis of cancer...
research
08/16/2018

Toward domain-invariant speech recognition via large scale training

Current state-of-the-art automatic speech recognition systems are traine...