Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization

04/27/2023
by Hamza Kheddar, et al.

Automatic speech recognition (ASR) has recently become an important challenge when using deep learning (DL). It requires large-scale training datasets and high computational and storage resources. Moreover, DL techniques, and machine learning (ML) approaches in general, assume that training and testing data come from the same domain, with the same input feature space and data distribution characteristics. This assumption, however, does not hold in some real-world artificial intelligence (AI) applications. There are also situations where real data is challenging or expensive to gather, or occurs only rarely, so the data requirements of DL models cannot be met. Deep transfer learning (DTL) has been introduced to overcome these issues; it helps develop high-performing models using real datasets that are small or slightly different but related to the training data. This paper presents a comprehensive survey of DTL-based ASR frameworks to shed light on the latest developments and help academics and professionals understand current challenges. Specifically, after presenting the DTL background, a well-designed taxonomy is adopted to inform the state-of-the-art. A critical analysis is then conducted to identify the limitations and advantages of each framework. Finally, a comparative study is introduced to highlight the current challenges before deriving opportunities for future research.
