Transformer-based speech recognition (ASR) models with deep layers exhibi...
Whisper is a powerful automatic speech recognition (ASR) model. Neverthe...
Augmented Language Models (ALMs) empower large language models with the...
Dense retrieval (DR) converts queries and documents into dense embedding...
Very deep models for speaker recognition (SR) have demonstrated remarkab...
Probabilistic linear discriminant analysis (PLDA) is commonly used in sp...
State-of-the-art speaker verification (SV) systems use a back-end model to s...
Professional news media organizations have always touted the importance ...
This technical report describes our submission to the 2021 SLT Children...
A speech signal is composed of various informative fact...
This study addresses the problem of unsupervised subword unit discovery ...