Conformer-based end-to-end models have become ubiquitous these days and ...
Recently, there has been an increasing interest in unifying streaming an...
In this work, we define barge-in verification as a supervised learning t...
End-to-end speech recognition models trained using joint Connectionist
T...
Neural Language Models (NLM), when trained and evaluated with context
sp...
We live in a world where 60
languages fluently. Members of these communi...
In this work, we explore a multimodal semi-supervised learning approach ...
Automatic speech recognition (ASR) systems in the medical domain that fo...
Automatic speech recognition (ASR) systems in the medical domain that fo...
Automatic speaker verification (ASV) is one of the most natural and
conv...
We present a neural text-to-speech system for fine-grained prosody trans...
Neural text-to-speech synthesis (NTTS) models have shown significant pro...
This paper proposes a new approach to duration modelling for statistical...
Text-to-Speech synthesis in Indian languages has a seen lot of progress ...