Spoken language identification refers to the task of automatically predi...
ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitat...
The black-box nature of end-to-end speech translation (E2E ST) systems m...
Collecting sufficient labeled data for spoken language understanding (SL...
End-to-end spoken language understanding (SLU) systems are gaining popul...
Connectionist Temporal Classification (CTC) is a widely used approach fo...
End-to-end (E2E) models are becoming increasingly popular for spoken lan...
Conformer has proven to be effective in many speech processing tasks. It...
State-of-the-art encoder-decoder models (e.g. for machine translation (M...
We introduce FLEURS, the Few-shot Learning Evaluation of Universal
Repre...
Conversational bilingual speech encompasses three types of utterances: t...
As Automatic Speech Processing (ASR) systems are getting better, there i...
The multi-decoder (MD) end-to-end speech translation model has demonstra...
Building language-universal speech recognition systems entails producing...
This paper describes the ESPnet-ST group's IWSLT 2021 submission in the
...
Decomposable tasks are complex and comprise of a hierarchy of sub-tasks....
End-to-end approaches for sequence tasks are becoming increasingly popul...
We live in a world where 60
languages fluently. Members of these communi...
Multilingual models can improve language processing, particularly for lo...
Automatic phonemic transcription tools are useful for low-resource langu...
Inspired by modular software design principles of independence,
intercha...
While low resource speech recognition has attracted a lot of attention f...
Multilingual acoustic models have been successfully applied to low-resou...
We present an end-to-end speech recognition model that learns interactio...
We present a novel conversational-context aware end-to-end speech recogn...
This paper describes the ARIEL-CMU submissions to the Low Resource Human...
Building multilingual and crosslingual models help bring different langu...
Developing a practical speech recognizer for a low resource language is
...
Techniques for multi-lingual and cross-lingual speech recognition can he...