What does it take to create the Babel Fish, a tool that can help individ...
Speech-to-speech translation (S2ST) enables spoken communication between...
It has been known that direct speech-to-speech translation (S2ST) models...
Expressive speech-to-speech translation (S2ST) aims to transfer prosodic...
Direct speech-to-speech translation (S2ST), in which all components can ...
We study speech-to-speech translation (S2ST) that translates speech from...
We present SpeechMatrix, a large-scale multilingual corpus of
speech-to-...
Spoken language understanding (SLU) is a task aiming to extract high-lev...
Self-supervised representations have been extensively studied for
discri...
Direct speech-to-speech translation (S2ST) models suffer from data scarc...
Textless spoken language processing research aims to extend the applicab...
As the computational requirements for machine learning systems and the s...
We present a textless speech-to-speech translation (S2ST) system that ca...
We present the first direct simultaneous speech-to-speech translation
(S...
This paper presents fairseq S^2, a fairseq extension for speech synthesi...
Speech pre-training has primarily demonstrated efficacy on classificatio...
We present a direct speech-to-speech translation (S2ST) model that trans...
Self-supervised learning of speech representations has been a very activ...
We introduce VoxPopuli, a large-scale multilingual corpus providing 100K...
Few-shot algorithms aim at learning new tasks provided only a handful of...
This paper describes Facebook AI's submission to WMT20 shared news
trans...
For sequence transduction tasks like speech recognition, a strong struct...
We revisit self-training in the context of end-to-end speech recognition...
We propose a fully convolutional sequence-to-sequence encoder architectu...