Sebastian Stueker

research

∙ 05/07/2021

Efficient Weight factorization for Multilingual Speech Recognition

End-to-end multilingual speech recognition involves using a single model...

0 Ngoc-Quan Pham, et al. ∙

research

∙ 10/07/2020

Super-Human Performance in Online Low-latency Recognition of Conversational Speech

Achieving super-human performance in recognizing human speech has been a...

0 Thai-Son Nguyen, et al. ∙

research

∙ 05/20/2020

Relative Positional Encoding for Speech Recognition and Direct Translation

Transformer models are powerful sequence-to-sequence architectures that ...

0 Ngoc-Quan Pham, et al. ∙

research

∙ 03/22/2020

High Performance Sequence-to-Sequence Model for Streaming Speech Recognition

Recently sequence-to-sequence models have started to achieve state-of-th...

0 Thai-Son Nguyen, et al. ∙

research

∙ 03/22/2020

Low Latency ASR for Simultaneous Speech Translation

User studies have shown that reducing the latency of our simultaneous le...

0 Thai-Son Nguyen, et al. ∙

research

∙ 10/29/2019

Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation

Sequence-to-Sequence (S2S) models recently started to show state-of-the-...

57 Thai-Son Nguyen, et al. ∙

research

∙ 03/31/2019

Learning Shared Encoding Representation for End-to-End Speech Recognition Models

In this work, we learn a shared encoding representation for a multi-task...

0 Thai-Son Nguyen, et al. ∙

research

∙ 02/02/2019

Using multi-task learning to improve the performance of acoustic-to-word and conventional hybrid models

Acoustic-to-word (A2W) models that allow direct mapping from acoustic si...

0 Thai-Son Nguyen, et al. ∙

research

∙ 02/14/2018

Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop

We summarize the accomplishments of a multi-disciplinary workshop explor...

0 Odette Scharenborg, et al. ∙

Sebastian Stueker

Featured Co-authors

Sign in with Google

Consider DeepAI Pro