This paper introduces a new speech dataset called “LibriTTS-R” designed ...
Speech restoration (SR) is the task of converting degraded speech signals ...
Denoising diffusion probabilistic models (DDPMs) and generative adversar...
A neural vocoder using a denoising diffusion probabilistic model (DDPM) has ...
End-to-end speech recognition is a promising technology for enabling com...
This study aims to improve the performance of automatic speech recogniti...
Single-channel speech enhancement (SE) is an important task in speech pr...
Conventional spoken language understanding systems consist of two main c...
Current state-of-the-art automatic speech recognition systems are traine...
In this paper, we describe how to efficiently implement an acoustic room...
Attention-based encoder-decoder architectures such as Listen, Attend, an...
Sequence-to-sequence models provide a simple and elegant solution for bu...