Michiel Bacchiani

research

∙ 05/30/2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

This paper introduces a new speech dataset called “LibriTTS-R” designed ...

0 Yuma Koizumi, et al. ∙

research

∙ 03/03/2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations

Speech restoration (SR) is a task of converting degraded speech signals ...

0 Yuma Koizumi, et al. ∙

research

∙ 10/03/2022

WaveFit: An Iterative and Non-autoregressive Neural Vocoder based on Fixed-Point Iteration

Denoising diffusion probabilistic models (DDPMs) and generative adversar...

0 Yuma Koizumi, et al. ∙

research

∙ 03/31/2022

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

Neural vocoder using denoising diffusion probabilistic model (DDPM) has ...

0 Yuma Koizumi, et al. ∙

research

∙ 02/16/2022

Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers

End-to-end speech recognition is a promising technology for enabling com...

0 Yotaro Kubo, et al. ∙

research

∙ 11/01/2021

SNRi Target Training for Joint Speech Enhancement and Recognition

This study aims to improve the performance of automatic speech recogniti...

0 Yuma Koizumi, et al. ∙

research

∙ 06/30/2021

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

Single-channel speech enhancement (SE) is an important task in speech pr...

0 Yuma Koizumi, et al. ∙

research

∙ 09/24/2018

From Audio to Semantics: Approaches to end-to-end spoken language understanding

Conventional spoken language understanding systems consist of two main c...

0 Parisa Haghani, et al. ∙

research

∙ 08/16/2018

Toward domain-invariant speech recognition via large scale training

Current state-of-the-art automatic speech recognition systems are traine...

0 Arun Narayanan, et al. ∙

research

∙ 12/09/2017

Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models

In this paper, we describe how to efficiently implement an acoustic room...

0 Chanwoo Kim, et al. ∙

research

∙ 12/05/2017

State-of-the-art Speech Recognition With Sequence-to-Sequence Models

Attention-based encoder-decoder architectures such as Listen, Attend, an...

0 Chung-Cheng Chiu, et al. ∙

research

∙ 12/05/2017

Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model

Sequence-to-sequence models provide a simple and elegant solution for bu...

0 Bo Li, et al. ∙

Michiel Bacchiani

Featured Co-authors

Sign in with Google

Consider DeepAI Pro