Shigeki Karita

research

∙ 06/07/2023

Lenient Evaluation of Japanese Speech Recognition: Modeling Naturally Occurring Spelling Inconsistency

Word error rate (WER) and character error rate (CER) are standard metric...

Shigeki Karita, et al.

research

∙ 05/30/2023

LibriTTS-R: A Restored Multi-Speaker Text-to-Speech Corpus

This paper introduces a new speech dataset called “LibriTTS-R” designed ...

Yuma Koizumi, et al.

research

∙ 03/03/2023

Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations

Speech restoration (SR) is a task of converting degraded speech signals ...

Yuma Koizumi, et al.

research

∙ 02/16/2022

Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers

End-to-end speech recognition is a promising technology for enabling com...

Yotaro Kubo, et al.

research

∙ 11/01/2021

SNRi Target Training for Joint Speech Enhancement and Recognition

This study aims to improve the performance of automatic speech recogniti...

Yuma Koizumi, et al.

research

∙ 06/30/2021

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement

Single-channel speech enhancement (SE) is an important task in speech pr...

Yuma Koizumi, et al.

research

∙ 06/09/2021

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition

End-to-end (E2E) modeling is advantageous for automatic speech recogniti...

Shigeki Karita, et al.

research

∙ 12/23/2020

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

This paper describes the recent development of ESPnet (https://github.co...

Shinji Watanabe, et al.

research

∙ 10/24/2020

Unsupervised Learning of Disentangled Speech Content and Style Representation

We present an approach for unsupervised learning of speech representatio...

Andros Tjandra, et al.

research

∙ 04/21/2020

ESPnet-ST: All-in-One Speech Translation Toolkit

We present ESPnet-ST, which is designed for the quick development of spe...

Hirofumi Inaguma, et al.

research

∙ 09/13/2019

A Comparative Study on Transformer vs RNN in Speech Applications

Sequence-to-sequence models have been widely used in end-to-end speech p...

Shigeki Karita, et al.

research

∙ 03/30/2018

ESPnet: End-to-End Speech Processing Toolkit

This paper introduces a new open source platform for end-to-end speech p...

Shinji Watanabe, et al.
