b'Yusuke Yasuda'

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Xin Wang
382 publications
Shinji Watanabe
239 publications
Junichi Yamagishi
127 publications
Tomoki Toda
66 publications
Yi Zhao
50 publications
Shinnosuke Takamichi
50 publications
Tomoki Hayashi
38 publications
Jiatong Shi
30 publications
Erica Cooper
25 publications
Ryuichi Yamamoto
19 publications
Takaaki Saeki
18 publications

research

∙ 12/16/2022

Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder

Text-to-speech synthesis (TTS) is a task to convert texts into speech. T...

0 Yusuke Yasuda, et al. ∙

research

∙ 12/16/2022

Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language

End-to-end text-to-speech synthesis (TTS) can generate highly natural sy...

0 Yusuke Yasuda, et al. ∙

research

∙ 10/15/2021

ESPnet2-TTS: Extending the Edge of TTS Research

This paper describes ESPnet2-TTS, an end-to-end text-to-speech (E2E-TTS)...

0 Tomoki Hayashi, et al. ∙

research

∙ 11/10/2020

Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis

We explore pretraining strategies including choice of base corpus with t...

7 Erica Cooper, et al. ∙

research

∙ 10/22/2020

How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers?

We have been working on speech synthesis for rakugo (a traditional Japan...

0 Shuhei Kato, et al. ∙

research

∙ 10/19/2020

End-to-End Text-to-Speech using Latent Duration based on VQ-VAE

Explicit duration modeling is a key to achieving robust and efficient al...

8 Yusuke Yasuda, et al. ∙

research

∙ 05/20/2020

Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

Neural sequence-to-sequence text-to-speech synthesis (TTS) can produce h...

9 Yusuke Yasuda, et al. ∙

research

∙ 10/28/2019

Effect of choice of probability distribution, randomness, and search methods for alignment modeling in sequence-to-sequence text-to-speech synthesis using hard alignment

Sequence-to-sequence text-to-speech (TTS) is dominated by soft-attention...

0 Yusuke Yasuda, et al. ∙

research

∙ 08/30/2019

Initial investigation of an encoder-decoder end-to-end TTS framework using marginalization of monotonic hard latent alignments

End-to-end text-to-speech (TTS) synthesis is a method that directly conv...

0 Yusuke Yasuda, et al. ∙

research

∙ 10/29/2018

Investigation of enhanced Tacotron text-to-speech synthesis systems with self-attention for pitch accent language

End-to-end speech synthesis is a promising approach that directly conver...

0 Yusuke Yasuda, et al. ∙

Yusuke Yasuda

Featured Co-authors

Sign in with Google

Consider DeepAI Pro