Pause insertion, also known as phrase break prediction and phrasing, is ...
Automatic speech recognition (ASR) systems developed in recent years hav...
We present the UTokyo-SaruLab mean opinion score (MOS) prediction system...
Multi-speaker speech synthesis is a technique for modeling multiple spea...
This paper presents a deep Gaussian process (DGP) model with a recurrent...
Thanks to improvements in machine learning techniques, including deep
le...
This paper proposes a generative moment matching network (GMMN)-based
po...
This paper presents sampling-based speech parameter generation using
mom...