
-
Unsupervised Cross-Lingual Speech Emotion Recognition Using DomainAdversarial Neural Network
By using deep learning approaches, Speech Emotion Recog-nition (SER) on ...
read it
-
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Syntactic structure of a sentence text is correlated with the prosodic s...
read it
-
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input
Non-autoregressive (NAR) transformer models have achieved significantly ...
read it
-
Improving pronunciation assessment via ordinal regression with anchored reference samples
Sentence level pronunciation assessment is important for Computer Assist...
read it
-
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Generating 3D speech-driven talking head has received more and more atte...
read it
-
Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
With the popularity of deep neural network, speech synthesis task has ac...
read it
-
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks
Self-attention network (SAN) can benefit significantly from the bi-direc...
read it
-
Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition
Feature subspace selection is an important part in speech emotion recogn...
read it