
-
Dependency Parsing based Semantic Representation Learning with Graph Neural Network for Enhancing Expressiveness of Text-to-Speech
Semantic information of a sentence is crucial for improving the expressi...
-
Industry Practice of Coverage-Guided Enterprise-Level DBMS Fuzzing
As an infrastructure for data persistence and analysis, Database Managem...
-
Adversarial defense for automatic speaker verification by cascaded self-supervised learning models
Automatic speaker verification (ASV) is one of the core technologies in ...
-
Adversarially learning disentangled speech representations for robust multi-factor voice conversion
Factorizing speech as disentangled speech representations is vital to ac...
-
Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network
By using deep learning approaches, Speech Emotion Recognition (SER) on ...
-
Syntactic representation learning for neural network based TTS with syntactic parse tree traversal
Syntactic structure of a sentence text is correlated with the prosodic s...
-
Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input
Non-autoregressive (NAR) transformer models have achieved significantly ...
-
Improving pronunciation assessment via ordinal regression with anchored reference samples
Sentence level pronunciation assessment is important for Computer Assist...
-
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams
Generating 3D speech-driven talking head has received more and more atte...
-
Noise Robust TTS for Low Resource Speakers using Pre-trained Model and Speech Enhancement
With the popularity of deep neural network, speech synthesis task has ac...
-
Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks
Self-attention network (SAN) can benefit significantly from the bi-direc...
-
Study on Feature Subspace of Archetypal Emotions for Speech Emotion Recognition
Feature subspace selection is an important part in speech emotion recogn...