For Mandarin end-to-end (E2E) automatic speech recognition (ASR) tasks,
...
The acoustic and linguistic features are important cues for the spoken
l...
Speech enhancement (SE) performance has improved considerably since the ...
In order to reduce domain discrepancy to improve the performance of
cros...
Speaker embedding is an important front-end module to explore discrimina...
In acoustic signal processing, the target signals usually carry semantic...
This paper presents a novel discriminator-constrained optimal transport
...
Automatic speaker verification (ASV) systems, which determine whether tw...
The discrepancy between the cost function used for training a speech
enh...
Generative probability models are widely used for speaker verification (...
Synthesized speech from articulatory movements can have real-world use f...
The task for speaker verification (SV) is to decide an utterance is spok...
Due to the mismatch of statistical distributions of acoustic speech betw...
Speech enhancement (SE) aims to improve speech quality and intelligibili...
In noisy conditions, knowing speech contents facilitates listeners to mo...
Due to the simple design pipeline, end-to-end (E2E) neural models for sp...
Deep learning-based models have greatly advanced the performance of spee...
Convolutional neural network (CNN) is an indispensable building block fo...
In a noisy environment, a lossy speech signal can be automatically resto...
Reverberation, which is generally caused by sound reflections from walls...
Reverberation, which is generally caused by sound reflections from walls...
Speech enhancement model is used to map a noisy speech to a clean speech...
This paper aims to address two issues existing in the current speech
enh...
This study proposes a fully convolutional network (FCN) model for raw
wa...