The performance of Whisper in low-resource languages is still far from
p...
This paper focuses on investigating the learning operators for identifyi...
Matrix-variate time series data are largely available in applications.
H...
Speech enhancement (SE) performance has improved considerably since the ...
Speech enhancement is a critical component of many user-oriented audio
a...
Although deep learning (DL) has achieved notable progress in speech
enha...
Without the need for a clean reference, non-intrusive speech assessment
m...
Numerous compression and acceleration strategies have achieved outstandi...
Most of the deep learning-based speech enhancement models are learned in...
Although current deep generative adversarial networks (GANs) could synth...
A large number of Internet of Things (IoT) devices today are powered by
...
The discrepancy between the cost function used for training a speech
enh...
Multi-task learning (MTL) and attention mechanisms have been proven to
ef...
Speech enhancement (SE) aims to improve speech quality and intelligibili...
Estimating 3D human poses from a monocular video is still a challenging ...
This work describes the speaker verification system developed by Human
L...
The Transformer architecture has shown superior ability compared to recurre...
Deep learning-based models have greatly advanced the performance of spee...
Integrating modalities, such as video signals with speech, has been show...
Most recent studies on deep learning-based speech enhancement (SE) focus...