Large-scale speech self-supervised learning (SSL) has emerged to the mai...
Several fast text-to-speech (TTS) models have been proposed for real-tim...
Speaker verification (SV) has recently attracted considerable research
i...
While deep learning has made impressive progress in speech synthesis and...
Several papers have proposed deep-learning-based models to predict the m...
Keyword spotting (KWS) and speaker verification (SV) have been studied
i...
Currently, the most widely used approach for speaker verification is the...
Currently, the most widely used approach for speaker verification is the...
In realistic settings, a speaker recognition system needs to identify a
...
Acoustic word embeddings — fixed-dimensional vector representations of
a...
Voice activity detection (VAD), which classifies frames as speech or
non...
In this paper, we propose a new pooling method called spatial pyramid
en...
Previous researches on acoustic word embeddings used in query-by-example...