Our research focuses on solving the zero-shot text classification proble...
This paper proposes to use both audio input and subject information to
p...
Audio-visual synchronization aims to determine whether the mouth movemen...
Audio-visual active speaker detection (AVASD) is well-developed, and now...
The countermeasure (CM) model is developed to protect Automatic Speaker
...
In this paper, we propose a new dataset named EGDB, that con-tains
trans...
Due to the rapid development of deep learning, we can now successfully
s...
In this paper, we introduce a novel attentional similarity module for th...
Separating two sources from an audio mixture is an important task with m...