Applying changes to an input speech signal to change the perceived speak...
Existing systems for sound event localization and detection (SELD) typic...
In this paper, we present a new dataset of music performance videos whic...
Separating different music instruments playing the same piece is a
chall...
Both acoustic and visual information influence human perception of speec...
Likelihood-based generative models are a promising resource to detect
ou...
The explainability of Convolutional Neural Networks (CNNs) is a particul...
Can we perform an end-to-end sound source separation (SSS) with a variab...