Spoken language recognition (SLR) is the task of automatically identifyi...
Many audio processing tasks require perceptual assessment. However, the ...
Consumer-grade music recordings such as those captured by mobile devices...
What audio embedding approach generalizes best to a wide range of downst...
Modifying the pitch and timing of an audio signal are fundamental audio...
Recent advances in deep learning have expanded possibilities to generate...
Text-based speech editors expedite the process of editing speech recordi...
Many speech processing methods based on deep learning require an automat...
We propose a high-order numerical homogenization method for dissipative...
Deep representation learning offers a powerful paradigm for mapping inpu...
Music similarity search is useful for a variety of creative tasks such a...
Speech synthesis has recently seen significant improvements in fidelity,...
Real-world audio recordings are often degraded by factors such as noise,...
Non-parallel many-to-many voice conversion remains an interesting but ch...
Assessment of many audio processing tasks relies on subjective evaluatio...
Editing talking-head video to change the speech content or to remove fil...