We proposed Audio Difference Captioning (ADC) as a new extension task of...
Self-supervised learning general-purpose audio representations have
demo...
We present the task description of the Detection and Classification of
A...
This paper provides a baseline system for First-shot-compliant unsupervi...
Masked Autoencoders is a simple yet powerful self-supervised learning me...
We propose a novel framework for target speech extraction based on seman...
The amount of audio data available on public websites is growing rapidly...
We present the task description of the Detection and Classification of
A...
Many application studies rely on audio DNN models pre-trained on a
large...
Recent general-purpose audio representations show state-of-the-art
perfo...
Pre-trained models are essential as feature extractors in modern machine...
We present the task description and discussion on the results of the DCA...
Inspired by the recent progress in self-supervised learning for computer...
The goal of audio captioning is to translate input audio into its descri...
In this paper we study the problem of acoustic scene classification, i.e...