Automated Audio Captioning (AAC) systems attempt to generate a natural
l...
Automated Audio Captioning (AAC) involves generating natural language
de...
Automated Audio Captioning (AAC) aims to develop systems capable of
desc...
In computer vision, convolutional neural networks (CNN) such as ConvNeXt...
Dilated Convolution with Learnable Spacings (DCLS) is a recently propose...
In this work, we propose to study the performance of a model trained wit...
Automatic Audio Captioning (AAC) is the task that aims to describe an au...
Meetings are a common activity in professional contexts, and it remains
...
Dilated convolution is basically a convolution with a wider kernel creat...
Automatic recognition systems for child speech are lagging behind those
...
Multi-label audio tagging consists of assigning sets of tags to audio
re...
Recently, semi-supervised learning (SSL) methods, in the framework of de...
Deep Neural Networks (DNNs) are the current state-of-the-art models in m...
Recently, it has been shown that spiking neural networks (SNNs) can be
t...
Sound event detection (SED) aims at identifying audio events (audio tagg...
The design of new methods and models when only weakly-labeled data are
a...
In this paper, we describe the outcomes of the challenge organized and r...
Detecting bird sounds in audio recordings automatically, if accurate eno...