General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline

07/26/2018
by   Eduardo Fonseca, et al.
0

This paper describes Task 2 of the DCASE 2018 Challenge, titled "General-purpose audio tagging of Freesound content with AudioSet labels". This task was hosted on the Kaggle platform as "Freesound General-Purpose Audio Tagging Challenge". The goal of the task is to build an audio tagging system that can recognize the category of an audio clip from a subset of 41 heterogeneous categories drawn from the AudioSet Ontology. We present the task, the dataset prepared for the competition, and a baseline system.

READ FULL TEXT
research
10/30/2018

General audio tagging with ensembling convolutional neural network and statistical features

Audio tagging aims to infer descriptive labels from audio clips. Audio t...
research
06/06/2017

A General-Purpose Tagger with Convolutional Neural Networks

We present a general-purpose tagger based on convolutional neural networ...
research
06/07/2019

Audio tagging with noisy labels and minimal supervision

This paper introduces Task 2 of the DCASE2019 Challenge, titled "Audio t...
research
11/26/2018

Combining High-Level Features of Raw Audio Waves and Mel-Spectrograms for Audio Tagging

In this paper, we describe our contribution to Task 2 of the DCASE 2018 ...
research
08/30/2023

General Purpose Audio Effect Removal

Although the design and application of audio effects is well understood,...
research
01/26/2021

The Pixels and Sounds of Emotion: General-Purpose Representations of Arousal in Games

What if emotion could be captured in a general and subject-agnostic fash...
research
12/27/2017

A Light-Weight Multimodal Framework for Improved Environmental Audio Tagging

The lack of strong labels has severely limited the state-of-the-art full...

Please sign up or login with your details

Forgot password? Click here to reset