NELS - Never-Ending Learner of Sounds

01/17/2018
by Benjamin Elizalde, et al.

Sounds are essential to how humans perceive and interact with the world. They are captured in recordings and shared on the Internet on a minute-by-minute basis. These recordings, which are predominantly videos, constitute the largest archive of sounds we know. However, most of these recordings have undescribed content, which makes methods for automatic sound analysis, indexing, and retrieval necessary. These methods have to address multiple challenges, such as the relation between sounds and language, numerous and diverse sound classes, and large-scale evaluation. We propose a system that continuously learns relations between sounds and language from the web, improves sound recognition models over time, and evaluates its learning competency at large scale without references. We introduce the Never-Ending Learner of Sounds (NELS), a project for the continuous learning of sounds and their associated knowledge, available online at nels.cs.cmu.edu.

