Robust sound event detection in bioacoustic sensor networks

05/20/2019
by   Vincent Lostanlen, et al.
0

Bioacoustic sensors, sometimes known as autonomous recording units (ARUs), can record sounds of wildlife over long periods of time in scalable and minimally invasive ways. Deriving per-species abundance estimates from these sensors requires detection, classification, and quantification of animal vocalizations as individual acoustic events. Yet, variability in ambient noise, both over time and across sensors, hinders the reliability of current automated systems for sound event detection (SED), such as convolutional neural networks (CNN) in the time-frequency domain. In this article, we develop, benchmark, and combine several machine listening techniques to improve the generalizability of SED models across heterogeneous acoustic environments. As a case study, we consider the problem of detecting avian flight calls from a ten-hour recording of nocturnal bird migration, recorded by a network of six ARUs in the presence of heterogeneous background noise. Starting from a CNN yielding state-of-the-art accuracy on this task, we introduce two noise adaptation techniques, respectively integrating short-term (60-millisecond) and long-term (30-minute) context. First, we apply per-channel energy normalization (PCEN) in the time-frequency domain, which applies short-term automatic gain control to every subband in the mel-frequency spectrogram. Secondly, we replace the last dense layer in the network by a context-adaptive neural network (CA-NN) layer, i.e. an affine layer whose weights are dynamically adapted at prediction time by an auxiliary network taking long-term summary statistics of spectrotemporal features as input. We show that both techniques are helpful and complementary. [...] We release a pre-trained version of our best performing system under the name of BirdVoxDetect, a ready-to-use detector of avian flight calls in field recordings.

READ FULL TEXT
research
12/14/2019

Learning discriminative and robust time-frequency representations for environmental sound classification

Convolutional neural networks (CNN) are one of the best-performing neura...
research
08/11/2021

Robust Feature Learning on Long-Duration Sounds for Acoustic Scene Classification

Acoustic scene classification (ASC) aims to identify the type of scene (...
research
11/15/2019

Adaptive Multi-scale Detection of Acoustic Events

The goal of acoustic (or sound) events detection (AED or SED) is to pred...
research
06/19/2023

Female mosquito detection by means of AI techniques inside release containers in the context of a Sterile Insect Technique program

The Sterile Insect Technique (SIT) is a biological pest control techniqu...
research
06/24/2022

An Intensity and Phase Stacked Analysis of Phase-OTDR System using Deep Transfer Learning and Recurrent Neural Networks

Distributed acoustic sensors (DAS) are effective apparatus which are wid...
research
10/03/2018

CRED: A Deep Residual Network of Convolutional and Recurrent Units for Earthquake Signal Detection

Earthquake signal detection is at the core of observational seismology. ...

Please sign up or login with your details

Forgot password? Click here to reset