Deep Networks tag the location of bird vocalisations on audio spectrograms

11/12/2017
by   Lefteris Fanioudakis, et al.
0

This work focuses on reliable detection and segmentation of bird vocalizations as recorded in the open field. Acoustic detection of avian sounds can be used for the automatized monitoring of multiple bird taxa and querying in long-term recordings for species of interest. These tasks are tackled in this work, by suggesting two approaches: A) First, DenseNets are applied to weekly labeled data to infer the attention map of the dataset (i.e. Salience and CAM). We push further this idea by directing attention maps to the YOLO v2 Deepnet-based, detection framework to localize bird vocalizations. B) A deep autoencoder, namely the U-net, maps the audio spectrogram of bird vocalizations to its corresponding binary mask that encircles the spectral blobs of vocalizations while suppressing other audio sources. We focus solely on procedures requiring minimum human attendance, suitable to scan massive volumes of data, in order to analyze them, evaluate insights and hypotheses and identify patterns of bird activity. Hopefully, this approach will be valuable to researchers, conservation practitioners, and decision makers that need to design policies on biodiversity issues.

READ FULL TEXT

page 2

page 4

research
01/08/2019

Presence-absence estimation in audio recordings of tropical frog communities

One non-invasive way to study frog communities is by analyzing long-term...
research
02/25/2019

Automatic Detection and Compression for Passive Acoustic Monitoring of the African Forest Elephant

In this work, we consider applying machine learning to the analysis and ...
research
08/09/2023

Transferable Models for Bioacoustics with Human Language Supervision

Passive acoustic monitoring offers a scalable, non-invasive method for t...
research
02/14/2023

Detecting human and non-human vocal productions in large scale audio recordings

We propose an automatic data processing pipeline to extract vocal produc...
research
06/24/2016

Fully DNN-based Multi-label regression for audio tagging

Acoustic event detection for content analysis in most cases relies on lo...
research
01/06/2022

Implementing simple spectral denoising for environmental audio recordings

This technical report details changes applied to a noise filter to facil...

Please sign up or login with your details

Forgot password? Click here to reset