Scalable Preprocessing of High Volume Bird Acoustic Data

02/02/2018
by Alexander Brown, et al.
In this work, we examine the problem of efficiently preprocessing high-volume bird acoustic data. We examine several existing preprocessing steps individually, including noise reduction approaches, and combine them into a single efficient pipeline. We then use a distributed computing architecture to improve execution time. Using a master-slave model with data parallelisation, we developed an automated system that scales near-linearly and preprocesses bird acoustic recordings 21.76 times faster than a serial process when run on 32 cores across 8 virtual machines. This work contributes to the research area of bioacoustic analysis, which is currently very active because of its potential to monitor animals quickly and at low cost. Overcoming noise interference is a significant challenge in many bioacoustic studies, and the volume of data in these studies is increasing. Our work makes large-scale bird acoustic analyses more feasible by parallelising important bird acoustic processing tasks to significantly reduce execution times.
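
To make the data-parallel idea in the abstract concrete, here is a minimal sketch that assumes a single machine and Python's standard `multiprocessing.Pool` in place of the paper's virtual machines. The function names and the peak-normalisation step are hypothetical placeholders, not the authors' pipeline or their noise reduction method. The last function only restates the reported figures as a parallel efficiency (speedup divided by core count).

```python
from multiprocessing import Pool


def preprocess(samples):
    """Placeholder step (hypothetical): peak-normalise one recording."""
    peak = max((abs(s) for s in samples), default=0.0)
    if peak == 0.0:
        return list(samples)  # silent or empty recording: nothing to scale
    return [s / peak for s in samples]


def preprocess_all(recordings, workers=4):
    """Master: hand each recording to a pool of worker processes.

    Each recording is independent, so the work is split by data, not by task.
    """
    with Pool(processes=workers) as pool:
        return pool.map(preprocess, recordings)


def parallel_efficiency(speedup, cores):
    """Fraction of ideal linear speedup actually achieved."""
    return speedup / cores


if __name__ == "__main__":
    # Toy recordings standing in for audio sample arrays (assumed data).
    recordings = [[0.1, -0.5, 0.25], [0.0, 0.0], [2.0, -4.0, 1.0]]
    print(preprocess_all(recordings, workers=2))
    # Figures reported in the abstract: 21.76x speedup on 32 cores.
    print(round(parallel_efficiency(21.76, 32), 2))
```

With the abstract's figures, the efficiency is 21.76 / 32 = 0.68, meaning about two thirds of ideal linear speedup was achieved at 32 cores.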

