SSL-Net: A Synergistic Spectral and Learning-based Network for Efficient Bird Sound Classification

09/15/2023
by   Yiyuan Yang, et al.

Efficient and accurate bird sound classification is important for ecology, habitat protection, and scientific research, as it plays a central role in monitoring the distribution and abundance of species. However, prevailing methods typically demand extensively labeled audio datasets and rely on highly customized frameworks, imposing substantial computational and annotation loads. In this study, we present an efficient and general framework called SSL-Net, which combines spectral and learned features to identify different bird sounds. Encouraging empirical results from a standard field-collected bird audio dataset validate the efficacy of our method in extracting features efficiently and achieving improved performance in bird sound classification, even when working with limited sample sizes. Furthermore, we present three feature fusion strategies and analyze them quantitatively to help engineers and researchers choose among them.
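To make the idea of combining spectral and learned features concrete, here is a minimal sketch of three generic ways to fuse two feature vectors. The abstract does not name the three fusion strategies used in SSL-Net, so the functions below (`fuse_concat`, `fuse_sum`, `fuse_weighted`), the feature dimensions, and the random projection matrices are all illustrative assumptions, not the authors' method.

```python
import numpy as np

def l2_normalize(x, eps=1e-12):
    """Scale a vector to unit L2 norm so neither feature source dominates."""
    return x / (np.linalg.norm(x) + eps)

def fuse_concat(spectral, learned):
    """Hypothetical fusion 1: normalize each vector, then concatenate."""
    return np.concatenate([l2_normalize(spectral), l2_normalize(learned)])

def fuse_weighted(spectral, learned, w_spec, w_learn, alpha=0.5):
    """Hypothetical fusion 3: project both vectors to a shared dimension
    and blend them with a scalar weight alpha in [0, 1]."""
    return alpha * (w_spec @ spectral) + (1.0 - alpha) * (w_learn @ learned)

def fuse_sum(spectral, learned, w_spec, w_learn):
    """Hypothetical fusion 2: project both vectors to a shared dimension
    and add them element-wise."""
    return w_spec @ spectral + w_learn @ learned

# Placeholder inputs: random vectors stand in for real features.
rng = np.random.default_rng(0)
spectral = rng.standard_normal(128)   # e.g. a pooled spectrogram feature (assumed size)
learned = rng.standard_normal(768)    # e.g. a pretrained-encoder embedding (assumed size)

# Placeholder projections; in a trained model these would be learned layers.
w_spec = rng.standard_normal((256, 128))
w_learn = rng.standard_normal((256, 768))

concat = fuse_concat(spectral, learned)
summed = fuse_sum(spectral, learned, w_spec, w_learn)
weighted = fuse_weighted(spectral, learned, w_spec, w_learn, alpha=0.5)

print(concat.shape, summed.shape, weighted.shape)
```

Concatenation keeps both sources intact at the cost of a larger vector, while the summed and weighted variants need projections to a common dimension before the features can be combined.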


research
01/24/2019

Multi-stream Network With Temporal Attention For Environmental Sound Classification

Environmental sound classification systems often do not perform robustly...
research
01/05/2023

Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks

We present a novel approach to automatically detect and classify great a...
research
07/19/2022

GAFX: A General Audio Feature eXtractor

Most machine learning models for audio tasks are dealing with a handcraf...
research
03/04/2023

A General Framework for Learning Procedural Audio Models of Environmental Sounds

This paper introduces the Procedural (audio) Variational autoEncoder (Pr...
research
06/19/2023

Visually-Guided Sound Source Separation with Audio-Visual Predictive Coding

The framework of visually-guided sound source separation generally consi...
research
12/15/2018

Deep Synthesizer Parameter Estimation

Sound synthesis is a complex field that requires domain expertise. Manua...
research
09/26/2021

Soundata: A Python library for reproducible use of audio datasets

Soundata is a Python library for loading and working with audio datasets...
