Diverse Neural Audio Embeddings – Bringing Features back !

09/15/2023
by   Prateek Verma, et al.
0

With the advent of modern AI architectures, a shift has happened towards end-to-end architectures. This pivot has led to neural architectures being trained without domain-specific biases/knowledge, optimized according to the task. We in this paper, learn audio embeddings via diverse feature representations, in this case, domain-specific. For the case of audio classification over hundreds of categories of sound, we learn robust separate embeddings for diverse audio properties such as pitch, timbre, and neural representation, along with also learning it via an end-to-end architecture. We observe handcrafted embeddings, e.g., pitch and timbre-based, although on their own, are not able to beat a fully end-to-end representation, yet adding these together with end-to-end embedding helps us, significantly improve performance. This work would pave the way to bring some domain expertise with end-to-end models to learn robust, diverse representations, surpassing the performance of just training end-to-end models.

READ FULL TEXT
research
04/25/2022

End-to-End Audio Strikes Back: Boosting Augmentations Towards An Efficient Audio Classification Network

While efficient architectures and a plethora of augmentations for end-to...
research
04/18/2019

End-to-End Environmental Sound Classification using a 1D Convolutional Neural Network

In this paper, we present an end-to-end approach for environmental sound...
research
12/01/2017

Utilizing Domain Knowledge in End-to-End Audio Processing

End-to-end neural network based approaches to audio modelling are genera...
research
07/20/2021

PERSA+: A Deep Learning Front-End for Context-Agnostic Audio Classification

Deep learning has been applied to diverse audio semantics tasks, enablin...
research
03/20/2022

A Study on Robustness to Perturbations for Representations of Environmental Sound

Audio applications involving environmental sound analysis increasingly u...
research
04/26/2017

Limits of End-to-End Learning

End-to-end learning refers to training a possibly complex learning syste...

Please sign up or login with your details

Forgot password? Click here to reset