Audio Impairment Recognition Using a Correlation-Based Feature Representation

03/22/2020
by Alessandro Ragano, et al.

Audio impairment recognition consists of detecting noise in audio files and categorising the impairment type. Recently, advanced deep learning models have brought significant performance improvements. However, feature robustness remains an unresolved issue, and it is one of the main reasons why powerful deep learning architectures are needed. In the presence of a variety of musical styles, hand-crafted features are less efficient at capturing audio degradation characteristics: they are prone to failure when recognising audio impairments and could lead a model to learn musical concepts rather than impairment types. In this paper, we propose a new representation of hand-crafted features that is based on the correlation of feature pairs. We experimentally compare the proposed correlation-based feature representation with a typical raw feature representation used in machine learning, and we show that it offers a more compact feature dimensionality and faster computation in the test stage whilst achieving comparable accuracy.
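The abstract does not say how the correlation of feature pairs is computed, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes a clip is described by a matrix of frame-level hand-crafted features (rows are frames, columns are features), computes the Pearson correlation between every pair of feature columns, and keeps the upper triangle as a fixed-length vector. The function name `correlation_representation` and the input layout are hypothetical.

```python
import numpy as np


def correlation_representation(frames: np.ndarray) -> np.ndarray:
    """Map frame-level features to a vector of pairwise feature correlations.

    frames: array of shape (n_frames, n_features); this layout is an assumption.
    Returns a vector of length n_features * (n_features - 1) / 2.
    """
    n_features = frames.shape[1]
    # Pearson correlation between feature columns (rowvar=False: columns are variables).
    with np.errstate(invalid="ignore", divide="ignore"):
        corr = np.corrcoef(frames, rowvar=False)
    # A constant feature has zero variance and yields NaN; treat it as uncorrelated.
    corr = np.nan_to_num(corr, nan=0.0)
    # Keep each unordered pair once and drop the diagonal of ones.
    rows, cols = np.triu_indices(n_features, k=1)
    return corr[rows, cols]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clip = rng.normal(size=(500, 20))  # synthetic stand-in for 500 frames x 20 features
    print(correlation_representation(clip).shape)
```

Under this reading, the length of the output depends only on the number of features and not on the clip duration, which is one way a correlation-based representation could end up more compact than the raw frame-level features.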

research
06/15/2020

COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations

Audio representation learning based on deep neural networks (DNNs) emerg...
research
04/26/2016

An Enhanced Deep Feature Representation for Person Re-identification

Feature representation and metric learning are two critical components i...
research
10/06/2021

An Investigation of the Effectiveness of Phase for Audio Classification

While log-amplitude mel-spectrogram has widely been used as the feature ...
research
10/19/2022

Audio Tampering Detection Based on Shallow and Deep Feature Representation Learning

Digital audio tampering detection can be used to verify the authenticity...
research
07/23/2015

Deep Fishing: Gradient Features from Deep Nets

Convolutional Networks (ConvNets) have recently improved image recogniti...
research
02/27/2015

Plagiarism Detection in Polyphonic Music using Monaural Signal Separation

Given the large number of new musical tracks released each year, automat...
research
06/10/2018

Instance Search via Instance Level Segmentation and Feature Representation

Instance search is an interesting task as well as a challenging issue du...