XAI-based Comparison of Input Representations for Audio Event Classification

04/27/2023
by   Annika Frommholz, et al.
0

Deep neural networks are a promising tool for Audio Event Classification. In contrast to other data like natural images, there are many sensible and non-obvious representations for audio data, which could serve as input to these models. Due to their black-box nature, the effect of different input representations has so far mostly been investigated by measuring classification performance. In this work, we leverage eXplainable AI (XAI), to understand the underlying classification strategies of models trained on different input representations. Specifically, we compare two model architectures with regard to relevant input features used for Audio Event Detection: one directly processes the signal as the raw waveform, and the other takes in its time-frequency spectrogram representation. We show how relevance heatmaps obtained via "Siren"Layer-wise Relevance Propagation uncover representation-dependent decision strategies. With these insights, we can make a well-informed decision about the best input representation in terms of robustness and representativity and confirm that the model's classification strategies align with human requirements.

READ FULL TEXT

page 3

page 6

research
04/08/2019

Audio Classification of Bit-Representation Waveform

This paper investigates waveform representation for audio signal classif...
research
07/09/2018

Interpreting and Explaining Deep Neural Networks for Classification of Audio Signals

Interpretability of deep neural networks is a recently emerging area of ...
research
11/19/2021

Interpreting deep urban sound classification using Layer-wise Relevance Propagation

After constructing a deep neural network for urban sound classification,...
research
11/11/2022

Depth and Representation in Vision Models

Deep learning models develop successive representations of their input i...
research
07/30/2021

A Multi-Head Relevance Weighting Framework For Learning Raw Waveform Audio Representations

In this work, we propose a multi-head relevance weighting framework to l...
research
03/11/2023

Explainable AI for Time Series via Virtual Inspection Layers

The field of eXplainable Artificial Intelligence (XAI) has greatly advan...
research
02/23/2023

Fact or Artifact? Revise Layer-wise Relevance Propagation on various ANN Architectures

Layer-wise relevance propagation (LRP) is a widely used and powerful tec...

Please sign up or login with your details

Forgot password? Click here to reset