Deep Feature Learning for Medical Acoustics

08/05/2022
by Alessandro Maria Poirè, et al.

This paper compares learnable frontends on medical acoustics tasks. We implemented a framework that classifies human respiratory sounds and heartbeats into two categories: healthy or affected by pathologies. After obtaining two suitable datasets, we classified the sounds using two state-of-the-art learnable frontends, LEAF and nnAudio, plus a non-learnable baseline frontend, Mel-filterbanks. The computed features are then fed into two different CNN models, namely VGG16 and EfficientNet. The frontends are benchmarked in terms of number of parameters, computational resources, and effectiveness. This work demonstrates that integrating learnable frontends into neural audio classification systems may improve performance, especially in the field of medical acoustics. However, such frontends increase the amount of training data required. Consequently, they are useful when the data available for training is large enough to support the feature learning process.
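
To make the non-learnable baseline concrete, the sketch below builds a fixed triangular Mel filterbank and applies it to one power-spectrum frame. It is an illustration only, not the paper's implementation: the function names, the HTK-style mel formula, and the example values (4000 Hz sample rate, 256-point FFT, 40 bands) are all assumptions chosen for the example. A learnable frontend would replace these hand-set filter weights with parameters trained jointly with the classifier.

```python
import math


def hz_to_mel(f_hz):
    # HTK-style mel scale (one common convention; assumed here).
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(sample_rate, n_fft, n_mels):
    """Return n_mels triangular filters over the n_fft // 2 + 1 FFT bins."""
    n_bins = n_fft // 2 + 1
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    mel_max = hz_to_mel(sample_rate / 2.0)
    # n_mels + 2 edge frequencies, evenly spaced on the mel scale.
    edges = [mel_to_hz(i * mel_max / (n_mels + 1)) for i in range(n_mels + 2)]
    bank = []
    for i in range(n_mels):
        lo, center, hi = edges[i], edges[i + 1], edges[i + 2]
        # Each filter rises linearly from lo to center, then falls to hi.
        row = [
            max(0.0, min((f - lo) / (center - lo), (hi - f) / (hi - center)))
            for f in bin_hz
        ]
        bank.append(row)
    return bank


def log_mel_frame(bank, power_spectrum, eps=1e-6):
    """Apply the fixed filters to one power-spectrum frame and take the log."""
    return [
        math.log(sum(w * p for w, p in zip(row, power_spectrum)) + eps)
        for row in bank
    ]


# Hypothetical settings for low-bandwidth body sounds (not from the paper).
bank = mel_filterbank(sample_rate=4000, n_fft=256, n_mels=40)
features = log_mel_frame(bank, [1.0] * 129)
```

Stacking the output of `log_mel_frame` over successive frames gives the two-dimensional log-Mel representation that a CNN can consume as an image-like input.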

research
01/21/2021

LEAF: A Learnable Frontend for Audio Classification

Mel-filterbanks are fixed, engineered audio features which emulate human...
research
04/26/2023

Learnable Ophthalmology SAM

Segmentation is vital for ophthalmology image analysis. But its various ...
research
12/02/2022

Learning Disentangled Label Representations for Multi-label Classification

Although various methods have been proposed for multi-label classificati...
research
05/20/2022

Advanced Feature Learning on Point Clouds using Multi-resolution Features and Learnable Pooling

Existing point cloud feature learning networks often incorporate sequenc...
research
06/01/2023

Dilated Convolution with Learnable Spacings: beyond bilinear interpolation

Dilated Convolution with Learnable Spacings (DCLS) is a recently propose...
research
02/09/2021

Deep Multilabel CNN for Forensic Footwear Impression Descriptor Identification

In recent years deep neural networks have become the workhorse of comput...
research
07/12/2022

EfficientLEAF: A Faster LEarnable Audio Frontend of Questionable Use

In audio classification, differentiable auditory filterbanks with few pa...
