Surgical Mask Detection with Convolutional Neural Networks and Data Augmentations on Spectrograms

08/11/2020
by Steffen Illium, et al.

In many fields of research, labeled datasets are hard to acquire. Data augmentation promises to compensate for this lack of training data when engineering neural networks for classification tasks. The idea is to reduce model overfitting to the feature distribution of a small, under-descriptive training dataset. We evaluate such data augmentation techniques to gain insight into the performance boost they provide for several convolutional neural networks on mel-spectrogram representations of audio data. We show the impact of data augmentation on the binary classification task of surgical mask detection in samples of human voice (ComParE Challenge 2020). We also consider four different architectures to account for augmentation robustness. Results show that most of the baselines given by ComParE are outperformed.
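To make the idea of augmenting spectrograms concrete, the following is a minimal sketch of three commonly used perturbations: a random time shift, additive noise, and masking of a frequency band. The abstract does not name the specific augmentations the authors evaluated, so the function `augment_spectrogram`, its parameters, and the default values are illustrative assumptions, not the paper's method. The input is a random array standing in for a real mel-spectrogram.

```python
import numpy as np


def augment_spectrogram(spec, rng, max_shift=8, noise_std=0.05, max_mask=4):
    """Return a randomly perturbed copy of a (n_mels, n_frames) array.

    Hypothetical helper for illustration; all defaults are assumptions.
    """
    out = spec.astype(np.float64, copy=True)

    # Circular shift along the time axis by a random number of frames.
    shift = int(rng.integers(-max_shift, max_shift + 1))
    out = np.roll(out, shift, axis=1)

    # Additive Gaussian noise on every time-frequency bin.
    out += rng.normal(0.0, noise_std, size=out.shape)

    # Zero out a random band of mel bins (frequency masking).
    width = int(rng.integers(0, max_mask + 1))
    start = int(rng.integers(0, out.shape[0] - width + 1))
    out[start:start + width, :] = 0.0
    return out


rng = np.random.default_rng(0)
spec = rng.random((64, 100))  # stand-in for a real mel-spectrogram
augmented = augment_spectrogram(spec, rng)
```

Applying a function like this to each training sample on every epoch gives the network a slightly different view of the same recording each time, which is how augmentation is meant to counteract overfitting on a small dataset. Labels are left unchanged, on the assumption that these perturbations do not alter whether the speaker wears a mask.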

research
08/12/2020

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

This paper introduces our approaches for the Mask and Breathing Sub-Chal...
research
05/02/2022

Assessing unconstrained surgical cuttings in VR using CNNs

We present a Convolutional Neural Network (CNN) suitable to assess uncon...
research
06/28/2023

Improving Primate Sounds Classification using Binary Presorting for Deep Learning

In the field of wildlife observation and conservation, approaches involv...
research
04/14/2023

1-D Residual Convolutional Neural Network coupled with Data Augmentation and Regularization Techniques for the ICPHM 2023 Data Challenge

In this article, we present our contribution to the ICPHM 2023 Data Chal...
research
08/16/2021

IADA: Iterative Adversarial Data Augmentation Using Formal Verification and Expert Guidance

Neural networks (NNs) are widely used for classification tasks for their...
research
04/16/2021

Data Augmentation for Voice-Assistant NLU using BERT-based Interchangeable Rephrase

We introduce a data augmentation technique based on byte pair encoding a...
research
02/23/2022

Image Classification on Small Datasets via Masked Feature Mixing

Deep convolutional neural networks require large amounts of labeled data...
