Are you wearing a mask? Improving mask detection from speech using augmentation by cycle-consistent GANs

06/17/2020
by   Nicolae-Catalin Ristea, et al.

Detecting from speech whether a person is wearing a face mask is useful for modelling speech in forensic investigations and in communication between surgeons or between people protecting themselves against infectious diseases such as COVID-19. In this paper, we propose a novel data augmentation approach for mask detection from speech. Our approach is based on (i) training Generative Adversarial Networks (GANs) with cycle-consistency loss to translate unpaired utterances between two classes (with mask and without mask), and (ii) generating new training utterances using the cycle-consistent GANs, assigning the opposite label to each translated utterance. Original and translated utterances are converted into spectrograms, which are provided as input to a set of ResNet neural networks of various depths. The networks are combined into an ensemble through a Support Vector Machine (SVM) classifier. With this system, we participated in the Mask Sub-Challenge (MSC) of the INTERSPEECH 2020 Computational Paralinguistics Challenge, surpassing the baseline proposed by the organizers by 2.8%. The data augmentation provides a boost of 0.9%, and our augmentation approach yields better results than other baseline and state-of-the-art augmentation methods.
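The augmentation step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the two translators here are toy scaling functions standing in for trained GAN generators, spectrograms are flattened lists of floats, and the cycle-consistency term uses the mean absolute error common in CycleGAN-style training, which the abstract does not specify. All names (`augment_with_translation`, `toy_to_mask`, and so on) are hypothetical.

```python
from typing import Callable, List, Tuple

Spectrogram = List[float]  # flattened stand-in for a real 2-D spectrogram
Translator = Callable[[Spectrogram], Spectrogram]
Example = Tuple[Spectrogram, int]

MASK, NO_MASK = 1, 0


def cycle_consistency_loss(
    x: Spectrogram, forward: Translator, backward: Translator
) -> float:
    """Mean absolute error between x and backward(forward(x)).

    Assumption: an L1 reconstruction penalty, as in CycleGAN-style training.
    """
    reconstructed = backward(forward(x))
    return sum(abs(a - b) for a, b in zip(x, reconstructed)) / len(x)


def augment_with_translation(
    data: List[Example], to_mask: Translator, to_no_mask: Translator
) -> List[Example]:
    """Keep every original example and append its translation to the
    other class, labelled with the opposite class."""
    augmented = list(data)
    for x, label in data:
        if label == NO_MASK:
            augmented.append((to_mask(x), MASK))
        else:
            augmented.append((to_no_mask(x), NO_MASK))
    return augmented


# Toy translators (hypothetical): real ones would be trained generators.
def toy_to_mask(x: Spectrogram) -> Spectrogram:
    return [0.5 * v for v in x]


def toy_to_no_mask(x: Spectrogram) -> Spectrogram:
    return [2.0 * v for v in x]


train = [([1.0, 2.0], NO_MASK), ([0.5, 0.25], MASK)]
augmented = augment_with_translation(train, toy_to_mask, toy_to_no_mask)
```

In this sketch the training set doubles in size, and each translated example carries the label of the class it was translated into. The augmented spectrograms would then be fed to the classifiers alongside the originals.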

research
08/12/2020

Mask Detection and Breath Monitoring from Speech: on Data Augmentation, Feature Representation and Modeling

This paper introduces our approaches for the Mask and Breathing Sub-Chal...
research
01/10/2019

Data Augmentation of Room Classifiers using Generative Adversarial Networks

The classification of acoustic environments allows for machines to bette...
research
01/16/2021

Adversarial cycle-consistent synthesis of cerebral microbleeds for data augmentation

We propose a novel framework for controllable pathological image synthes...
research
01/05/2021

CycleGAN for Interpretable Online EMT Compensation

Purpose: Electromagnetic Tracking (EMT) can partially replace X-ray guid...
research
03/22/2022

Mask Usage Recognition using Vision Transformer with Transfer Learning and Data Augmentation

The COVID-19 pandemic has disrupted various levels of society. The use o...
research
04/02/2021

Data Augmentation with Manifold Barycenters

The training of Generative Adversarial Networks (GANs) requires a large ...
