Deep learning based cough detection camera using enhanced features

07/28/2021
by   Gyeong-Tae Lee, et al.
0

Coughing is a typical symptom of COVID-19. To detect and localize coughing sounds remotely, a convolutional neural network (CNN) based deep learning model was developed in this work and integrated with a sound camera for the visualization of the cough sounds. The cough detection model is a binary classifier of which the input is a two second acoustic feature and the output is one of two inferences (Cough or Others). Data augmentation was performed on the collected audio files to alleviate class imbalance and reflect various background noises in practical environments. For effective featuring of the cough sound, conventional features such as spectrograms, mel-scaled spectrograms, and mel-frequency cepstral coefficients (MFCC) were reinforced by utilizing their velocity (V) and acceleration (A) maps in this work. VGGNet, GoogLeNet, and ResNet were simplified to binary classifiers, and were named V-net, G-net, and R-net, respectively. To find the best combination of features and networks, training was performed for a total of 39 cases and the performance was confirmed using the test F1 score. Finally, a test F1 score of 91.9 feature (named Spectroflow), an acoustic feature effective for use in cough detection. The trained cough detection model was integrated with a sound camera (i.e., one that visualizes sound sources using a beamforming microphone array). In a pilot test, the cough detection camera detected coughing sounds with an F1 score of 90.0 was tracked in real time.

READ FULL TEXT

page 6

page 7

page 16

page 20

page 23

research
08/02/2018

DCASE 2018 Challenge Surrey Cross-Task convolutional neural network baseline

The Detection and Classification of Acoustic Scenes and Events (DCASE) c...
research
05/12/2020

Classification of Infant Crying in Real-World Home Environments Using Deep Learning

In the domain of social signal processing, automated audio recognition i...
research
02/28/2023

Incremental Learning of Acoustic Scenes and Sound Events

In this paper, we propose a method for incremental learning of two disti...
research
10/21/2020

Automating Abnormality Detection in Musculoskeletal Radiographs through Deep Learning

This paper introduces MuRAD (Musculoskeletal Radiograph Abnormality Dete...
research
06/27/2021

A Machine Learning Model for Early Detection of Diabetic Foot using Thermogram Images

Diabetes foot ulceration (DFU) and amputation are a cause of significant...
research
07/13/2019

Towards Robust Voice Pathology Detection

Automatic objective non-invasive detection of pathological voice based o...

Please sign up or login with your details

Forgot password? Click here to reset