Classification of Speech with and without Face Mask using Acoustic Features

10/08/2020
by   Rohan Kumar Das, et al.
0

The understanding and interpretation of speech can be affected by various external factors. The use of face masks is one such factors that can create obstruction to speech while communicating. This may lead to degradation of speech processing and affect humans perceptually. Knowing whether a speaker wears a mask may be useful for modeling speech for different applications. With this motivation, finding whether a speaker wears face mask from a given speech is included as a task in Computational Paralinguistics Evaluation (ComParE) 2020. We study novel acoustic features based on linear filterbanks, instantaneous phase and long-term information that can capture the artifacts for classification of speech with and without face mask. These acoustic features are used along with the state-of-the-art baselines of ComParE functionals, bag-of-audio-words, DeepSpectrum and auDeep features for ComParE 2020. The studies reveal the effectiveness of acoustic features, and their score level fusion with the ComParE 2020 baselines leads to an unweighted average recall of 73.50

READ FULL TEXT
research
08/17/2020

Do face masks introduce bias in speech technologies? The case of automated scoring of speaking proficiency

The COVID-19 pandemic has led to a dramatic increase in the use of face ...
research
08/07/2020

Applying Speech Tempo-Derived Features, BoAW and Fisher Vectors to Detect Elderly Emotion and Speech in Surgical Masks

The 2020 INTERSPEECH Computational Paralinguistics Challenge (ComParE) c...
research
08/11/2020

Acoustic effects of medical, cloth, and transparent face masks on speech signals

Face masks muffle speech and make communication more difficult, especial...
research
06/29/2021

FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Methods for modeling and controlling prosody with acoustic features have...
research
02/24/2016

Accent Classification with Phonetic Vowel Representation

Previous accent classification research focused mainly on detecting acce...
research
08/30/2021

Speaker-Conditioned Hierarchical Modeling for Automated Speech Scoring

Automatic Speech Scoring (ASS) is the computer-assisted evaluation of a ...

Please sign up or login with your details

Forgot password? Click here to reset