Toward Real-World Pathological Voice Detection

12/05/2021
by   Heng-Cheng Kuo, et al.
0

Voice disorders significantly undermine people's ability to speak in their daily lives. Without early diagnoses and treatments, these disorders may drastically deteriorate. Thus, automatic detection systems at home are desired for people inaccessible to disease assessments. However, more accurate systems usually require more cumbersome machine learning models, whereas the memory and computational resources of the systems at home are limited. Moreover, the performance of the systems may be weakened due to domain mismatch between clinic and real-world data. Therefore, we aimed to develop a compressed and domain-robust pathological voice detection system. Domain adversarial training was utilized to address domain mismatch by extracting domain-invariant features. In addition, factorized convolutional neural networks were exploited to compress the feature extractor model. The results showed that only 4 degradation of unweighted average recall occurred in the target domain compared to the source domain, indicating that the domain mismatch was effectively eliminated. Furthermore, our system reduced both usages of memory and computation by over 73.9 resolved domain mismatch and may be applicable to embedded systems at home with limited resources.

READ FULL TEXT
research
11/26/2018

Robustness against the channel effect in pathological voice detection

Many people are suffering from voice disorders, which can adversely affe...
research
06/07/2018

Domain Adversarial Training for Accented Speech Recognition

In this paper, we propose a domain adversarial training (DAT) algorithm ...
research
12/20/2020

Domain-adaptive Fall Detection Using Deep Adversarial Training

Fall detection (FD) systems are important assistive technologies for hea...
research
05/28/2019

Automatic Quality Control and Enhancement for Voice-Based Remote Parkinson's Disease Detection

The performance of voice-based Parkinson's disease (PD) detection system...
research
07/19/2018

Noise Adaptive Speech Enhancement using Domain Adversarial Training

In this study, we propose a novel noise adaptive speech enhancement (SE)...
research
05/28/2023

Cross-Domain Policy Adaptation via Value-Guided Data Filtering

Generalizing policies across different domains with dynamics mismatch po...
research
03/12/2023

Color Mismatches in Stereoscopic Video: Real-World Dataset and Deep Correction Method

We propose a real-world dataset of stereoscopic videos for color-mismatc...

Please sign up or login with your details

Forgot password? Click here to reset