Robustness against the channel effect in pathological voice detection

11/26/2018
by Yi-Te Hsu, et al.

Many people suffer from voice disorders, which can adversely affect their quality of life. In response, researchers have proposed algorithms that automatically assess these disorders from voice signals. However, these signals are sensitive to the recording device. Indeed, the channel effect is a pervasive problem in machine learning for healthcare. In this study, we propose a pathological voice detection system that is robust against the channel effect. The system is based on a bidirectional LSTM network. To make its performance robust to channel mismatch, we integrate domain adversarial training (DAT) to eliminate the differences between devices. When we train on data recorded with a high-quality microphone and evaluate on smartphone data, using no labels for the smartphone samples, our robust detection system increases the PR-AUC from 0.8448 to 0.9455 (and to 0.9522 when target sample labels are used). To the best of our knowledge, this is the first study to apply unsupervised domain adaptation to pathological voice detection. Notably, our system does not need labels for target device samples, which allows it to generalize to many new devices.
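
The abstract names domain adversarial training but does not say how it is implemented. A common way to realize DAT is a gradient reversal layer: the domain classifier is trained normally to tell devices apart, while the gradient flowing back into the feature extractor is negated, pushing the features toward being device-invariant. The following is a minimal sketch of that idea on a toy scalar model, not the authors' bidirectional LSTM system. The function name `domain_gradients`, the scalar weights `w` and `v`, and the scaling factor `lam` are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    """Logistic function."""
    return 1.0 / (1.0 + math.exp(-z))


def domain_gradients(x, d, w, v, lam):
    """Gradients of the domain loss for one sample under gradient reversal.

    Toy model (an assumption, not the paper's architecture):
      feature            f = w * x
      domain classifier  p = sigmoid(v * f)
      domain loss        binary cross-entropy between p and the label d
                         (d = 0 for the source device, d = 1 for the target)
    Returns (grad_v, grad_w).
    """
    f = w * x
    p = sigmoid(v * f)
    dloss_dlogit = p - d            # derivative of cross-entropy w.r.t. the logit
    grad_v = dloss_dlogit * f       # domain classifier receives the ordinary gradient
    grad_f = dloss_dlogit * v       # gradient arriving at the feature
    grad_w = -lam * grad_f * x      # reversal: sign flipped and scaled by lam
    return grad_v, grad_w


# One target-device sample (d = 1) with illustrative values.
grad_v, grad_w = domain_gradients(x=2.0, d=1.0, w=0.5, v=1.0, lam=1.0)
print(grad_v, grad_w)
```

A gradient descent step with `grad_v` lowers the domain loss, whereas the same step with `grad_w` raises it, which is the adversarial part. In a full system the label predictor's gradient would be added to `grad_w` unchanged, so the features stay useful for detecting pathology while carrying less information about the device.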

research
11/16/2019

VOICe: A Sound Event Detection Dataset For Generalizable Domain Adaptation

The performance of sound event detection methods can significantly degra...
research
12/05/2021

Toward Real-World Pathological Voice Detection

Voice disorders significantly undermine people's ability to speak in the...
research
11/05/2020

A Comparison Study on Infant-Parent Voice Diarization

We design a framework for studying prelinguistic child voice from 3 to 24...
research
06/04/2018

Revisiting Singing Voice Detection: a Quantitative Review and the Future Outlook

Since the vocal component plays a crucial role in popular music, singing...
research
11/17/2022

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection

Approximately 1.2 As a result, automatic dysphonic voice detection has a...
research
05/14/2022

Integration of Text and Graph-based Features for Detecting Mental Health Disorders from Voice

With the availability of voice-enabled devices such as smart phones, men...
research
06/05/2021

Lightweight Dual-channel Target Speaker Separation for Mobile Voice Communication

Nowadays, there is a strong need to deploy the target speaker separation...