Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection

11/17/2022
by   Jianwei Zhang, et al.
0

Approximately 1.2 As a result, automatic dysphonic voice detection has attracted considerable academic and clinical interest. However, existing methods for automated voice assessment often fail to generalize outside the training conditions or to other related applications. In this paper, we propose a deep learning framework for generating acoustic feature embeddings sensitive to vocal quality and robust across different corpora. A contrastive loss is combined with a classification loss to train our deep learning model jointly. Data warping methods are used on input voice samples to improve the robustness of our method. Empirical results demonstrate that our method not only achieves high in-corpus and cross-corpus classification accuracy but also generates good embeddings sensitive to voice quality and robust across different corpora. We also compare our results against three baseline methods on clean and three variations of deteriorated in-corpus and cross-corpus datasets and demonstrate that the proposed model consistently outperforms the baseline methods.

READ FULL TEXT
research
06/04/2020

PJS: phoneme-balanced Japanese singing voice corpus

This paper presents a free Japanese singing voice corpus that can be use...
research
05/16/2023

Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion

Nowadays, recognition-synthesis-based methods have been quite popular wi...
research
01/20/2020

JVS-MuSiC: Japanese multispeaker singing-voice corpus

Thanks to developments in machine learning techniques, it has become pos...
research
12/20/2021

Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus

High-fidelity multi-singer singing voice synthesis is challenging for ne...
research
04/07/2022

Detecting Vocal Fatigue with Neural Embeddings

Vocal fatigue refers to the feeling of tiredness and weakness of voice d...
research
11/26/2018

Robustness against the channel effect in pathological voice detection

Many people are suffering from voice disorders, which can adversely affe...
research
06/16/2023

Cross-corpus Readability Compatibility Assessment for English Texts

Text readability assessment has gained significant attention from resear...

Please sign up or login with your details

Forgot password? Click here to reset