Parameterization of Sequence of MFCCs for DNN-based voice disorder detection

12/14/2018
by   Tomasz Grzywalski, et al.
0

In this article a DNN-based system for detection of three common voice disorders (vocal nodules, polyps and cysts; laryngeal neoplasm; unilateral vocal paralysis) is presented. The input to the algorithm is (at least 3-second long) audio recording of sustained vowel sound /a:/. The algorithm was developed as part of the "2018 FEMH Voice Data Challenge" organized by Far Eastern Memorial Hospital and obtained score value (defined in the challenge specification) of 77.44. This was the second best result before final submission. Final challenge results are not yet known during writing of this document. The document also reports changes that were made for the final submission which improved the score value in cross-validation by 0.6

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/14/2022

Intel Labs at Ego4D Challenge 2022: A Better Baseline for Audio-Visual Diarization

This report describes our approach for the Audio-Visual Diarization (AVD...
research
07/12/2022

NEC: Speaker Selective Cancellation via Neural Enhanced Ultrasound Shadowing

In this paper, we propose NEC (Neural Enhanced Cancellation), a defense ...
research
02/24/2021

Deep Learning Approach for Singer Voice Classification of Vietnamese Popular Music

Singer voice classification is a meaningful task in the digital era. Wit...
research
04/18/2023

A Voice Disease Detection Method Based on MFCCs and Shallow CNN

The incidence rate of voice diseases is increasing year by year. The use...
research
06/26/2023

The Singing Voice Conversion Challenge 2023

We present the latest iteration of the voice conversion challenge (VCC) ...
research
11/21/2021

Automatic Detection of Depression from Stratified Samples of Audio Data

Depression is a common mental disorder which has been affecting millions...
research
06/27/2023

TranssionADD: A multi-frame reinforcement based sequence tagging model for audio deepfake detection

Thanks to recent advancements in end-to-end speech modeling technology, ...

Please sign up or login with your details

Forgot password? Click here to reset