Voice Pathology Detection Using Deep Learning: a Preliminary Study

07/12/2019
by   Pavol Harar, et al.
0

This paper describes a preliminary investigation of Voice Pathology Detection using Deep Neural Networks (DNN). We used voice recordings of sustained vowel /a/ produced at normal pitch from German corpus Saarbruecken Voice Database (SVD). This corpus contains voice recordings and electroglottograph signals of more than 2 000 speakers. The idea behind this experiment is the use of convolutional layers in combination with recurrent Long-Short-Term-Memory (LSTM) layers on raw audio signal. Each recording was split into 64 ms Hamming windowed segments with 30 ms overlap. Our trained model achieved 71.36 accuracy with 65.04 and 68.08 testing files. This is a promising result in favor of this approach because it is comparable to similar previously published experiment that used different methodology. Further investigation is needed to achieve the state-of-the-art results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/04/2018

Voice Disorder Detection Using Long Short Term Memory (LSTM) Model

Automated detection of voice disorders with computational methods is a r...
research
09/08/2022

Developing a multi-variate prediction model for the detection of COVID-19 from Crowd-sourced Respiratory Voice Data

COVID-19 has affected more than 223 countries worldwide. There is a pres...
research
01/29/2016

Lipreading with Long Short-Term Memory

Lipreading, i.e. speech recognition from visual-only recordings of a spe...
research
12/12/2021

Learning Nigerian accent embeddings from speech: preliminary results based on SautiDB-Naija corpus

This paper describes foundational efforts with SautiDB-Naija, a novel co...
research
09/20/2018

LSTM-based Whisper Detection

This article presents a whisper speech detector in the far-field domain....
research
02/22/2022

Continuous Speech for Improved Learning Pathological Voice Disorders

Goal: Numerous studies had successfully differentiated normal and abnorm...
research
07/13/2019

Towards Robust Voice Pathology Detection

Automatic objective non-invasive detection of pathological voice based o...

Please sign up or login with your details

Forgot password? Click here to reset