Toward Knowledge-Driven Speech-Based Models of Depression: Leveraging Spectrotemporal Variations in Speech Vowels

10/05/2022
by   Kexin Feng, et al.

Psychomotor retardation associated with depression has been linked with tangible differences in vowel production. This paper investigates a knowledge-driven machine learning (ML) method that integrates spectrotemporal information of speech at the vowel level to identify depression. Low-level speech descriptors are learned by a convolutional neural network (CNN) that is trained for vowel classification. The temporal evolution of those low-level descriptors is modeled at a high level, within and across utterances, via a long short-term memory (LSTM) model that makes the final depression decision. A modified version of Local Interpretable Model-agnostic Explanations (LIME) is further used to identify the impact of the low-level spectrotemporal vowel variation on the decisions and to observe the high-level temporal change of the depression likelihood. The proposed method outperforms baselines that model the spectrotemporal information in speech without integrating the vowel-based information, as well as ML models trained with conventional prosodic and spectrotemporal features. The explainability analysis indicates that spectrotemporal information corresponding to non-vowel segments is less important than the vowel-based information. Explainability of the high-level information capturing the segment-by-segment decisions is further inspected for participants with and without depression. The findings from this work can provide the foundation for knowledge-driven, interpretable decision-support systems that help clinicians better understand fine-grained temporal changes in speech data, ultimately augmenting mental health diagnosis and care.
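
To make the segment-level explainability idea concrete, the following is a minimal sketch of a generic LIME-style analysis over speech segments. It is not the paper's modified LIME, whose details the abstract does not give. The black box `toy_depression_score`, the vowel layout `SEGMENT_IS_VOWEL`, and the per-segment contributions are all hypothetical stand-ins for a trained CNN-LSTM. The sketch randomly masks segments, queries the black box, and fits a proximity-weighted linear surrogate whose coefficients serve as segment importances.

```python
import numpy as np

# Hypothetical layout: which of six speech segments are vowels (an assumption).
SEGMENT_IS_VOWEL = np.array([True, False, True, False, True, False])

def toy_depression_score(mask):
    """Toy black box standing in for a trained model (an assumption).

    Each kept segment (mask == 1) adds to the score; vowel segments
    are given a larger contribution purely for illustration.
    """
    contrib = np.where(SEGMENT_IS_VOWEL, 0.12, 0.02)
    return 0.1 + float(np.dot(contrib, mask))

def lime_segment_importance(predict, n_segments, n_samples=500,
                            kernel_width=0.75, seed=0):
    """Fit a proximity-weighted linear surrogate over segment on/off masks."""
    rng = np.random.default_rng(seed)
    # Each row is a perturbed input: 1 keeps a segment, 0 masks it out.
    masks = rng.integers(0, 2, size=(n_samples, n_segments))
    masks[0] = 1  # include the unperturbed input
    preds = np.array([predict(m) for m in masks], dtype=float)

    # Proximity to the original input: fraction of segments masked out.
    dist = 1.0 - masks.mean(axis=1)
    weights = np.exp(-(dist ** 2) / kernel_width ** 2)

    # Weighted least squares with an intercept column.
    X = np.hstack([np.ones((n_samples, 1)), masks])
    sw = np.sqrt(weights)
    coef, *_ = np.linalg.lstsq(X * sw[:, None], preds * sw, rcond=None)
    return coef[1:]  # drop the intercept; one importance per segment

importance = lime_segment_importance(toy_depression_score, len(SEGMENT_IS_VOWEL))
vowel_mean = importance[SEGMENT_IS_VOWEL].mean()
non_vowel_mean = importance[~SEGMENT_IS_VOWEL].mean()
```

Because the toy black box is linear in the mask, the surrogate recovers its per-segment contributions, so `vowel_mean` exceeds `non_vowel_mean` by construction. With a real model the surrogate is only a local approximation, and the importances depend on the sampling scheme and kernel width.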
