Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings

02/13/2022
by   Nadee Seneviratne, et al.
0

Multimodal depression classification has gained immense popularity over the recent years. We develop a multimodal depression classification system using articulatory coordination features extracted from vocal tract variables and text transcriptions obtained from an automatic speech recognition tool that yields improvements of area under the receiver operating characteristics curve compared to uni-modal classifiers (7.5 respectively). We show that in the case of limited training data, a segment-level classifier can first be trained to then obtain a session-wise prediction without hindering the performance, using a multi-stage convolutional recurrent neural network. A text model is trained using a Hierarchical Attention Network (HAN). The multimodal system is developed by combining embeddings from the session-level audio model and the HAN text model

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/09/2021

Speech based Depression Severity Level Classification Using a Multi-Stage Dilated CNN-LSTM Model

Speech based depression classification has gained immense popularity ove...
research
10/20/2020

Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction

We present a novel multi-modal unspoken punctuation prediction system fo...
research
02/26/2023

Efficient Ensemble Architecture for Multimodal Acoustic and Textual Embeddings in Punctuation Restoration using Time-Delay Neural Networks

Punctuation restoration plays an essential role in the post-processing p...
research
09/03/2019

Multimodal Deep Learning for Mental Disorders Prediction from Audio Speech Samples

Key features of mental illnesses are reflected in speech. Our research f...
research
05/02/2020

MultiQT: Multimodal Learning for Real-Time Question Tracking in Speech

We address a challenging and practical task of labeling questions in spe...
research
09/07/2021

Predicting Mood Disorder Symptoms with Remotely Collected Videos Using an Interpretable Multimodal Dynamic Attention Fusion Network

We developed a novel, interpretable multimodal classification method to ...
research
11/13/2020

Deep Learning Based Generalized Models for Depression Classification

Depression detection using vocal biomarkers is a highly researched area....

Please sign up or login with your details

Forgot password? Click here to reset