Multi-level Attention network using text, audio and video for Depression Prediction

09/03/2019
by   Anupama Ray, et al.
0

Depression has been the leading cause of mental-health illness worldwide. Major depressive disorder (MDD), is a common mental health disorder that affects both psychologically as well as physically which could lead to loss of lives. Due to the lack of diagnostic tests and subjectivity involved in detecting depression, there is a growing interest in using behavioural cues to automate depression diagnosis and stage prediction. The absence of labelled behavioural datasets for such problems and the huge amount of variations possible in behaviour makes the problem more challenging. This paper presents a novel multi-level attention based network for multi-modal depression prediction that fuses features from audio, video and text modalities while learning the intra and inter modality relevance. The multi-level attention reinforces overall learning by selecting the most influential features within each modality for the decision making. We perform exhaustive experimentation to create different regression models for audio, video and text modalities. Several fusions models with different configurations are constructed to understand the impact of each feature and modality. We outperform the current baseline by 17.52

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2020

Multimodal Depression Severity Prediction from medical bio-markers using Machine Learning Tools and Technologies

Depression has been a leading cause of mental-health illnesses across th...
research
03/06/2018

Multi-level Attention Model for Weakly Supervised Audio Classification

In this paper, we propose a multi-level attention model to solve the wea...
research
05/04/2023

Learning Missing Modal Electronic Health Records with Unified Multi-modal Data Embedding and Modality-Aware Attention

Electronic Health Record (EHR) provides abundant information through var...
research
05/02/2018

Investigating Audio, Visual, and Text Fusion Methods for End-to-End Automatic Personality Prediction

We propose a tri-modal architecture to predict Big Five personality trai...
research
06/23/2022

DeepSafety:Multi-level Audio-Text Feature Extraction and Fusion Approach for Violence Detection in Conversations

Natural Language Processing has recently made understanding human intera...
research
11/30/2020

Depression Status Estimation by Deep Learning based Hybrid Multi-Modal Fusion Model

Preliminary detection of mild depression could immensely help in effecti...
research
07/24/2022

On the Validation of Multi-Level Personalised Health Condition Model

This paper presents a verification-based methodology to validate the mod...

Please sign up or login with your details

Forgot password? Click here to reset