Deep Learning for Suicide and Depression Identification with Unsupervised Label Correction

02/18/2021
by   Ayaan Haque, et al.
0

Early detection of suicidal ideation in depressed individuals can allow for adequate medical attention and support, which in many cases is life-saving. Recent NLP research focuses on classifying, from a given piece of text, if an individual is suicidal or clinically healthy. However, there have been no major attempts to differentiate between depression and suicidal ideation, which is an important clinical challenge. Due to the scarce availability of EHR data, suicide notes, or other similar verified sources, web query data has emerged as a promising alternative. Online sources, such as Reddit, allow for anonymity that prompts honest disclosure of symptoms, making it a plausible source even in a clinical setting. However, these online datasets also result in lower performance, which can be attributed to the inherent noise in web-scraped labels, which necessitates a noise-removal process. Thus, we propose SDCNL, a suicide versus depression classification method through a deep learning approach. We utilize online content from Reddit to train our algorithm, and to verify and correct noisy labels, we propose a novel unsupervised label correction method which, unlike previous work, does not require prior noise distribution information. Our extensive experimentation with multiple deep word embedding models and classifiers display the strong performance of the method in anew, challenging classification application. We make our code and dataset available at https://github.com/ayaanzhaque/SDCNL

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/02/2022

SELC: Self-Ensemble Label Correction Improves Learning with Noisy Labels

Deep neural networks are prone to overfitting noisy labels, resulting in...
research
12/05/2021

Hard Sample Aware Noise Robust Learning for Histopathology Image Classification

Deep learning-based histopathology image classification is a key techniq...
research
10/14/2020

Deep Learning from Small Amount of Medical Data with Noisy Labels: A Meta-Learning Approach

Computer vision systems recently made a big leap thanks to deep neural n...
research
02/28/2022

Inkorrect: Online Handwriting Spelling Correction

We introduce Inkorrect, a data- and label-efficient approach for online ...
research
01/30/2021

Learning From How Human Correct

In industry NLP application, our manually labeled data has a certain num...
research
03/15/2020

NoiseRank: Unsupervised Label Noise Reduction with Dependence Models

Label noise is increasingly prevalent in datasets acquired from noisy ch...
research
04/25/2019

Unsupervised label noise modeling and loss correction

Despite being robust to small amounts of label noise, convolutional neur...

Please sign up or login with your details

Forgot password? Click here to reset