Multiple Sclerosis Severity Classification From Clinical Text

10/29/2020
by   Alister D Costa, et al.
0

Multiple Sclerosis (MS) is a chronic, inflammatory and degenerative neurological disease, which is monitored by a specialist using the Expanded Disability Status Scale (EDSS) and recorded in unstructured text in the form of a neurology consult note. An EDSS measurement contains an overall "EDSS" score and several functional subscores. Typically, expert knowledge is required to interpret consult notes and generate these scores. Previous approaches used limited context length Word2Vec embeddings and keyword searches to predict scores given a consult note, but often failed when scores were not explicitly stated. In this work, we present MS-BERT, the first publicly available transformer model trained on real clinical data other than MIMIC. Next, we present MSBC, a classifier that applies MS-BERT to generate embeddings and predict EDSS and functional subscores. Lastly, we explore combining MSBC with other models through the use of Snorkel to generate scores for unlabelled consult notes. MSBC achieves state-of-the-art performance on all metrics and prediction tasks and outperforms the models generated from the Snorkel ensemble. We improve Macro-F1 by 0.12 (to 0.88) for predicting EDSS and on average by 0.29 (to 0.63) for predicting functional subscores over previous Word2Vec CNN and rule-based approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/24/2023

Modelling Temporal Document Sequences for Clinical ICD Coding

Past studies on the ICD coding problem focus on predicting clinical code...
research
04/08/2023

Predicting multiple sclerosis disease severity with multimodal deep neural networks

Multiple Sclerosis (MS) is a chronic disease developed in human brain an...
research
08/24/2020

Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III

This paper achieves state of the art results for the ICD code prediction...
research
06/16/2023

Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes

We aimed to investigate the impact of social circumstances on cancer the...
research
04/22/2020

Supervised Grapheme-to-Phoneme Conversion of Orthographic Schwas in Hindi and Punjabi

Hindi grapheme-to-phoneme (G2P) conversion is mostly trivial, with one e...
research
04/17/2021

Hierarchical Transformer Networks for Longitudinal Clinical Document Classification

We present the Hierarchical Transformer Networks for modeling long-term ...

Please sign up or login with your details

Forgot password? Click here to reset