MIMICause : Defining, identifying and predicting types of causal relationships between biomedical concepts from clinical notes

by   Vivek Khetan, et al.
Arizona State University
Duke University
indian institute of science

Understanding of causal narratives communicated in clinical notes can help make strides towards personalized healthcare. In this work, MIMICause, we propose annotation guidelines, develop an annotated corpus and provide baseline scores to identify types and direction of causal relations between a pair of biomedical concepts in clinical notes; communicated implicitly or explicitly, identified either in a single sentence or across multiple sentences. We annotate a total of 2714 de-identified examples sampled from the 2018 n2c2 shared task dataset and train four different language model based architectures. Annotation based on our guidelines achieved a high inter-annotator agreement i.e. Fleiss' kappa score of 0.72 and our model for identification of causal relation achieved a macro F1 score of 0.56 on test data. The high inter-annotator agreement for clinical text shows the quality of our annotation guidelines while the provided baseline F1 score sets the direction for future research towards understanding narratives in clinical texts.


page 1

page 2

page 3

page 4


Annotating and normalizing biomedical NEs with limited knowledge

Named entity recognition (NER) is the very first step in the linguistic ...

Building a comprehensive syntactic and semantic corpus of Chinese clinical texts

Objective: To build a comprehensive corpus covering syntactic and semant...

Developing a cardiovascular disease risk factor annotated corpus of Chinese electronic medical records

Cardiovascular disease (CVD) has become the leading cause of death in Ch...

The Medical Scribe: Corpus Development and Model Performance Analyses

There is a growing interest in creating tools to assist in clinical note...

This before That: Causal Precedence in the Biomedical Domain

Causal precedence between biochemical interactions is crucial in the bio...

Large Language Models for Biomedical Causal Graph Construction

Automatic causal graph construction is of high importance in medical res...

Enriched Annotations for Tumor Attribute Classification from Pathology Reports with Limited Labeled Data

Precision medicine has the potential to revolutionize healthcare, but mu...

Please sign up or login with your details

Forgot password? Click here to reset