Unsupervised Learning for Computational Phenotyping

12/26/2016
by Chris Hodapp, et al.

The growth of large volumes of health care data has given rise to the research area of computational phenotyping, which uses techniques such as machine learning to describe illnesses and other clinical concepts from the data itself. The "traditional" supervised learning approach relies on a domain expert and has two main limitations: requiring skilled humans to supply correct labels limits its scalability and accuracy, and relying on existing clinical descriptions limits the sorts of patterns that can be found. For instance, it may fail to recognize that a disease treated as a single condition may really have several subtypes with different phenotypes, as seems to be the case with asthma and heart disease. Some recent papers instead report successes using unsupervised learning, which shows great potential for finding patterns in Electronic Health Records that would otherwise remain hidden and that can lead to a greater understanding of conditions and treatments. This work implements a method derived closely from Lasko et al., but does so in Apache Spark and Python and generalizes it to laboratory time-series data in MIMIC-III. It is released as an open-source tool for exploration, analysis, and visualization, available at https://github.com/Hodapp87/mimic3_phenotyping
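
As a rough illustration of the general idea of recovering subtypes without labels, the sketch below clusters synthetic, irregularly sampled laboratory series. It is not the method of Lasko et al. or of this work: the summary features (mean and slope), the plain two-cluster k-means, the synthetic data, and every function name here are assumptions chosen only to keep the example small and dependency-free.

```python
import random
from statistics import mean

def summarize(series):
    """Reduce an irregular (time, value) series to (mean value, linear slope)."""
    ts = [t for t, _ in series]
    vs = [v for _, v in series]
    t_bar, v_bar = mean(ts), mean(vs)
    denom = sum((t - t_bar) ** 2 for t in ts)
    slope = sum((t - t_bar) * (v - v_bar) for t, v in series) / denom
    return (v_bar, slope)

def kmeans2(points, iters=20):
    """Two-cluster k-means with deterministic farthest-point initialization."""
    def d2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    c0 = points[0]
    c1 = max(points, key=lambda p: d2(p, c0))
    centers = [c0, c1]
    labels = []
    for _ in range(iters):
        # Assignment step: each point joins its nearest center.
        labels = [0 if d2(p, centers[0]) <= d2(p, centers[1]) else 1
                  for p in points]
        # Update step: each center moves to the mean of its members.
        for k in (0, 1):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                centers[k] = tuple(mean(dim) for dim in zip(*members))
    return labels

def make_patient(rng, base, trend):
    """Hypothetical lab series: 8 irregularly spaced draws with small noise."""
    times = sorted(rng.uniform(0, 10) for _ in range(8))
    return [(t, base + trend * t + rng.gauss(0, 0.1)) for t in times]

rng = random.Random(0)
# Two hidden "subtypes" of one nominal condition (synthetic, for illustration):
# stable low values versus higher, slowly rising values.
patients = ([make_patient(rng, 1.0, 0.0) for _ in range(10)]
            + [make_patient(rng, 5.0, 0.2) for _ in range(10)])
features = [summarize(p) for p in patients]
labels = kmeans2(features)  # no subtype labels were supplied to the algorithm
print(labels)
```

The clustering sees only the series themselves, so any grouping it finds comes from the data rather than from an expert-supplied label. Real laboratory data would need far more care with missing values, units, and feature scaling than this toy example shows.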


