Visualization of Emergency Department Clinical Data for Interpretable Patient Phenotyping

07/05/2019
by Nathan C. Hurley, et al.

Visual summarization of the clinical data held in a patient's electronic health record (EHR) may enable precise and rapid triage when the patient presents to an emergency department (ED). Triage is critical to allocating resources appropriately and to anticipating eventual patient disposition, typically admission to the hospital or discharge home. EHR data are high-dimensional and complex, but they offer the opportunity to discover and characterize underlying data-driven patient phenotypes. These phenotypes will enable improved, personalized therapeutic decision making and prognostication. In this work, we focus on the challenge of two-dimensional patient projections: a low-dimensional embedding offers a visual interpretability that is lost in higher dimensions. Linear dimensionality reduction techniques such as principal component analysis are often used for this purpose, but they are insufficient to describe the variance of patient data. We instead employ the recently described non-linear embedding technique uniform manifold approximation and projection (UMAP), which seeks to capture both local and global structure in high-dimensional data. We then use Gaussian mixture models to identify clusters in the embedded data and use the adjusted Rand index (ARI) to establish the stability of the discovered clusters. We apply this technique to five common clinical chief complaints from a real-world ED EHR dataset and describe the emergent properties of the discovered clusters. We observe clinically relevant cluster attributes, suggesting that visual embedding of EHR data using non-linear dimensionality reduction is a promising approach to revealing data-driven patient phenotypes. Across the five chief complaints, we find between 2 and 6 clusters, with the peak mean pairwise ARI between subsequent training iterations ranging from 0.35 to 0.74.
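To make the stability measure concrete, the following is a minimal sketch of the adjusted Rand index and of a mean pairwise ARI over repeated clustering runs, written with the Python standard library only. It is not the authors' implementation: the function names `adjusted_rand_index` and `mean_pairwise_ari` are hypothetical, the toy label lists are invented for illustration, and averaging over all pairs of runs is an assumption, since the abstract does not specify how runs are paired.

```python
from collections import Counter
from itertools import combinations
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two cluster labelings of the same items."""
    if len(labels_a) != len(labels_b):
        raise ValueError("labelings must cover the same items")
    n = len(labels_a)

    # Contingency counts: items sharing a cluster in A and a cluster in B.
    joint = Counter(zip(labels_a, labels_b))
    sizes_a = Counter(labels_a)
    sizes_b = Counter(labels_b)

    # Numbers of item pairs that are co-clustered in both, in A, and in B.
    pairs_both = sum(comb(c, 2) for c in joint.values())
    pairs_a = sum(comb(c, 2) for c in sizes_a.values())
    pairs_b = sum(comb(c, 2) for c in sizes_b.values())

    total_pairs = comb(n, 2)
    if total_pairs == 0:
        return 1.0  # fewer than two items: the labelings agree trivially

    expected = pairs_a * pairs_b / total_pairs  # agreement expected by chance
    maximum = (pairs_a + pairs_b) / 2
    if maximum == expected:
        return 1.0  # degenerate case, e.g. both labelings use a single cluster
    return (pairs_both - expected) / (maximum - expected)


def mean_pairwise_ari(runs):
    """Mean ARI over all pairs of runs (assumed pairing scheme)."""
    scores = [adjusted_rand_index(a, b) for a, b in combinations(runs, 2)]
    return sum(scores) / len(scores)


# Toy labelings of four patients from three hypothetical clustering runs.
runs = [
    [0, 0, 1, 1],
    [1, 1, 0, 0],  # same partition as the first run, with labels swapped
    [0, 0, 1, 2],  # splits the second cluster in two
]
stability = mean_pairwise_ari(runs)
```

Because the ARI compares partitions rather than label values, the first two runs count as identical even though their cluster labels are swapped. That property matters when comparing repeated mixture-model fits, where cluster numbering is arbitrary.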
