Clustering Left-Censored Multivariate Time-Series

02/13/2021
by   Irene Y. Chen, et al.
0

Unsupervised learning seeks to uncover patterns in data. However, different kinds of noise may impede the discovery of useful substructure from real-world time-series data. In this work, we focus on mitigating the interference of left-censorship in the task of clustering. We provide conditions under which clusters and left-censorship may be identified; motivated by this result, we develop a deep generative, continuous-time model of time-series data that clusters while correcting for censorship time. We demonstrate accurate, stable, and interpretable results on synthetic data that outperform several benchmarks. To showcase the utility of our framework on real-world problems, we study how left-censorship can adversely affect the task of disease phenotyping, resulting in the often incorrect assumption that longitudinal patient data are aligned by disease stage. In reality, patients at the time of diagnosis are at different stages of the disease – both late and early due to differences in when patients seek medical care and such discrepancy can confound unsupervised learning algorithms. On two clinical datasets, our model corrects for this form of censorship and recovers known clinical subtypes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2020

Temporal Phenotyping using Deep Predictive Clustering of Disease Progression

Due to the wider availability of modern electronic health records, patie...
research
08/31/2021

Clustering of Pain Dynamics in Sickle Cell Disease from Sparse, Uneven Samples

Irregularly sampled time series data are common in a variety of fields. ...
research
02/24/2023

T-Phenotype: Discovering Phenotypes of Predictive Temporal Patterns in Disease Progression

Clustering time-series data in healthcare is crucial for clinical phenot...
research
04/12/2021

A smoothed and probabilistic PARAFAC model with covariates

Analysis and clustering of multivariate time-series data attract growing...
research
12/26/2016

Unsupervised Learning for Computational Phenotyping

With large volumes of health care data comes the research area of comput...
research
01/11/2023

Clustering disease trajectories in contrastive feature space for biomarker discovery in age-related macular degeneration

Age-related macular degeneration (AMD) is the leading cause of blindness...
research
08/03/2017

Detecting early signs of depressive and manic episodes in patients with bipolar disorder using the signature-based model

Recurrent major mood episodes and subsyndromal mood instability cause su...

Please sign up or login with your details

Forgot password? Click here to reset