Trace Clustering on Very Large Event Data in Healthcare Using Frequent Sequence Patterns

01/10/2020
by Xixi Lu, et al.

Trace clustering has increasingly been applied to find homogeneous process executions. However, current techniques have difficulty finding a meaningful and insightful clustering of patients on the basis of healthcare data. The resulting clusters are often not in line with those identified by medical experts, nor are they guaranteed to yield meaningful process maps of patients' clinical pathways. After all, a single hospital may conduct thousands of distinct activities and generate millions of events per year. In this paper, we propose a novel trace clustering approach that uses sample sets of patients provided by medical experts. More specifically, we learn frequent sequence patterns on a sample set, rank each patient based on the patterns, and use an automated approach to determine the corresponding cluster. We find each cluster separately, while the frequent sequence patterns are used to discover a process map. The approach is implemented in ProM and evaluated using a large data set obtained from a university medical center. The evaluation shows F1-scores of 0.7 for grouping kidney injury, 0.9 for diabetes, and 0.64 for head/neck tumor, while, according to the domain experts, the process maps show meaningful behavioral patterns of the clinical pathways of these groups.
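The pipeline summarized in the abstract (learn frequent sequence patterns on an expert-provided sample, score each patient against those patterns, derive a cluster, and evaluate with an F1-score) can be illustrated with a small toy sketch. This is not the authors' ProM implementation: the function names, the restriction to subsequences of length at most 2, the support-weighted score, and the fixed `threshold` are all assumptions made for illustration, and the paper itself determines the cluster with an automated approach rather than a hand-set cutoff.

```python
from collections import Counter
from itertools import combinations


def subsequences(trace, max_len=2):
    """Set of ordered, not necessarily contiguous, activity tuples up to max_len."""
    found = set()
    for k in range(1, max_len + 1):
        for idx in combinations(range(len(trace)), k):
            found.add(tuple(trace[i] for i in idx))
    return found


def frequent_patterns(sample_traces, min_support=0.6, max_len=2):
    """Patterns occurring in at least min_support of the sample traces, with their support."""
    counts = Counter()
    for trace in sample_traces:
        counts.update(subsequences(trace, max_len))
    n = len(sample_traces)
    return {p: c / n for p, c in counts.items() if c / n >= min_support}


def score(trace, patterns, max_len=2):
    """Support-weighted share of the frequent patterns found in the trace (hypothetical ranking)."""
    total = sum(patterns.values())
    if total == 0:
        return 0.0
    subs = subsequences(trace, max_len)
    return sum(s for p, s in patterns.items() if p in subs) / total


def cluster(log, sample_ids, min_support=0.6, threshold=0.5):
    """Patients whose score reaches the threshold (assumed fixed cutoff), plus all scores."""
    patterns = frequent_patterns([log[i] for i in sample_ids], min_support)
    scores = {pid: score(trace, patterns) for pid, trace in log.items()}
    return {pid for pid, s in scores.items() if s >= threshold}, scores


def f1(predicted, actual):
    """F1-score of a predicted patient set against an expert-labelled set."""
    tp = len(predicted & actual)
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(actual)
    return 2 * precision * recall / (precision + recall)


# Invented toy event log: patient id -> ordered list of activities.
toy_log = {
    "p1": ["intake", "glucose_test", "insulin"],
    "p2": ["intake", "glucose_test", "insulin", "checkup"],
    "p3": ["intake", "xray", "cast"],
    "p4": ["glucose_test", "insulin"],
}
toy_cluster, toy_scores = cluster(toy_log, sample_ids=["p1", "p2"])
```

Enumerating all subsequences is exponential in `max_len` and would not scale to the millions of events mentioned above; a dedicated frequent sequence mining algorithm would be needed for real data.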

research
04/03/2019

Identifying Patient Groups based on Frequent Patterns of Patient Samples

Grouping patients meaningfully can give insights about the different typ...
research
10/13/2021

Expert-driven Trace Clustering with Instance-level Constraints

Within the field of process mining, several different trace clustering a...
research
08/22/2023

Patient Clustering via Integrated Profiling of Clinical and Digital Data

We introduce a novel profile-based patient clustering model designed for...
research
02/15/2023

Mimetic Muscle Rehabilitation Analysis Using Clustering of Low Dimensional 3D Kinect Data

Facial nerve paresis is a severe complication that arises post-head and ...
research
08/02/2022

Enabling scalable clinical interpretation of ML-based phenotypes using real world data

The availability of large and deep electronic healthcare records (EHR) d...
research
01/14/2018

Hire the Experts: Combinatorial Auction Based Scheme for Experts Selection in E-Healthcare

During the last decade, scheduling the healthcare services (such as staf...
research
12/31/2022

Definition and clinical validation of Pain Patient States from high-dimensional mobile data: application to a chronic pain cohort

The technical capacity to monitor patients with a mobile device has dras...