tSPM+; a high-performance algorithm for mining transitive sequential patterns from clinical data

09/08/2023
by   Jonas Hügel, et al.
0

The increasing availability of large clinical datasets collected from patients can enable new avenues for computational characterization of complex diseases using different analytic algorithms. One of the promising new methods for extracting knowledge from large clinical datasets involves temporal pattern mining integrated with machine learning workflows. However, mining these temporal patterns is a computational intensive task and has memory repercussions. Current algorithms, such as the temporal sequence pattern mining (tSPM) algorithm, are already providing promising outcomes, but still leave room for optimization. In this paper, we present the tSPM+ algorithm, a high-performance implementation of the tSPM algorithm, which adds a new dimension by adding the duration to the temporal patterns. We show that the tSPM+ algorithm provides a speed up to factor 980 and a up to 48 fold improvement in memory consumption. Moreover, we present a docker container with an R-package, We also provide vignettes for an easy integration into already existing machine learning workflows and use the mined temporal sequences to identify Post COVID-19 patients and their symptoms according to the WHO definition.

READ FULL TEXT
research
08/13/2023

Discovering the Symptom Patterns of COVID-19 from Recovered and Deceased Patients Using Apriori Association Rule Mining

The COVID-19 pandemic has a devastating impact globally, claiming millio...
research
02/26/2022

TaSPM: Targeted Sequential Pattern Mining

Sequential pattern mining (SPM) is an important technique of pattern min...
research
10/20/2020

Extracting Seasonal Gradual Patterns from Temporal Sequence Data Using Periodic Patterns Mining

Mining frequent episodes aims at recovering sequential patterns from tem...
research
03/13/2017

SPARTan: Scalable PARAFAC2 for Large & Sparse Data

In exploratory tensor mining, a common problem is how to analyze a set o...
research
07/16/2019

Sequential Pattern mining of Longitudinal Adverse Events After Left Ventricular Assist Device Implant

Left ventricular assist devices (LVADs) are an increasingly common thera...
research
04/25/2018

Revealing patterns in HIV viral load data and classifying patients via a novel machine learning cluster summarization method

HIV RNA viral load (VL) is an important outcome variable in studies of H...
research
09/11/2017

Discriminant chronicles mining: Application to care pathways analytics

Pharmaco-epidemiology (PE) is the study of uses and effects of drugs in ...

Please sign up or login with your details

Forgot password? Click here to reset