Defying the Circadian Rhythm: Clustering Participant Telemetry in the UK Biobank Data

11/17/2020
by   Nikola Pocuca, et al.
0

The UK Biobank dataset follows over 500,000 volunteers and contains a diverse set of information related to societal outcomes. Among this vast collection, a large quantity of telemetry collected from wrist-worn accelerometers provides a snapshot of participant activity. Using this data, a population of shift workers, subjected to disrupted circadian rhythms, is analysed using a mixture model-based approach to yield protective effects from physical activity on survival outcomes. In this paper, we develop a scalable, standardized, and unique methodology that efficiently clusters a vast quantity of participant telemetry. By building upon the work of Doherty et al. (2017), we introduce a standardized, low-dimensional feature for clustering purposes. Participants are clustered using a matrix variate mixture model-based approach. Once clustered, survival analysis is performed to demonstrate distinct lifetime outcomes for individuals within each cluster. In summary, we process, cluster, and analyse a subset of UK Biobank participants to show the protective effects from physical activity on circadian disrupted individuals.

READ FULL TEXT
research
03/04/2023

Bayesian clustering of high-dimensional data via latent repulsive mixtures

Model-based clustering of moderate or large dimensional data is notoriou...
research
05/02/2022

MEGH: A parametric class of general hazard models for clustered survival data

In many applications of survival data analysis, the individuals are trea...
research
04/23/2019

Identifying Precipitation Regimes in China Using Model-Based Clustering of Spatial Functional Data

The identification of precipitation regimes is important for many purpos...
research
05/21/2022

Bayesian Clustering of Neural Activity with a Mixture of Dynamic Poisson Factor Analyzers

Modern neural recording techniques allow neuroscientists to observe the ...
research
11/10/2017

Robust Clustering with Subpopulation-specific Deviations

The National Birth Defects Prevention Study (NBDPS) was a case-control s...
research
12/07/2022

A parallelizable model-based approach for marginal and multivariate clustering

This paper develops a clustering method that takes advantage of the stur...
research
01/12/2019

Are Clusterings of Multiple Data Views Independent?

In the Pioneer 100 (P100) Wellness Project (Price and others, 2017), mul...

Please sign up or login with your details

Forgot password? Click here to reset