A Bernoulli Mixture Model to Understand and Predict Children Longitudinal Wheezing Patterns

In this research, we estimate that around 27.99(±2.15)% of the population has experienced wheezing before turning 1 in the United Kingdom. Furthermore, the Bernoulli Mixture Model classification is found to work best with K=4 clusters in order to better balance the separability of the clusters with their explanatory nature, based on a cohort of N=1184. The probability of the group of parents in the jth cluster to say that their children have wheezed during the ith age is assumed P_ij∼Beta(1/2, 1/2), the probabilities of assignment to each cluster is R ∼Dirichlet_K(α), the assignment of the nth patient to each cluster is Z_n | R ∼Categorical(R), and the nth patient wheezed during the ith age is X_in | P_ij, Z_n ∼Bernoulli(P_i,Z_n); where i∈{1,...,6}, j∈{1,...,K}, and n∈{1,..., N}. The classification is then performed through the E-M optimization algorithm. We found that this clustering method groups efficiently the patients with late-childhood wheezing, persistent wheezing, early-childhood wheezing, and none or sporadic wheezing. Furthermore, we found that this method is not dependent on the data-set, and can include data-sets with missing entries.

READ FULL TEXT

page 9

page 10

page 11

research
08/02/2019

Identification of gatekeeper diseases on the way to cardiovascular mortality

Multimorbidity, the co-occurrence of two or more chronic diseases such a...
research
12/29/2022

Cluster-level Group Representativity Fairness in k-means Clustering

There has been much interest recently in developing fair clustering algo...
research
07/04/2019

A Mean Field Games approach to Cluster Analysis

In this paper, we develop a Mean Field Games approach to Cluster Analysi...
research
11/10/2017

Robust Clustering with Subpopulation-specific Deviations

The National Birth Defects Prevention Study (NBDPS) was a case-control s...
research
05/17/2020

Model-Based Longitudinal Clustering with Varying Cluster Assignments

It is often of interest to perform clustering on longitudinal data, yet ...
research
10/26/2010

A GMBCG Galaxy Cluster Catalog of 55,424 Rich Clusters from SDSS DR7

We present a large catalog of optically selected galaxy clusters from th...
research
09/17/2019

Multiclass classification of growth curves using random change points and heterogeneous random effects

Faltering growth among children is a nutritional problem prevalent in lo...

Please sign up or login with your details

Forgot password? Click here to reset