Generalized k-Means in GLMs with Applications to the Outbreak of COVID-19 in the United States

08/09/2020
by   Tonglin Zhang, et al.
0

Generalized k-means can be incorporated with any similarity or dissimilarity measure for clustering. By choosing the dissimilarity measure as the well known likelihood ratio or F-statistic, this work proposes a method based on generalized k-means to group statistical models. Given the number of clusters k, the method is established under hypothesis tests between statistical models. If k is unknown, then the method can be combined with GIC to automatically select the best k for clustering. The article investigates both AIC and BIC as the special cases. Theoretical and simulation results show that the number of clusters can be identified by BIC but not AIC. The resulting method for GLMs is used to group the state-level time series patterns for the outbreak of COVID-19 in the United States. A further study shows that the statistical models between the clusters are significantly different from each other. This study confirms the result given by the proposed method based on generalized k-means.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/02/2020

Vector quantisation and partitioning of COVID-19 temporal dynamics in the United States

The statistical dynamics of a pathogen within a population depend on a r...
research
07/12/2020

Changing Clusters of Indian States with respect to number of Cases of COVID-19 using incrementalKMN Method

The novel Coronavirus (COVID-19) incidence in India is currently experie...
research
09/14/2021

Learning trends of COVID-19 using semi-supervised clustering

A finite mixture model is used to learn trends from the currently availa...
research
11/26/2021

SARS-CoV-2 Dissemination using a Network of the United States Counties

During 2020 and 2021, severe acute respiratory syndrome coronavirus 2 (S...
research
07/21/2020

Clustering patterns connecting COVID-19 dynamics and Human mobility using optimal transport

Social distancing and stay-at-home are among the few measures that are k...
research
01/13/2022

Context binning, model clustering and adaptivity for data compression of genetic data

Rapid growth of genetic databases means huge savings from improvements i...
research
05/20/2023

On the Relationship between Markov Switching Models and Fuzzy Clustering: a Nonparametric Method to Detect the Number of States

Markov Switching models have had increasing success in time series analy...

Please sign up or login with your details

Forgot password? Click here to reset