The Lazy Bootstrap. A Fast Resampling Method for Evaluating Latent Class Model Fit

01/29/2018
by Geert H. van Kollenburg, et al.

The latent class model is a powerful unsupervised clustering algorithm for categorical data. Many statistics exist to test the fit of the latent class model. However, traditional methods to evaluate those fit statistics are not always useful: asymptotic distributions are not always known, and empirical reference distributions can be very time-consuming to obtain. In this paper we propose a fast resampling scheme with which any type of model fit can be assessed. We illustrate it here on the latent class model, but the methodology can be applied in any situation. The principle behind the lazy bootstrap method is to specify a statistic that captures the characteristics of the data that a model should reproduce correctly. If those characteristics are very different in the observed data and in model-generated data, we can assume that the model could not have produced the observed data. With this method we achieve the flexibility of tests from the Bayesian framework while needing only maximum likelihood estimates. We provide a step-wise algorithm with which the fit of a model can be assessed based on the characteristics that the researcher considers important. In a Monte Carlo study we show that the method has very low type I error rates for all illustrated statistics. Power to reject a model depended largely on the type of statistic that was used and on sample size. We applied the method to an empirical data set on clinical subgroups with risk of myocardial infarction and compared the results directly to the parametric bootstrap. The results of our method were highly similar to those obtained by the parametric bootstrap, while the required computations differed by three orders of magnitude in favour of our method.
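The abstract states the principle but not the step-wise algorithm, so the following is only a minimal sketch of that principle, not the authors' implementation. It assumes binary items, that the fitted class weights and item probabilities are already available, and (as one reading of "lazy") that the model is not re-estimated on each replicate. All function names, the pairwise Pearson-type statistic, and the parameter values are hypothetical choices made for illustration.

```python
import random

def simulate(weights, probs, n, rng):
    """Draw n binary response patterns from a latent class model."""
    data = []
    for _ in range(n):
        c = rng.choices(range(len(weights)), weights=weights)[0]
        data.append([int(rng.random() < p) for p in probs[c]])
    return data

def pair_statistic(data, weights, probs, j=0, k=1):
    """Pearson-type discrepancy between the observed and the
    model-expected 2x2 table of items j and k (illustrative choice).
    Assumes every expected cell count is strictly positive."""
    n = len(data)
    stat = 0.0
    for a in (0, 1):
        for b in (0, 1):
            observed = sum(1 for row in data if row[j] == a and row[k] == b)
            expected = n * sum(
                w * (p[j] if a else 1 - p[j]) * (p[k] if b else 1 - p[k])
                for w, p in zip(weights, probs)
            )
            stat += (observed - expected) ** 2 / expected
    return stat

def resampling_p_value(data, weights, probs, statistic, n_rep=200, seed=0):
    """Share of model-generated replicates whose statistic is at least
    as large as the observed one. The estimates are reused as-is
    (assumption: no re-estimation on the replicates)."""
    rng = random.Random(seed)
    observed = statistic(data, weights, probs)
    exceed = 0
    for _ in range(n_rep):
        replicate = simulate(weights, probs, n, rng) if (n := len(data)) else []
        if statistic(replicate, weights, probs) >= observed:
            exceed += 1
    return observed, exceed / n_rep

# Hypothetical example: a one-class (independence) model applied to data
# in which items 0 and 1 always agree, a dependence the model cannot capture.
weights = [1.0]
probs = [[0.5, 0.5, 0.5]]
data = [[0, 0, i % 2] for i in range(100)] + [[1, 1, i % 2] for i in range(100)]
obs, p_value = resampling_p_value(data, weights, probs, pair_statistic)
print(obs, p_value)
```

In this toy case the observed statistic lies far above anything the independence model generates, so the model would be rejected. Any other statistic with the same signature can be passed in, which mirrors the flexibility the abstract describes.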


research
11/27/2019

Goodness-of-fit test for the bivariate Hermite distribution

This paper studies the goodness of fit test for the bivariate Hermite di...
research
10/30/2019

New weighted L^2-type tests for the inverse Gaussian distribution

We propose a new class of goodness-of-fit tests for the inverse Gaussian...
research
09/20/2021

Machine Learning-Based Estimation and Goodness-of-Fit for Large-Scale Confirmatory Item Factor Analysis

We investigate novel parameter estimation and goodness-of-fit (GOF) asse...
research
01/14/2020

A Higher-Order Correct Fast Moving-Average Bootstrap for Dependent Data

We develop and implement a novel fast bootstrap for dependent data. Our ...
research
01/04/2007

Bootstrap for neural model selection

Bootstrap techniques (also called resampling computation techniques) hav...
research
09/15/2020

Identifying latent classes with ordered categorical indicators

A Monte Carlo simulation was used to determine which assumptions for ord...
