Finite mixture modeling of censored and missing data using the multivariate skew-normal distribution

Finite mixture models have been widely used to model and analyze data from heterogeneous populations. Moreover, data of this kind can be missing or subject to upper and/or lower detection limits because of the limitations of experimental apparatus. Another complication arises when the measurements from each population depart significantly from normality, for instance by exhibiting asymmetric behavior. For such data structures, we propose a robust model for censored and/or missing data based on finite mixtures of multivariate skew-normal distributions. This approach allows us to model data with great flexibility, accommodating multimodality and skewness simultaneously, depending on the structure of the mixture components. We develop an analytically simple, yet efficient, EM-type algorithm for conducting maximum likelihood estimation of the parameters. The algorithm has closed-form expressions at the E-step that rely on formulas for the mean and variance of the truncated multivariate skew-normal distributions. Furthermore, a general information-based method for approximating the asymptotic covariance matrix of the estimators is also presented. Results obtained from the analysis of both simulated and real datasets are reported to demonstrate the effectiveness of the proposed method. The proposed algorithm and method are implemented in the new R package CensMFM.
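To make the role of censoring concrete, the following is a minimal univariate sketch in Python, not the multivariate method of the paper and not the CensMFM API. It assumes the standard skew-normal density 2/σ · φ(z) · Φ(αz) with z = (x − μ)/σ, and it assumes that a left-censored observation contributes the mixture probability of falling at or below its detection limit. All function names (`sn_pdf`, `sn_cdf`, `mixture_loglik`) are hypothetical, and the CDF is obtained by simple numerical integration rather than by the closed-form truncated moments the abstract refers to.

```python
import math


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def sn_pdf(x, loc, scale, shape):
    """Univariate skew-normal density with location, scale and shape."""
    z = (x - loc) / scale
    return 2.0 / scale * norm_pdf(z) * norm_cdf(shape * z)


def sn_cdf(x, loc, scale, shape, n=2000):
    """Skew-normal CDF by composite Simpson's rule (n must be even).

    The integration range is clipped to loc +/- 12 * scale, outside of
    which the density is numerically negligible.
    """
    lo = loc - 12.0 * scale
    hi = min(x, loc + 12.0 * scale)
    if hi <= lo:
        return 0.0
    h = (hi - lo) / n
    total = sn_pdf(lo, loc, scale, shape) + sn_pdf(hi, loc, scale, shape)
    for i in range(1, n):
        weight = 4.0 if i % 2 else 2.0
        total += weight * sn_pdf(lo + i * h, loc, scale, shape)
    return total * h / 3.0


def mixture_loglik(values, censored, weights, locs, scales, shapes):
    """Log-likelihood of a skew-normal mixture with left-censored data.

    values[i] is the observed value, or the detection limit when
    censored[i] is True (assumed convention for this sketch).
    """
    loglik = 0.0
    for v, is_censored in zip(values, censored):
        contrib = 0.0
        for w, m, s, a in zip(weights, locs, scales, shapes):
            if is_censored:
                contrib += w * sn_cdf(v, m, s, a)  # P(X <= limit)
            else:
                contrib += w * sn_pdf(v, m, s, a)  # density at the value
        loglik += math.log(contrib)
    return loglik


if __name__ == "__main__":
    # Two components; the third observation is left-censored at 0.5.
    ll = mixture_loglik(
        values=[1.2, 4.8, 0.5],
        censored=[False, False, True],
        weights=[0.6, 0.4],
        locs=[1.0, 5.0],
        scales=[1.0, 1.5],
        shapes=[2.0, -1.0],
    )
    print(ll)
```

An EM-type algorithm would maximize a likelihood of this form iteratively instead of evaluating it directly; the sketch only shows how observed and censored points enter the objective differently.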

research
02/08/2019

Testing the Order of Multivariate Normal Mixture Models

Finite mixtures of multivariate normal distributions have been widely us...
research
04/08/2020

Robust Mixture Modeling using Weighted Complete Estimating Equations

Mixture modeling that takes account of potential heterogeneity in data i...
research
06/14/2020

Heckman selection-t model: parameter estimation via the EM-algorithm

Heckman selection model is perhaps the most popular econometric model in...
research
01/26/2018

Multivariate normal mixture modeling, clustering and classification with the rebmix package

The rebmix package provides R functions for random univariate and multiv...
research
06/02/2019

Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

Recent work on overfitting Bayesian mixtures of distributions offers a p...
research
02/27/2015

Minimum message length estimation of mixtures of multivariate Gaussian and von Mises-Fisher distributions

Mixture modelling involves explaining some observed evidence using a com...
research
11/13/2021

A Hybrid EM Algorithm for Linear Two-Way Interactions with Missing Data

We study an EM algorithm for estimating product-term regression models w...
