Improved Inference of Gaussian Mixture Copula Model for Clustering and Reproducibility Analysis using Automatic Differentiation

10/24/2020
by Siva Rajesh Kasa, et al.

Copulas provide a modular parameterization of multivariate distributions that decouples the modeling of marginals from the dependencies between them. The Gaussian Mixture Copula Model (GMCM) is a highly flexible copula that can model many kinds of multi-modal dependencies, as well as asymmetric and tail dependencies. GMCMs have been used effectively in clustering non-Gaussian data and in Reproducibility Analysis, a meta-analysis method designed to verify the reliability and consistency of multiple high-throughput experiments. Parameter estimation for GMCM is challenging due to its intractable likelihood. The best previous methods maximize a proxy likelihood through a Pseudo Expectation Maximization (PEM) algorithm, which is guaranteed neither to converge nor, when it does converge, to reach the correct parameters. In this paper, we use Automatic Differentiation (AD) tools to develop a method, called AD-GMCM, that can maximize the exact GMCM likelihood. In our simulation studies and experiments with real data, AD-GMCM finds more accurate parameter estimates than PEM and yields better performance in clustering and Reproducibility Analysis. We discuss the advantages of an AD-based approach in addressing problems related to monotonic increase of likelihood and parameter identifiability in GMCM. We also analyze, for GMCM, two well-known cases of degeneracy of maximum likelihood in GMM that can lead to spurious clustering solutions. Our analysis shows that, unlike GMM, GMCM is not affected in one of the cases.
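
To make the "exact GMCM likelihood" concrete, here is a minimal sketch of how the copula log-density could be evaluated. It is not the authors' AD-GMCM implementation. It assumes the standard copula construction: a point u in the unit square is mapped to latent coordinates z_j by inverting the j-th marginal CDF of a Gaussian mixture, and the copula density is the joint mixture density at z divided by the product of its marginal densities. The bivariate restriction, the bisection-based inversion, and all function names are illustrative assumptions. Gradient-based maximization with an AD framework is not shown.

```python
import math

# A component is (weight, (mu1, mu2), (sd1, sd2), rho): a hypothetical layout
# chosen for this sketch, not taken from the paper.

def norm_pdf(x, m, s):
    return math.exp(-0.5 * ((x - m) / s) ** 2) / (s * math.sqrt(2.0 * math.pi))

def norm_cdf(x, m, s):
    return 0.5 * (1.0 + math.erf((x - m) / (s * math.sqrt(2.0))))

def marginal_pdf(x, comps, j):
    # Density of the j-th marginal of the latent Gaussian mixture.
    return sum(w * norm_pdf(x, mu[j], sd[j]) for w, mu, sd, _ in comps)

def marginal_cdf(x, comps, j):
    # CDF of the j-th marginal of the latent Gaussian mixture.
    return sum(w * norm_cdf(x, mu[j], sd[j]) for w, mu, sd, _ in comps)

def marginal_quantile(u, comps, j, iters=200):
    # The mixture CDF has no closed-form inverse, so invert it by bisection
    # (an assumption of this sketch; any monotone root finder would do).
    lo = min(mu[j] - 10.0 * sd[j] for _, mu, sd, _ in comps)
    hi = max(mu[j] + 10.0 * sd[j] for _, mu, sd, _ in comps)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if marginal_cdf(mid, comps, j) < u:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def bvn_pdf(x, y, mu, sd, rho):
    # Bivariate normal density with correlation rho.
    zx = (x - mu[0]) / sd[0]
    zy = (y - mu[1]) / sd[1]
    q = (zx * zx - 2.0 * rho * zx * zy + zy * zy) / (1.0 - rho * rho)
    norm = 2.0 * math.pi * sd[0] * sd[1] * math.sqrt(1.0 - rho * rho)
    return math.exp(-0.5 * q) / norm

def gmcm_log_density(u, comps):
    # log c(u) = log g(z) - sum_j log g_j(z_j), with z_j the marginal quantile of u_j.
    z = [marginal_quantile(u[j], comps, j) for j in range(2)]
    joint = sum(w * bvn_pdf(z[0], z[1], mu, sd, rho) for w, mu, sd, rho in comps)
    marg = marginal_pdf(z[0], comps, 0) * marginal_pdf(z[1], comps, 1)
    return math.log(joint) - math.log(marg)

def gmcm_log_likelihood(points, comps):
    # Sum of copula log-densities over observations in (0, 1)^2.
    return sum(gmcm_log_density(u, comps) for u in points)
```

With a single uncorrelated component the latent coordinates are independent, so the copula log-density should be zero everywhere. With a single correlated component the expression reduces to the Gaussian copula. Both cases make convenient sanity checks.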


