An Introduction to the Practical and Theoretical Aspects of Mixture-of-Experts Modeling

07/12/2017
by Hien D. Nguyen, et al.

Mixture-of-experts (MoE) models are a powerful paradigm for modeling data arising from complex data generating processes (DGPs). In this article, we demonstrate how different MoE models can be constructed to approximate the underlying DGPs of arbitrary types of data. Due to the probabilistic nature of MoE models, we propose the maximum quasi-likelihood (MQL) estimator as a method for estimating MoE model parameters from data, and we provide conditions under which MQL estimators are consistent and asymptotically normal. The blockwise minorization-maximization (blockwise-MM) algorithm framework is proposed as an all-purpose method for constructing algorithms for obtaining MQL estimators. An example derivation of a blockwise-MM algorithm is provided. We then present a method for constructing information criteria for estimating the number of components in MoE models and provide justification for the classic Bayesian information criterion (BIC). We explain how MoE models can be used to conduct classification, clustering, and regression, and we illustrate these applications via a pair of worked examples.
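
To make the objects named in the abstract concrete, here is a minimal sketch of one common MoE form: softmax gating functions combined with Gaussian linear-regression experts, plus the classic BIC. This particular parameterization and every name in it (`softmax_gates`, `moe_density`, `bic`, `gate_weights`, `expert_coefs`, `expert_sds`) are illustrative assumptions, not the article's notation or its blockwise-MM algorithm.

```python
import numpy as np

def softmax_gates(x, gate_weights):
    """Gating probabilities per expert; x has n rows, gate_weights is (d + 1, k)."""
    # Assumed form: softmax of a linear score with an intercept column.
    design = np.column_stack([np.ones(len(x)), x])
    scores = design @ gate_weights
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    exp_scores = np.exp(scores)
    return exp_scores / exp_scores.sum(axis=1, keepdims=True)

def moe_density(y, x, gate_weights, expert_coefs, expert_sds):
    """Conditional density f(y | x) = sum_z gate_z(x) * Normal(y; mean_z(x), sd_z)."""
    design = np.column_stack([np.ones(len(x)), x])
    gates = softmax_gates(x, gate_weights)        # shape (n, k)
    means = design @ expert_coefs                 # shape (n, k), linear experts
    resid = (y[:, None] - means) / expert_sds     # standardized residuals
    normal_pdf = np.exp(-0.5 * resid**2) / (expert_sds * np.sqrt(2.0 * np.pi))
    return (gates * normal_pdf).sum(axis=1)

def bic(log_likelihood, n_params, n_obs):
    """Classic BIC: -2 log L + n_params * log(n_obs); smaller values are preferred."""
    return -2.0 * log_likelihood + n_params * np.log(n_obs)
```

In this sketch, one would compute `np.log(moe_density(...)).sum()` at fitted parameters for each candidate number of experts and compare the resulting `bic` values. How the parameters are fitted is left open here.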


