Latent group structure and regularized regression

08/21/2019
by Konstantinos Perrakis, et al.

Regression modelling typically assumes homogeneity of the conditional distribution of responses Y given features X. For inhomogeneous data, with latent groups having potentially different underlying distributions, the hidden group structure can be crucial for estimation and prediction, and standard regression models may be severely confounded. Worse, in the multivariate setting, such inhomogeneity can easily pass undetected. To allow for robust and interpretable regression modelling in the heterogeneous data setting, we put forward a class of mixture models that couples the multivariate marginal on X with the conditional Y | X to capture the latent group structure. This joint modelling approach allows for group-specific regression parameters, automatically controls for the latent confounding that may otherwise pose difficulties, and offers a novel way to deal with suspected distributional shifts in the data. We show how the latent variable model can be regularized to provide scalable solutions with explicit sparsity. Estimation is handled via an expectation-maximization algorithm. We illustrate the key ideas via empirical examples.
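
To make the joint-mixture idea concrete, here is a minimal sketch of an expectation-maximization fit in which each latent group has its own marginal on X and its own regression of Y on X. It is an illustration under stated assumptions, not the authors' implementation: a single feature, Gaussian components, and a ridge penalty on the slope standing in for the sparsity-inducing regularization mentioned in the abstract. The function names (`fit_joint_mixture`, `log_normal`), the initialization by sorted X, and the simulated data are all hypothetical.

```python
import math
import random


def log_normal(v, mean, var):
    """Log density of a univariate Gaussian (assumed component family)."""
    return -0.5 * (math.log(2 * math.pi * var) + (v - mean) ** 2 / var)


def fit_joint_mixture(xs, ys, k=2, ridge=1e-3, iters=50):
    """EM for a k-group mixture over (x, y) with group-specific regressions.

    Assumption: ridge shrinkage on the slope replaces a sparsity penalty
    so that the M-step stays in closed form.
    """
    n = len(xs)
    # Hypothetical initialization: hard-assign points to k chunks by sorted x.
    order = sorted(range(n), key=lambda i: xs[i])
    resp = [[0.0] * k for _ in range(n)]
    for rank, i in enumerate(order):
        resp[i][rank * k // n] = 1.0

    params = []
    for _ in range(iters):
        # M-step: weighted estimates of the marginal on X and of Y | X.
        params = []
        for g in range(k):
            w = [r[g] for r in resp]
            sw = max(sum(w), 1e-12)
            mx = sum(wi * x for wi, x in zip(w, xs)) / sw
            my = sum(wi * y for wi, y in zip(w, ys)) / sw
            sxx = sum(wi * (x - mx) ** 2 for wi, x in zip(w, xs))
            sxy = sum(wi * (x - mx) * (y - my) for wi, x, y in zip(w, xs, ys))
            beta = sxy / (sxx + ridge)  # ridge-penalized slope
            alpha = my - beta * mx      # unpenalized intercept
            rss = sum(wi * (y - alpha - beta * x) ** 2
                      for wi, x, y in zip(w, xs, ys))
            params.append({
                "pi": sw / n,
                "mean_x": mx,
                "var_x": sxx / sw + 1e-6,
                "alpha": alpha,
                "beta": beta,
                "var_y": rss / sw + 1e-6,
            })
        # E-step: responsibilities use both p(x | group) and p(y | x, group).
        for i in range(n):
            logd = [
                math.log(max(p["pi"], 1e-12))
                + log_normal(xs[i], p["mean_x"], p["var_x"])
                + log_normal(ys[i], p["alpha"] + p["beta"] * xs[i], p["var_y"])
                for p in params
            ]
            top = max(logd)
            wts = [math.exp(v - top) for v in logd]
            total = sum(wts)
            resp[i] = [wt / total for wt in wts]
    return params, resp


# Simulated data (hypothetical): two latent groups with opposite slopes.
rng = random.Random(0)
xs, ys = [], []
for _ in range(200):
    x = rng.gauss(-3.0, 1.0)
    xs.append(x)
    ys.append(2.0 * x + rng.gauss(0.0, 0.3))
    x = rng.gauss(3.0, 1.0)
    xs.append(x)
    ys.append(-1.0 * x + rng.gauss(0.0, 0.3))

params, resp = fit_joint_mixture(xs, ys)
for p in params:
    print(round(p["pi"], 2), round(p["mean_x"], 2), round(p["beta"], 2))
```

In this toy setting the group membership shifts both the distribution of X and the slope of Y on X, so a single pooled regression line would match neither group's slope. Because the responsibilities combine the marginal and conditional terms, the group-specific slopes can be estimated separately.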

research
05/03/2022

Scalable Regularised Joint Mixture Models

In many applications, data can be heterogeneous in the sense of spanning...
research
05/05/2022

Bivariate vine copula based quantile regression

The statistical analysis of univariate quantiles is a well developed res...
research
03/12/2021

Mixture composite regression models with multi-type feature selection

The aim of this paper is to present a mixture composite regression model...
research
12/08/2022

Strong identifiability and parameter learning in regression with heterogeneous response

Mixtures of regression are a powerful class of models for regression lea...
research
10/06/2017

Gradient boosting in Markov-switching generalized additive models for location, scale and shape

We propose a novel class of flexible latent-state time series regression...
research
05/04/2023

On factor copula-based mixed regression models

In this article, a copula-based method for mixed regression models is pr...
research
10/21/2019

Marginally Interpretable Linear Transformation Models for Clustered Observations

Clustered observations are ubiquitous in controlled and observational st...
