Cluster Regularization via a Hierarchical Feature Regression

07/10/2021
by   Johann Pfitzinger, et al.
0

Prediction tasks with high-dimensional nonorthogonal predictor sets pose a challenge for least squares based fitting procedures. A large and productive literature exists, discussing various regularized approaches to improving the out-of-sample robustness of parameter estimates. This paper proposes a novel cluster-based regularization - the hierarchical feature regression (HFR) -, which mobilizes insights from the domains of machine learning and graph theory to estimate parameters along a supervised hierarchical representation of the predictor set, shrinking parameters towards group targets. The method is innovative in its ability to estimate optimal compositions of predictor groups, as well as the group targets endogenously. The HFR can be viewed as a supervised factor regression, with the strength of shrinkage governed by a penalty on the extent of idiosyncratic variation captured in the fitting process. The method demonstrates good predictive accuracy and versatility, outperforming a panel of benchmark regularized estimators across a diverse set of simulated regression tasks, including dense, sparse and grouped data generating processes. An application to the prediction of economic growth is used to illustrate the HFR's effectiveness in an empirical setting, with favorable comparisons to several frequentist and Bayesian alternatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2021

Group selection and shrinkage with application to sparse semiparametric modeling

Sparse regression and classification estimators capable of group selecti...
research
10/07/2018

Sparse Regression with Multi-type Regularized Feature Modeling

Within the statistical and machine learning literature, regularization t...
research
05/21/2018

Multilevel Models Allow Modular Specification of What and Where to Regularize, Especially in Small Area Estimation

Through the lense of multilevel model (MLM) specification and regulariza...
research
06/19/2018

Simultaneous Signal Subspace Rank and Model Selection with an Application to Single-snapshot Source Localization

This paper proposes a novel method for model selection in linear regress...
research
10/29/2020

Group-regularized ridge regression via empirical Bayes noise level cross-validation

Features in predictive models are not exchangeable, yet common supervise...
research
08/28/2013

Compound Poisson Processes, Latent Shrinkage Priors and Bayesian Nonconvex Penalization

In this paper we discuss Bayesian nonconvex penalization for sparse lear...
research
07/05/2022

Regularized Predictive Models for Beef Eating Quality of Individual Meals

Faced with changing markets and evolving consumer demands, beef industri...

Please sign up or login with your details

Forgot password? Click here to reset