VC-PCR: A Prediction Method based on Supervised Variable Selection and Clustering

02/02/2022
by   Rebecca Marion, et al.
0

Sparse linear prediction methods suffer from decreased prediction accuracy when the predictor variables have cluster structure (e.g. there are highly correlated groups of variables). To improve prediction accuracy, various methods have been proposed to identify variable clusters from the data and integrate cluster information into a sparse modeling process. But none of these methods achieve satisfactory performance for prediction, variable selection and variable clustering simultaneously. This paper presents Variable Cluster Principal Component Regression (VC-PCR), a prediction method that supervises variable selection and variable clustering in order to solve this problem. Experiments with real and simulated data demonstrate that, compared to competitor methods, VC-PCR achieves better prediction, variable selection and clustering performance when cluster structure is present.

READ FULL TEXT
research
05/25/2023

Flexible Variable Selection for Clustering and Classification

The importance of variable selection for clustering has been recognized ...
research
03/27/2022

Interpretable Machine Learning Models for Modal Split Prediction in Transportation Systems

Modal split prediction in transportation networks has the potential to s...
research
04/24/2018

Classifying variable-structures: a general framework

In this work, we unify recent variable-clustering techniques within a co...
research
03/23/2011

Clustered regression with unknown clusters

We consider a collection of prediction experiments, which are clustered ...
research
04/07/2018

A group-based approach to the least squares regression for handling multicollinearity from strongly correlated variables

Multicollinearity due to strongly correlated predictor variables is a lo...
research
12/15/2022

Variable Clustering via Distributionally Robust Nodewise Regression

We study a multi-factor block model for variable clustering and connect ...
research
12/16/2022

Multi-Task Learning for Sparsity Pattern Heterogeneity: A Discrete Optimization Approach

We extend best-subset selection to linear Multi-Task Learning (MTL), whe...

Please sign up or login with your details

Forgot password? Click here to reset