Supervised Multivariate Learning with Simultaneous Feature Auto-grouping and Dimension Reduction

12/17/2021
by   Yiyuan She, et al.
0

Modern high-dimensional methods often adopt the "bet on sparsity" principle, while in supervised multivariate learning statisticians may face "dense" problems with a large number of nonzero coefficients. This paper proposes a novel clustered reduced-rank learning (CRL) framework that imposes two joint matrix regularizations to automatically group the features in constructing predictive factors. CRL is more interpretable than low-rank modeling and relaxes the stringent sparsity assumption in variable selection. In this paper, new information-theoretical limits are presented to reveal the intrinsic cost of seeking for clusters, as well as the blessing from dimensionality in multivariate learning. Moreover, an efficient optimization algorithm is developed, which performs subspace learning and clustering with guaranteed convergence. The obtained fixed-point estimators, though not necessarily globally optimal, enjoy the desired statistical accuracy beyond the standard likelihood setup under some regularity conditions. Moreover, a new kind of information criterion, as well as its scale-free form, is proposed for cluster and rank selection, and has a rigorous theoretical support without assuming an infinite sample size. Extensive simulations and real-data experiments demonstrate the statistical accuracy and interpretability of the proposed method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2011

Joint variable and rank selection for parsimonious estimation of high-dimensional matrices

We propose dimension reduction methods for sparse, high-dimensional mult...
research
03/25/2014

Selective Factor Extraction in High Dimensions

This paper studies simultaneous feature selection and extraction in supe...
research
12/13/2019

Best Subset Selection in Reduced Rank Regression

Reduced rank regression is popularly used for modeling the relationship ...
research
09/25/2017

Understanding a Version of Multivariate Symmetric Uncertainty to assist in Feature Selection

In this paper, we analyze the behavior of the multivariate symmetric unc...
research
07/12/2017

A Cluster Fusion Penalty for Grouping Response Variables in Multivariate Regression Models

We propose a method for estimating coefficients in multivariate regressi...
research
07/07/2023

Scalable High-Dimensional Multivariate Linear Regression for Feature-Distributed Data

Feature-distributed data, referred to data partitioned by features and s...
research
11/29/2022

Simultaneous Best Subset Selection and Dimension Reduction via Primal-Dual Iterations

Sparse reduced rank regression is an essential statistical learning meth...

Please sign up or login with your details

Forgot password? Click here to reset