Supervised Homogeneity Fusion: a Combinatorial Approach

01/04/2022
by   Wen Wang, et al.
0

Fusing regression coefficients into homogenous groups can unveil those coefficients that share a common value within each group. Such groupwise homogeneity reduces the intrinsic dimension of the parameter space and unleashes sharper statistical accuracy. We propose and investigate a new combinatorial grouping approach called L_0-Fusion that is amenable to mixed integer optimization (MIO). On the statistical aspect, we identify a fundamental quantity called grouping sensitivity that underpins the difficulty of recovering the true groups. We show that L_0-Fusion achieves grouping consistency under the weakest possible requirement of the grouping sensitivity: if this requirement is violated, then the minimax risk of group misspecification will fail to converge to zero. Moreover, we show that in the high-dimensional regime, one can apply L_0-Fusion coupled with a sure screening set of features without any essential loss of statistical efficiency, while reducing the computational cost substantially. On the algorithmic aspect, we provide a MIO formulation for L_0-Fusion along with a warm start strategy. Simulation and real data analysis demonstrate that L_0-Fusion exhibits superiority over its competitors in terms of grouping accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2012

High-Dimensional Screening Using Multiple Grouping of Variables

Screening is the problem of finding a superset of the set of non-zero en...
research
08/03/2023

Regression models with repeated functional data

Linear regression and classification models with repeated functional dat...
research
06/25/2021

Feature Grouping and Sparse Principal Component Analysis

Sparse Principal Component Analysis (SPCA) is widely used in data proces...
research
06/11/2020

Grouped GEE Analysis for Longitudinal Data

Generalized estimating equation (GEE) is widely adopted for regression m...
research
06/10/2020

Robust Grouped Variable Selection Using Distributionally Robust Optimization

We propose a Distributionally Robust Optimization (DRO) formulation with...
research
08/20/2020

An Examination of Grouping and Spatial Organization Tasks for High-Dimensional Data Exploration

How do analysts think about grouping and spatial operations? This overar...
research
11/30/2018

Sensitivity-driven adaptive sparse stochastic approximations in plasma microinstability analysis

Quantifying uncertainty in predictive simulations for real-world problem...

Please sign up or login with your details

Forgot password? Click here to reset