Semi-analytic approximate stability selection for correlated data in generalized linear models

03/19/2020
by   Takashi Takahashi, et al.
0

We consider the variable selection problem of generalized linear models (GLMs). Stability selection (SS) is a promising method proposed for solving this problem. Although SS provides practical variable selection criteria, it is computationally demanding because it needs to fit GLMs to many re-sampled datasets. We propose a novel approximate inference algorithm that can conduct SS without the repeated fitting. The algorithm is based on the replica method of statistical mechanics and vector approximate message passing of information theory. For datasets characterized by rotation-invariant matrix ensembles, we derive state evolution equations that macroscopically describe the dynamics of the proposed algorithm. We also show that their fixed points are consistent with the replica symmetric solution obtained by the replica method. Numerical experiments indicate that the algorithm exhibits fast convergence and high approximation accuracy for both synthetic and real-world data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2019

Replicated Vector Approximate Message Passing For Resampling Problem

Resampling techniques are widely used in statistical inference and ensem...
research
01/09/2020

Macroscopic Analysis of Vector Approximate Message Passing in a Model Mismatch Setting

Vector approximate message passing (VAMP) is an efficient approximate in...
research
02/28/2018

Semi-Analytic Resampling in Lasso

An approximate method for conducting resampling in Lasso, the ℓ_1 penali...
research
09/20/2021

Variable Selection in GLM and Cox Models with Second-Generation P-Values

Variable selection has become a pivotal choice in data analyses that imp...
research
03/22/2023

Scalable Bayesian bi-level variable selection in generalized linear models

Motivated by a real-world application in cardiology, we develop an algor...
research
08/01/2023

Best-Subset Selection in Generalized Linear Models: A Fast and Consistent Algorithm via Splicing Technique

In high-dimensional generalized linear models, it is crucial to identify...
research
07/26/2022

An exhaustive variable selection study for linear models of soundscape emotions: rankings and Gibbs analysis

In the last decade, soundscapes have become one of the most active topic...

Please sign up or login with your details

Forgot password? Click here to reset