DeepAI AI Chat
Log In Sign Up

Semi-analytic approximate stability selection for correlated data in generalized linear models

03/19/2020
by   Takashi Takahashi, et al.
0

We consider the variable selection problem of generalized linear models (GLMs). Stability selection (SS) is a promising method proposed for solving this problem. Although SS provides practical variable selection criteria, it is computationally demanding because it needs to fit GLMs to many re-sampled datasets. We propose a novel approximate inference algorithm that can conduct SS without the repeated fitting. The algorithm is based on the replica method of statistical mechanics and vector approximate message passing of information theory. For datasets characterized by rotation-invariant matrix ensembles, we derive state evolution equations that macroscopically describe the dynamics of the proposed algorithm. We also show that their fixed points are consistent with the replica symmetric solution obtained by the replica method. Numerical experiments indicate that the algorithm exhibits fast convergence and high approximation accuracy for both synthetic and real-world data.

READ FULL TEXT

page 1

page 2

page 3

page 4

05/23/2019

Replicated Vector Approximate Message Passing For Resampling Problem

Resampling techniques are widely used in statistical inference and ensem...
01/09/2020

Macroscopic Analysis of Vector Approximate Message Passing in a Model Mismatch Setting

Vector approximate message passing (VAMP) is an efficient approximate in...
02/28/2018

Semi-Analytic Resampling in Lasso

An approximate method for conducting resampling in Lasso, the ℓ_1 penali...
09/20/2021

Variable Selection in GLM and Cox Models with Second-Generation P-Values

Variable selection has become a pivotal choice in data analyses that imp...
03/22/2023

Scalable Bayesian bi-level variable selection in generalized linear models

Motivated by a real-world application in cardiology, we develop an algor...
08/05/2015

Bayesian Approximate Kernel Regression with Variable Selection

Nonlinear kernel regression models are often used in statistics and mach...