Exhaustive search for sparse variable selection in linear regression

07/07/2017
by   Yasuhiko Igarashi, et al.
0

We propose a K-sparse exhaustive search (ES-K) method and a K-sparse approximate exhaustive search method (AES-K) for selecting variables in linear regression. With these methods, K-sparse combinations of variables are tested exhaustively assuming that the optimal combination of explanatory variables is K-sparse. By collecting the results of exhaustively computing ES-K, various approximate methods for selecting sparse variables can be summarized as density of states. With this density of states, we can compare different methods for selecting sparse variables such as relaxation and sampling. For large problems where the combinatorial explosion of explanatory variables is crucial, the AES-K method enables density of states to be effectively reconstructed by using the replica-exchange Monte Carlo method and the multiple histogram method. Applying the ES-K and AES-K methods to type Ia supernova data, we confirmed the conventional understanding in astronomy when an appropriate K is given beforehand. However, we found the difficulty to determine K from the data. Using virtual measurement and analysis, we argue that this is caused by data shortage.

READ FULL TEXT
research
10/20/2022

Adaptive greedy forward variable selection for linear regression models with incomplete data using multiple imputation

Variable selection is crucial for sparse modeling in this age of big dat...
research
11/15/2018

Histogram-Free Multicanonical Monte Carlo Sampling to Calculate the Density of States

We report a new multicanonical Monte Carlo algorithm to obtain the densi...
research
05/29/2018

Statistical mechanical analysis of sparse linear regression as a variable selection problem

An algorithmic limit of compressed sensing or related variable-selection...
research
08/07/2020

Perfect Reconstruction of Sparse Signals via Greedy Monte-Carlo Search

We propose a Monte-Carlo-based method for reconstructing sparse signals ...
research
07/03/2023

Variable selection in a specific regression time series of counts

Time series of counts occurring in various applications are often overdi...
research
07/26/2022

An exhaustive variable selection study for linear models of soundscape emotions: rankings and Gibbs analysis

In the last decade, soundscapes have become one of the most active topic...
research
04/18/2020

Accumulator Bet Selection Through Stochastic Diffusion Search

An accumulator is a bet that presents a rather unique payout structure, ...

Please sign up or login with your details

Forgot password? Click here to reset