Optimal subdata selection for linear model selection

06/19/2023
by Vasilis Chasiotis, et al.

If the assumed model does not accurately capture the underlying structure of the data, a statistical method is likely to yield sub-optimal results, so model selection is a crucial step in any statistical analysis. However, for massive datasets, selecting an appropriate model from a large pool of candidates becomes computationally challenging, and limited research has been conducted on data selection for model selection. In this study, we conduct subdata selection based on the A-optimality criterion, which allows model selection to be performed on a smaller subset of the data. We evaluate our approach through simulation experiments and two real data applications, based on the probability of selecting the best model and on the estimation efficiency.
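To make the A-optimality criterion concrete, here is a minimal sketch in Python. It is not the authors' algorithm. It assumes a simple linear model y = b0 + b1*x + error, for which the criterion is the trace of the inverse information matrix, tr((X^T X)^{-1}), and smaller values indicate lower average variance of the coefficient estimates. The function names `a_criterion`, `extreme_subdata`, and `central_subdata` are hypothetical, and the two selection rules are illustrative heuristics only.

```python
def a_criterion(xs):
    """A-optimality value tr((X^T X)^{-1}) for the model y = b0 + b1*x + error.

    For this model X^T X = [[n, sum(x)], [sum(x), sum(x^2)]], so the trace of
    its inverse has the closed form (n + sum(x^2)) / (n*sum(x^2) - sum(x)^2).
    """
    n = len(xs)
    sx = sum(xs)
    sxx = sum(x * x for x in xs)
    det = n * sxx - sx * sx
    if det <= 0:
        raise ValueError("singular information matrix")
    return (n + sxx) / det


def extreme_subdata(xs, k):
    """Hypothetical heuristic: keep the k//2 smallest and k//2 largest values."""
    ordered = sorted(xs)
    half = k // 2
    return ordered[:half] + ordered[len(ordered) - half:]


def central_subdata(xs, k):
    """Hypothetical baseline: keep the k values in the middle of the range."""
    ordered = sorted(xs)
    start = (len(ordered) - k) // 2
    return ordered[start:start + k]


# Assumed toy data: 101 equally spaced covariate values on [-5, 5].
full = [i / 10 for i in range(-50, 51)]
extreme = extreme_subdata(full, 10)
central = central_subdata(full, 10)

# A lower criterion value means the subdata is more informative for estimation.
print("extreme subdata:", a_criterion(extreme))
print("central subdata:", a_criterion(central))
```

In this toy setting the subdata drawn from the extremes of the covariate range gives a smaller criterion value than the subdata drawn from the centre, which illustrates why the choice of subset matters before any model selection is run on it.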

research
06/26/2018

LOO and WAIC as Model Selection Methods for Polytomous Items

Watanabe-Akaike information criterion (WAIC; Watanabe, 2010) and leave-o...
research
10/22/2018

Model Selection Techniques -- An Overview

In the era of big data, analysts usually explore various statistical mod...
research
05/27/2021

Model Selection for Production System via Automated Online Experiments

A challenge that machine learning practitioners in the industry face is ...
research
07/10/2018

Fast Model-Selection through Adapting Design of Experiments Maximizing Information Gain

To perform model-selection efficiently, we must run informative experime...
research
06/28/2019

FIESTA: Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms

We present FIESTA, a model selection approach that significantly reduces...
research
02/07/2019

Model Selection for Simulator-based Statistical Models: A Kernel Approach

We propose a novel approach to model selection for simulator-based stati...
research
05/09/2015

Simultaneous Clustering and Model Selection for Multinomial Distribution: A Comparative Study

In this paper, we study different discrete data clustering methods, whic...
