A Multi-objective Exploratory Procedure for Regression Model Selection

03/28/2012
by   Ankur Sinha, et al.
0

Variable selection is recognized as one of the most critical steps in statistical modeling. The problems encountered in engineering and social sciences are commonly characterized by over-abundance of explanatory variables, non-linearities and unknown interdependencies between the regressors. An added difficulty is that the analysts may have little or no prior knowledge on the relative importance of the variables. To provide a robust method for model selection, this paper introduces the Multi-objective Genetic Algorithm for Variable Selection (MOGA-VS) that provides the user with an optimal set of regression models for a given data-set. The algorithm considers the regression problem as a two objective task, and explores the Pareto-optimal (best subset) models by preferring those models over the other which have less number of regression coefficients and better goodness of fit. The model exploration can be performed based on in-sample or generalization error minimization. The model selection is proposed to be performed in two steps. First, we generate the frontier of Pareto-optimal regression models by eliminating the dominated models without any user intervention. Second, a decision making process is executed which allows the user to choose the most preferred model using visualisations and simple metrics. The method has been evaluated on a recently published real dataset on Communities and Crime within United States.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2019

Robust Model Selection for Finite Mixture of Regression Models Through Trimming

In this article, we introduce a new variable selection technique through...
research
06/22/2022

Optimally Weighted Ensembles of Regression Models: Exact Weight Optimization and Applications

Automated model selection is often proposed to users to choose which mac...
research
01/19/2017

Parameter Selection Algorithm For Continuous Variables

In this article, we propose a new algorithm for supervised learning meth...
research
10/25/2018

Model Selection using Multi-Objective Optimization

Choices in scientific research and management require balancing multiple...
research
09/15/2023

Information Criterion for a Large Scale Subset Regression Models

The information criterion for determining the number of explanatory vari...
research
05/22/2016

Causality on Longitudinal Data: Stable Specification Search in Constrained Structural Equation Modeling

A typical problem in causal modeling is the instability of model structu...

Please sign up or login with your details

Forgot password? Click here to reset