Variable selection by GLM-Lasso for malaria risk prediction

09/09/2015
by Bienvenue Kouwayè, et al.

In this study, we propose a machine learning method for variable selection based on the Lasso in an epidemiological context. One aim of this approach is to remove the need for the preprocessing that experts in medicine and epidemiology apply to collected data. This preprocessing consists of recoding some variables and choosing some interactions on the basis of expertise. The proposed approach uses all available explanatory variables without any preprocessing and automatically generates all interactions between them. This leads to a high-dimensional problem. We use the Lasso, one of the robust methods for variable selection in high dimension. To avoid overfitting, a two-level cross-validation is used. Because the target variable is a count variable and the Lasso estimators are biased, the variables selected by the Lasso are debiased by a GLM and used to predict the distribution of Anopheles, the main vector of malaria. The results show that only a few climatic and environmental variables are the main factors associated with exposure to malaria risk.
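
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the data are synthetic, all function names are hypothetical, a Gaussian Lasso on log(1 + count) stands in for the selection step, and the two-level cross-validation is reduced to one outer hold-out split plus an inner k-fold search over a small, arbitrary grid of penalties. The selected columns are then refitted with an unpenalized Poisson GLM (log link), which is one plausible reading of "debiased by a GLM".

```python
import itertools
import numpy as np

def add_interactions(X):
    """Append every pairwise product x_i * x_j (i < j) to the columns of X."""
    pairs = itertools.combinations(range(X.shape[1]), 2)
    inter = np.column_stack([X[:, i] * X[:, j] for i, j in pairs])
    return np.hstack([X, inter])

def lasso_cd(X, y, lam, n_iter=100):
    """Lasso by cyclic coordinate descent: min (1/2n)||y - Xb||^2 + lam * ||b||_1.
    Assumes standardized columns of X and a centered response y."""
    n, p = X.shape
    beta = np.zeros(p)
    resid = np.asarray(y, dtype=float).copy()
    col_sq = (X ** 2).sum(axis=0) / n
    for _ in range(n_iter):
        for j in range(p):
            resid += X[:, j] * beta[j]          # remove feature j's contribution
            rho = X[:, j] @ resid / n
            # soft-thresholding update
            beta[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            resid -= X[:, j] * beta[j]
    return beta

def cv_lambda(X, y, lambdas, k=5, seed=0):
    """Inner level: pick the penalty with the lowest k-fold mean squared error."""
    folds = np.array_split(np.random.default_rng(seed).permutation(len(y)), k)
    errs = []
    for lam in lambdas:
        err = 0.0
        for f in folds:
            tr = np.setdiff1d(np.arange(len(y)), f)
            b = lasso_cd(X[tr], y[tr] - y[tr].mean(), lam)
            err += np.mean((y[f] - (X[f] @ b + y[tr].mean())) ** 2)
        errs.append(err / k)
    return lambdas[int(np.argmin(errs))]

def poisson_glm(X, y, n_iter=50):
    """Unpenalized Poisson GLM with log link, fitted by Newton's method.
    The first column of X must be the intercept."""
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean() + 1e-8)
    for _ in range(n_iter):
        mu = np.exp(np.clip(X @ beta, -30, 30))
        hess = X.T @ (X * mu[:, None]) + 1e-8 * np.eye(X.shape[1])
        beta += np.linalg.solve(hess, X.T @ (y - mu))
    return beta

# Synthetic data (assumption): 6 raw covariates, counts driven by x0 and x1*x2.
rng = np.random.default_rng(0)
n, p = 300, 6
X = rng.normal(size=(n, p))
y = rng.poisson(np.exp(0.5 + 0.8 * X[:, 0] - 0.6 * X[:, 1] * X[:, 2]))
Z = add_interactions(X)                          # 6 main effects + 15 interactions

# Outer level: hold out a test set; standardize with training statistics only.
idx = rng.permutation(n)
test, train = idx[:60], idx[60:]
Zs = (Z - Z[train].mean(axis=0)) / Z[train].std(axis=0)

# Selection step: Lasso on log(1 + count), penalty chosen by the inner CV.
t = np.log1p(y)
lam = cv_lambda(Zs[train], t[train], [0.01, 0.03, 0.1, 0.3])
selected = np.flatnonzero(lasso_cd(Zs[train], t[train] - t[train].mean(), lam))

# Debiasing step: refit only the selected columns with a Poisson GLM.
D = np.column_stack([np.ones(n), Zs[:, selected]])
beta_glm = poisson_glm(D[train], y[train])
pred = np.exp(D[test] @ beta_glm)                # predicted mean counts on the test set
print("selected columns:", selected)
```

Refitting without a penalty matters because the Lasso shrinks every retained coefficient toward zero; the GLM refit keeps the selected support but removes that shrinkage and respects the count nature of the response.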


