Robust adaptive Lasso in high-dimensional logistic regression with an application to genomic classification of cancer patients

08/20/2021
by   Ayanendranath Basu, et al.
0

Penalized logistic regression is extremely useful for binary classification with a large number of covariates (significantly higher than the sample size), having several real life applications, including genomic disease classification. However, the existing methods based on the likelihood based loss function are sensitive to data contamination and other noise and, hence, robust methods are needed for stable and more accurate inference. In this paper, we propose a family of robust estimators for sparse logistic models utilizing the popular density power divergence based loss function and the general adaptively weighted LASSO penalties. We study the local robustness of the proposed estimators through its influence function and also derive its oracle properties and asymptotic distribution. With extensive empirical illustrations, we clearly demonstrate the significantly improved performance of our proposed estimators over the existing ones with particular gain in robustness. Our proposal is finally applied to analyse four different real datasets for cancer classification, obtaining robust and accurate models, that simultaneously performs gene selection and patient classification.

READ FULL TEXT
research
06/11/2020

Weighted Lasso Estimates for Sparse Logistic Regression: Non-asymptotic Properties with Measurement Error

When we are interested in high-dimensional system and focus on classific...
research
02/05/2021

A new robust approach for multinomial logistic regression with complex design model

Robust estimators and Wald-type tests are developed for the multinomial ...
research
01/11/2023

Optirank: classification for RNA-Seq data with optimal ranking reference genes

Classification algorithms using RNA-Sequencing (RNA-Seq) data as input a...
research
01/28/2022

Asymptotic behaviour of penalized robust estimators in logistic regression when dimension increases

Penalized M-estimators for logistic regression models have been previous...
research
04/03/2019

Robust semiparametric inference for polytomous logistic regression with complex survey design

Analyzing polytomous response from a complex survey scheme, like stratif...
research
10/26/2022

High-dimensional Measurement Error Models for Lipschitz Loss

Recently emerging large-scale biomedical data pose exciting opportunitie...
research
05/02/2023

Robust and Adaptive Functional Logistic Regression

We introduce and study a family of robust estimators for the functional ...

Please sign up or login with your details

Forgot password? Click here to reset