Split Modeling for High-Dimensional Logistic Regression

A novel method is proposed to learn an ensemble of logistic classification models for high-dimensional binary classification. The models in the ensemble are built simultaneously by optimizing a multi-convex objective function. To enforce diversity among the models, the objective function penalizes overlap between them. We study the bias and variance of the individual models as well as their correlation, and discuss how our method learns the ensemble by exploiting the accuracy-diversity trade-off for ensemble models. In contrast to other ensembling approaches, the resulting ensemble model is fully interpretable as a logistic regression model while yielding excellent prediction accuracy, as demonstrated in an extensive simulation study and gene expression data applications. An open-source compiled software library implementing the proposed method is briefly discussed.
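As a rough illustration of the kind of objective described above, the following minimal sketch evaluates a joint loss for several logistic models. The abstract does not give the exact form of the objective, so everything here is an assumption: the penalty is taken to be an L1 sparsity term plus a pairwise overlap term (the sum over model pairs of products of absolute coefficients), the ensemble is formed by averaging coefficients, and all function and parameter names (`split_objective`, `lam_sparsity`, `lam_diversity`, and so on) are hypothetical and do not come from the authors' library.

```python
import math

def logistic_loss(beta0, beta, X, y):
    """Average negative log-likelihood of one logistic model, labels in {0, 1}."""
    total = 0.0
    for xi, yi in zip(X, y):
        z = beta0 + sum(b * x for b, x in zip(beta, xi))
        # Numerically stable log(1 + exp(z)).
        softplus = max(z, 0.0) + math.log1p(math.exp(-abs(z)))
        total += softplus - yi * z
    return total / len(y)

def overlap_penalty(betas):
    """Assumed diversity term: sum over model pairs of sum_j |b_j^g| * |b_j^h|.

    It is zero exactly when no feature is used by two models at once.
    """
    total = 0.0
    for g in range(len(betas)):
        for h in range(g + 1, len(betas)):
            total += sum(abs(a) * abs(b) for a, b in zip(betas[g], betas[h]))
    return total

def split_objective(intercepts, betas, X, y, lam_sparsity, lam_diversity):
    """Joint objective over all models (illustrative form, not the paper's)."""
    fit = sum(logistic_loss(b0, b, X, y) for b0, b in zip(intercepts, betas))
    sparsity = sum(abs(c) for b in betas for c in b)
    return fit + lam_sparsity * sparsity + lam_diversity * overlap_penalty(betas)

def ensemble_coefficients(intercepts, betas):
    """Average the models' coefficients, giving a single logistic model (assumption)."""
    n_models = len(betas)
    avg_intercept = sum(intercepts) / n_models
    avg_beta = [sum(col) / n_models for col in zip(*betas)]
    return avg_intercept, avg_beta

# Toy data: 4 observations, 2 features.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
y = [1, 0, 1, 0]
intercepts = [0.0, 0.0]
betas = [[1.0, 0.0], [0.0, -1.0]]  # the two models use disjoint features
value = split_objective(intercepts, betas, X, y, lam_sparsity=0.1, lam_diversity=1.0)
```

Under this assumed form, fixing all models but one leaves a convex problem in the remaining model's coefficients (a logistic loss plus a weighted L1 penalty), which is one way an objective can be multi-convex without being jointly convex. Averaging on the coefficient scale is what keeps the combined predictor a single logistic regression model in this sketch.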
