Unsupervised Ensemble Learning via Ising Model Approximation with Application to Phenotyping Prediction

10/15/2018
by Luwan Zhang, et al.

Unsupervised ensemble learning has long been an interesting yet challenging problem, and it has come to prominence in recent years with the increasing demand for crowdsourcing in various applications. In this paper, we propose a novel method, unsupervised ensemble learning via Ising model approximation (unElisa), that combines a pruning step with a predicting step. We focus on the binary case and use an Ising model to characterize interactions between the ensemble and the underlying true classifier. The presence of an edge between an observed classifier and the true classifier indicates a direct dependence, whereas the absence of an edge indicates that the corresponding classifier provides no additional information and should be eliminated. This observation leads to the pruning step, where the key is to recover the neighborhood of the true classifier. We show that this neighborhood can be recovered successfully, with exponentially decaying error in the high-dimensional setting, by performing nodewise ℓ_1-regularized logistic regression. The pruned ensemble allows us to obtain a consistent estimate of the Bayes classifier for prediction. We also propose an augmented version of majority voting that reverses all labels given by a subgroup of the pruned ensemble. We demonstrate the efficacy of our method through extensive numerical experiments and through an application to EHR-based phenotyping prediction for Rheumatoid Arthritis (RA) using data from Partners Healthcare System.
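
The augmented majority vote mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `augmented_majority_vote`, the {-1, +1} label encoding, the tie-breaking rule, and the way the pruned set and the reversed subgroup are supplied as inputs are all assumptions made for illustration. The abstract does not say how the reversed subgroup is chosen, so the sketch simply takes it as given.

```python
from typing import Iterable, List, Sequence


def augmented_majority_vote(
    labels: Sequence[Sequence[int]],
    kept: Iterable[int],
    reversed_group: Iterable[int],
) -> List[int]:
    """Majority vote over a pruned ensemble after flipping a subgroup.

    labels[i][j] is the label in {-1, +1} that classifier j gives sample i.
    kept: indices of classifiers that survive pruning (hypothetical input).
    reversed_group: subset of kept whose labels are negated before voting.
    """
    kept = list(kept)
    flip = set(reversed_group)
    if not flip <= set(kept):
        raise ValueError("reversed_group must be a subset of kept")
    predictions = []
    for row in labels:
        # Sum the votes of the kept classifiers, negating the flipped ones.
        total = sum(-row[j] if j in flip else row[j] for j in kept)
        # Ties are broken toward +1; this rule is an assumption.
        predictions.append(1 if total >= 0 else -1)
    return predictions


# Toy data: 3 samples, 4 classifiers. Classifier 3 is pruned away and
# classifier 1 is treated as systematically inverted.
votes = [
    [+1, -1, +1, +1],
    [-1, +1, -1, +1],
    [+1, -1, -1, -1],
]
print(augmented_majority_vote(votes, kept=[0, 1, 2], reversed_group=[1]))
# prints [1, -1, 1]
```

With an empty `reversed_group` and every classifier kept, the function reduces to ordinary majority voting, which makes it easy to compare the two rules on the same label matrix.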

research
07/04/2018

Direct Uncertainty Prediction with Applications to Healthcare

Large labeled datasets for supervised learning are frequently constructe...
research
02/17/2021

Split Modeling for High-Dimensional Logistic Regression

A novel method is proposed to learn an ensemble of logistic classificati...
research
10/28/2019

Ensemble Quantile Classifier

Both the median-based classifier and the quantile-based classifier are u...
research
06/13/2018

Ensemble Pruning based on Objection Maximization with a General Distributed Framework

Ensemble pruning, selecting a subset of individual learners from an orig...
research
07/13/2021

Exploiting Image Translations via Ensemble Self-Supervised Learning for Unsupervised Domain Adaptation

We introduce an unsupervised domain adaption (UDA) strategy that combine...
research
09/15/2021

On-the-Fly Ensemble Pruning in Evolving Data Streams

Ensemble pruning is the process of selecting a subset of component classi...
research
11/06/2021

On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

Desert locust outbreaks threaten the food security of a large part of Af...
