Consensual aggregation of clusters based on Bregman divergences to improve predictive models

09/20/2019
by   Aurélie Fisher, et al.

We introduce a new procedure for constructing predictive models in supervised learning problems that takes the clustering structure of the input data into account. We are interested in situations where the input data consist of more than one unknown cluster and where a different underlying model holds on each cluster. Thus, instead of constructing a single predictive model on the whole dataset, we propose to use a K-means clustering algorithm with several choices of Bregman divergence to recover the clustering structure of the input data. For each divergence, we then fit a simple local predictor on each of the K observed clusters. This yields one estimator per divergence, namely the collection of the K simple local predictors, and we propose to combine these estimators using a consensus idea. Several versions of consensual aggregation are considered, for both classification and regression problems. A comparison of all constructed estimators on simulated and real data shows that the method performs very well: in a large variety of prediction problems, the consensual aggregation procedure outperforms all the other models.
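The pipeline described in the abstract can be illustrated with a minimal sketch for the regression case. It is not the authors' implementation, and several choices are assumptions made for illustration only: the two divergences (squared Euclidean and generalized Kullback-Leibler), least-squares linear models as the "simple local predictors", the synthetic data, the threshold `eps`, and all function names. The consensus step follows one common reading of the idea: the prediction for a query point is the average response of held-out points on which every estimator predicts within `eps` of its prediction at the query.

```python
import numpy as np

def sq_euclidean(X, c):
    # Bregman divergence generated by phi(x) = ||x||^2
    return ((X - c) ** 2).sum(axis=1)

def gen_kl(X, c):
    # Generalized Kullback-Leibler divergence (phi(x) = sum x log x); needs positive entries
    return (X * np.log(X / c) - X + c).sum(axis=1)

def assign(X, centers, div):
    # Label each point with the centroid of smallest divergence
    return np.argmin(np.stack([div(X, c) for c in centers], axis=1), axis=1)

def bregman_kmeans(X, k, div, n_iter=50, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(X, centers, div)
        # For any Bregman divergence the optimal centroid is the arithmetic mean
        centers = np.stack([X[labels == j].mean(axis=0) if np.any(labels == j)
                            else centers[j] for j in range(k)])
    return centers

def fit_local_linear(X, y, labels, k):
    # One least-squares linear model (with intercept) per cluster
    A = np.hstack([X, np.ones((len(X), 1))])
    fallback = np.linalg.lstsq(A, y, rcond=None)[0]  # used for tiny clusters
    coefs = []
    for j in range(k):
        m = labels == j
        coefs.append(np.linalg.lstsq(A[m], y[m], rcond=None)[0]
                     if m.sum() > X.shape[1] else fallback)
    return np.array(coefs)

def predict_local(X, centers, div, coefs):
    # Route each point to its cluster and apply that cluster's model
    labels = assign(X, centers, div)
    A = np.hstack([X, np.ones((len(X), 1))])
    return (A * coefs[labels]).sum(axis=1)

def consensus_aggregate(P_held, y_held, P_query, eps):
    # P_held, P_query: (n, M) predictions of the M estimators (one per divergence)
    out = np.empty(len(P_query))
    for i, p in enumerate(P_query):
        agree = np.all(np.abs(P_held - p) <= eps, axis=1)
        # Fall back to the plain average of the M predictions if no point agrees
        out[i] = y_held[agree].mean() if agree.any() else p.mean()
    return out

# Synthetic data (hypothetical): two positive-valued clusters with different linear models
rng = np.random.default_rng(0)
n = 200
X = np.vstack([rng.uniform(1, 2, (n, 2)), rng.uniform(5, 6, (n, 2))])
y = np.where(np.arange(2 * n) < n, 1 + 2 * X[:, 0], 30 - 3 * X[:, 0])
y = y + 0.1 * rng.normal(size=2 * n)

# Split: fit the estimators, hold out points for the consensus step, test
perm = rng.permutation(2 * n)
fit_idx, agg_idx, test_idx = perm[:150], perm[150:300], perm[300:]

preds_agg, preds_test = [], []
for div in (sq_euclidean, gen_kl):
    centers = bregman_kmeans(X[fit_idx], 2, div)
    labels = assign(X[fit_idx], centers, div)
    coefs = fit_local_linear(X[fit_idx], y[fit_idx], labels, 2)
    preds_agg.append(predict_local(X[agg_idx], centers, div, coefs))
    preds_test.append(predict_local(X[test_idx], centers, div, coefs))

y_hat = consensus_aggregate(np.column_stack(preds_agg), y[agg_idx],
                            np.column_stack(preds_test), eps=0.5)
mse = np.mean((y_hat - y[test_idx]) ** 2)
print("test MSE of the consensual aggregate:", mse)
```

The centroid update is the same for every divergence because the arithmetic mean minimizes the total Bregman divergence to a set of points; only the assignment step changes with the divergence. The held-out split used for the consensus step keeps the aggregation from reusing the points on which the local predictors were fit.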


