Multi-objective Semi-supervised Clustering for Finding Predictive Clusters

01/26/2022
by Zahra Ghasemi, et al.

This study addresses clustering problems and aims to find compact clusters that are informative about the outcome variable. The main goal is to partition data points so that observations within each cluster are similar and, at the same time, the outcome variable can be predicted from these clusters. We model this semi-supervised clustering problem as a multi-objective optimization problem with two objective functions to be minimized: the deviation of data points within clusters and the prediction error of the outcome variable. To find optimal clustering solutions, we employ the non-dominated sorting genetic algorithm II (NSGA-II), and local regression is applied as the prediction method for the outcome variable. To assess the performance of the proposed model, we compute seven models using five real-world data sets. Furthermore, we investigate the impact of using local regression to predict the outcome variable in all models, and we examine the performance of the multi-objective models compared to single-objective models.
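The two objectives described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. All function names are hypothetical, the per-cluster mean outcome stands in for the local regression used in the study, and only the Pareto-dominance check underlying non-dominated sorting is shown, not the full NSGA-II loop (selection, crossover, mutation, crowding distance).

```python
from collections import defaultdict


def cluster_deviation(points, labels):
    """Objective 1: sum of squared distances from each point to its cluster centroid."""
    groups = defaultdict(list)
    for point, label in zip(points, labels):
        groups[label].append(point)
    total = 0.0
    for members in groups.values():
        dim = len(members[0])
        centroid = [sum(m[d] for m in members) / len(members) for d in range(dim)]
        total += sum((m[d] - centroid[d]) ** 2 for m in members for d in range(dim))
    return total


def prediction_error(outcomes, labels):
    """Objective 2: mean squared error of the outcome variable.

    Assumption: each point is predicted by its cluster's mean outcome,
    a simple stand-in for the local regression described in the abstract.
    """
    groups = defaultdict(list)
    for y, label in zip(outcomes, labels):
        groups[label].append(y)
    means = {label: sum(ys) / len(ys) for label, ys in groups.items()}
    return sum((y - means[label]) ** 2 for y, label in zip(outcomes, labels)) / len(outcomes)


def dominates(a, b):
    """True if score tuple a is no worse than b in every objective and better in one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_front(scores):
    """Indices of the non-dominated score tuples (both objectives minimized)."""
    return [
        i for i, s in enumerate(scores)
        if not any(dominates(t, s) for j, t in enumerate(scores) if j != i)
    ]


# Toy data (made up for illustration): four 2-D points and a numeric outcome.
points = [(0.0, 0.0), (0.0, 1.0), (5.0, 5.0), (5.0, 6.0)]
outcomes = [1.0, 1.0, 3.0, 3.0]

# Three candidate clusterings, each a list of cluster labels per point.
candidates = [
    [0, 0, 1, 1],
    [0, 1, 0, 1],
    [0, 0, 0, 1],
]

scores = [
    (cluster_deviation(points, labels), prediction_error(outcomes, labels))
    for labels in candidates
]
front = pareto_front(scores)
print(scores)
print(front)
```

In a multi-objective setting the result is a set of non-dominated clusterings rather than a single best one: a candidate stays on the front as long as no other candidate is at least as good in both compactness and prediction error and strictly better in one.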


research
08/09/2018

Fuzzy Clustering to Identify Clusters at Different Levels of Fuzziness: An Evolutionary Multi-Objective Optimization Approach

Fuzzy clustering methods identify naturally occurring clusters in a data...
research
08/15/2012

A Novel Strategy Selection Method for Multi-Objective Clustering Algorithms Using Game Theory

The most important factors which contribute to the efficiency of game-th...
research
01/13/2022

Improved Multi-objective Data Stream Clustering with Time and Memory Optimization

The analysis of data streams has received considerable attention over th...
research
10/26/2020

Multi-Objective Frequent Termset Clustering

Large media collections rapidly evolve in the World Wide Web. In additio...
research
08/31/2021

DoGR: Disaggregated Gaussian Regression for Reproducible Analysis of Heterogeneous Data

Quantitative analysis of large-scale data is often complicated by the pr...
research
03/03/2022

Multi-objective robust optimization using adaptive surrogate models for problems with mixed continuous-categorical parameters

Explicitly accounting for uncertainties is paramount to the safety of en...
research
12/16/2020

Predictive K-means with local models

Supervised classification can be effective for prediction but sometimes ...
