Improved Multi-objective Data Stream Clustering with Time and Memory Optimization

01/13/2022
by   Mohammed Oualid Attaoui, et al.
0

The analysis of data streams has received considerable attention over the past few decades due to sensors, social media, etc. It aims to recognize patterns in an unordered, infinite, and evolving stream of observations. Clustering this type of data requires some restrictions in time and memory. This paper introduces a new data stream clustering method (IMOC-Stream). This method, unlike the other clustering algorithms, uses two different objective functions to capture different aspects of the data. The goal of IMOC-Stream is to: 1) reduce computation time by using idle times to apply genetic operations and enhance the solution. 2) reduce memory allocation by introducing a new tree synopsis. 3) find arbitrarily shaped clusters by using a multi-objective framework. We conducted an experimental study with high dimensional stream datasets and compared them to well-known stream clustering techniques. The experiments show the ability of our method to partition the data stream in arbitrarily shaped, compact, and well-separated clusters while optimizing the time and memory. Our method also outperformed most of the stream algorithms in terms of NMI and ARAND measures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2018

Fuzzy Clustering to Identify Clusters at Different Levels of Fuzziness: An Evolutionary Multi-Objective Optimization Approach

Fuzzy clustering methods identify naturally occurring clusters in a data...
research
01/26/2022

Multi-objective Semi-supervised Clustering for Finding Predictive Clusters

This study concentrates on clustering problems and aims to find compact ...
research
06/24/2022

SECLEDS: Sequence Clustering in Evolving Data Streams via Multiple Medoids and Medoid Voting

Sequence clustering in a streaming environment is challenging because it...
research
09/09/2023

Benne: A Modular and Self-Optimizing Algorithm for Data Stream Clustering

In various real-world applications, ranging from the Internet of Things ...
research
05/23/2023

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Combining end-to-end neural speaker diarization (EEND) with vector clust...
research
10/26/2020

Multi-Objective Frequent Termset Clustering

Large media collections rapidly evolve in the World Wide Web. In additio...
research
11/27/2018

Adaptive Wavelet Clustering for High Noise Data

In this paper we make progress on the unsupervised task of mining arbitr...

Please sign up or login with your details

Forgot password? Click here to reset