Distributed Clustering Algorithm for Spatial Data Mining

02/01/2018
by Malika Bendechache, et al.

Distributed data mining techniques, and distributed clustering in particular, have been widely used over the last decade because they can handle very large, heterogeneous datasets that cannot be gathered centrally. Current distributed clustering approaches typically generate global models by aggregating local results obtained on each site. While this approach mines the datasets where they are located, the aggregation phase is complex and may produce incorrect or ambiguous global clusters, and therefore incorrect knowledge. In this paper, we propose a new clustering approach for very large spatial datasets that are heterogeneous and distributed. The approach is based on the K-means algorithm, but it generates the number of global clusters dynamically. Moreover, it uses an elaborate aggregation phase, designed so that the overall process is efficient in time and memory allocation. Preliminary results show that the proposed approach produces high-quality results and scales well. We also compared it to two popular clustering algorithms and show that it is much more efficient.
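The general pattern described in the abstract (cluster locally on each site, then aggregate the local results into a global model whose cluster count is not fixed in advance) can be illustrated with a minimal sketch. This is not the paper's algorithm: the abstract does not specify the aggregation rule, so the sketch assumes a simple stand-in that merges local centroids lying closer than a threshold. The names `kmeans`, `merge_centroids`, `blob`, and `merge_dist`, the farthest-first initialization, and the toy data are all hypothetical choices made for illustration.

```python
import math


def kmeans(points, k, iters=20):
    """Plain Lloyd's K-means on (x, y) tuples; returns k centroids.

    Uses a deterministic farthest-first initialization (an assumption
    made here for reproducibility, not taken from the paper).
    """
    centroids = [points[0]]
    while len(centroids) < k:
        centroids.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centroids))
        )
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: math.dist(p, centroids[i]))
            groups[nearest].append(p)
        # Recompute each centroid; keep the old one if its group is empty.
        centroids = [
            (sum(x for x, _ in g) / len(g), sum(y for _, y in g) / len(g)) if g else c
            for g, c in zip(groups, centroids)
        ]
    return centroids


def merge_centroids(local_centroids, merge_dist):
    """Assumed aggregation rule: link centroids closer than merge_dist
    (union-find) and average each linked group. The number of global
    clusters therefore falls out of the data instead of being fixed."""
    parent = list(range(len(local_centroids)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(local_centroids)):
        for j in range(i + 1, len(local_centroids)):
            if math.dist(local_centroids[i], local_centroids[j]) < merge_dist:
                parent[find(i)] = find(j)

    groups = {}
    for i, c in enumerate(local_centroids):
        groups.setdefault(find(i), []).append(c)
    return [
        (sum(x for x, _ in g) / len(g), sum(y for _, y in g) / len(g))
        for g in groups.values()
    ]


def blob(cx, cy):
    """Toy data: a 5x5 grid of points centred on (cx, cy)."""
    return [(cx + 0.1 * dx, cy + 0.1 * dy) for dx in range(-2, 3) for dy in range(-2, 3)]


# Two sites hold different data; one cluster, near (10, 10), spans both sites.
site_a = blob(0, 0) + blob(10, 10)
site_b = blob(10, 10) + blob(20, 0)

# Each site clusters its own data; only centroids are sent for aggregation.
local_centroids = kmeans(site_a, k=2) + kmeans(site_b, k=2)
global_centroids = merge_centroids(local_centroids, merge_dist=1.0)
print(len(global_centroids), "global clusters")
```

With this toy data, each site finds two local clusters, and the cluster shared by both sites is merged during aggregation, leaving three global clusters. Only the local centroids cross site boundaries, which is the property that makes this family of approaches attractive when the raw data cannot be gathered centrally.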

