Enhancing Cluster Quality of Numerical Datasets with Domain Ontology

Ontology-based clustering has gained attention in recent years due to the potential benefits of ontology. Current ontology-based clustering approaches have mainly been applied to reduce the dimensionality of attributes in text document clustering. Reduction in dimensionality of attributes using ontology helps to produce high quality clusters for a dataset. However, ontology-based approaches in clustering numerical datasets have not been gained enough attention. Moreover, some literature mentions that ontology-based clustering can produce either high quality or low-quality clusters from a dataset. Therefore, in this paper we present a clustering approach that is based on domain ontology to reduce the dimensionality of attributes in a numerical dataset using domain ontology and to produce high quality clusters. For every dataset, we produce three datasets using domain ontology. We then cluster these datasets using a genetic algorithm-based clustering technique called GenClust++. The clusters of each dataset are evaluated in terms of Sum of Squared-Error (SSE). We use six numerical datasets to evaluate the performance of our ontology-based approach. The experimental results of our approach indicate that cluster quality gradually improves from lower to the higher levels of a domain ontology.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/30/2017

Preliminary results on Ontology-based Open Data Publishing

Despite the current interest in Open Data publishing, a formal and compr...
research
02/17/2018

Technique for designing a domain ontology

The article describes the technique for designing a domain ontology, sho...
research
08/06/2015

Hyponymy extraction of domain ontology concept based on ccrfs and hierarchy clustering

Concept hierarchy is the backbone of ontology, and the concept hierarchy...
research
01/25/2019

Comparing of Term Clustering Frameworks for Modular Ontology Learning

This paper aims to use term clustering to build a modular ontology accor...
research
02/07/2021

Effective and Scalable Clustering on Massive Attributed Graphs

Given a graph G where each node is associated with a set of attributes, ...
research
08/03/2016

Improving Quality of Hierarchical Clustering for Large Data Series

Brown clustering is a hard, hierarchical, bottom-up clustering of words ...
research
10/22/2018

Biomedical Document Clustering and Visualization based on the Concepts of Diseases

Document clustering is a text mining technique used to provide better do...

Please sign up or login with your details

Forgot password? Click here to reset