Verbal Characterization of Probabilistic Clusters using Minimal Discriminative Propositions

08/25/2011
by Yoshitaka Kameya, et al.

In knowledge discovery, interpreting and evaluating the mined results is indispensable in practice. In data clustering, however, it is often difficult to see which aspects of the data each cluster reflects. This paper proposes a method for automatic and objective characterization, or "verbalization", of the clusters obtained by mixture models, in which we collect conjunctions of propositions (attribute-value pairs) that help us interpret or evaluate the clusters. The proposed method provides a new, in-depth, and consistent tool for cluster interpretation and evaluation, and works for various types of datasets, including those with continuous attributes and missing values. Experimental results on a couple of standard datasets demonstrate the utility of the proposed method and the importance of feedback from the interpretation/evaluation step.
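To make the idea of characterizing a cluster by conjunctions of attribute-value pairs concrete, here is a minimal Python sketch. It is not the paper's algorithm: the abstract does not state the scoring criterion, so the sketch assumes an F-score computed from soft (posterior) cluster memberships, and it assumes "minimal" means that a conjunction is kept only if it scores strictly higher than every shorter conjunction it contains. The names `characterize`, `f_score`, `resp`, and the toy data are all hypothetical.

```python
from itertools import combinations

def matches(record, conj):
    # A missing value (None) never satisfies a proposition.
    return all(record.get(attr) == val for attr, val in conj)

def f_score(records, resp, cluster, conj):
    # resp[i][cluster] is the (assumed) posterior probability that
    # record i belongs to `cluster`, e.g. taken from a mixture model.
    hits = [r[cluster] for rec, r in zip(records, resp) if matches(rec, conj)]
    covered = sum(hits)
    if covered == 0:
        return 0.0
    precision = covered / len(hits)
    recall = covered / sum(r[cluster] for r in resp)
    return 2 * precision * recall / (precision + recall)

def characterize(records, resp, cluster, max_len=2):
    # Candidate propositions: every observed attribute-value pair.
    props = sorted({(a, v) for rec in records
                    for a, v in rec.items() if v is not None})
    all_scores, kept = {}, []
    for k in range(1, max_len + 1):
        for conj in combinations(props, k):
            if len({a for a, _ in conj}) < k:
                continue  # skip conjunctions that reuse an attribute
            score = f_score(records, resp, cluster, conj)
            all_scores[conj] = score
            # Assumed minimality test: beat every proper sub-conjunction.
            subs = [all_scores[s] for j in range(1, k)
                    for s in combinations(conj, j)]
            if score > 0 and all(score > s for s in subs):
                kept.append((conj, score))
    return sorted(kept, key=lambda item: -item[1])

# Toy data: four records, two clusters, one missing value.
records = [
    {"color": "red", "size": "big"},
    {"color": "red", "size": "small"},
    {"color": "blue", "size": "big"},
    {"color": "blue", "size": None},
]
resp = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]

for conj, score in characterize(records, resp, cluster=0):
    print(" AND ".join(f"{a}={v}" for a, v in conj), round(score, 3))
```

On this toy data the single proposition `color=red` ranks first for cluster 0, and no two-pair conjunction survives the minimality test because none improves on its parts. Continuous attributes would need to be discretized into intervals before forming propositions; the sketch leaves that step out.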


