Scalable Formal Concept Analysis algorithm for large datasets using Spark

07/06/2018
by Raghavendra K Chunduri, et al.

When formal concept analysis is used for knowledge discovery and representation in large datasets, computational complexity is the main obstacle to identifying all the formal concepts and constructing the concept lattice (the digraph of the concepts). Various distributed algorithms are available in the literature for identifying the formal concepts and constructing the digraph from them in very large datasets. However, the existing distributed algorithms are not well suited to concept generation, because it is an iterative process. They are implemented using distributed frameworks such as MapReduce and OpenMP, which are not appropriate for iterative applications. Hence, in this paper we propose efficient distributed algorithms for both formal concept generation and concept lattice digraph construction in large formal contexts using Apache Spark. Various performance metrics are considered in evaluating the proposed work, and the results show that the proposed algorithms are efficient for concept generation and lattice graph construction in comparison with the existing algorithms.
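
To make the two tasks named in the abstract concrete, here is a minimal single-machine sketch in plain Python. It is not the paper's Spark algorithm: the toy context and the names `CONTEXT`, `extent`, `intent`, `all_concepts`, and `lattice_edges` are assumptions made for illustration. The enumeration is the naive one, closing every attribute subset, which is exponential and is exactly the cost that distributed approaches try to tame.

```python
from itertools import combinations

# Hypothetical toy formal context: object -> set of attributes it has.
CONTEXT = {
    "o1": {"a", "b"},
    "o2": {"b", "c"},
    "o3": {"a", "b", "c"},
}
ATTRIBUTES = frozenset().union(*CONTEXT.values())


def extent(attrs):
    """Objects that have every attribute in attrs."""
    return frozenset(o for o, has in CONTEXT.items() if attrs <= has)


def intent(objs):
    """Attributes shared by every object in objs (all attributes if objs is empty)."""
    result = set(ATTRIBUTES)
    for o in objs:
        result &= CONTEXT[o]
    return frozenset(result)


def all_concepts():
    """Naive enumeration: close every attribute subset and deduplicate."""
    concepts = set()
    for r in range(len(ATTRIBUTES) + 1):
        for subset in combinations(sorted(ATTRIBUTES), r):
            ext = extent(frozenset(subset))
            # (extent, intent) is a formal concept: each set determines the other.
            concepts.add((ext, intent(ext)))
    return concepts


def lattice_edges(concepts):
    """Cover relation: (lower, upper) pairs with no concept strictly between them."""
    edges = set()
    for lo in concepts:
        for hi in concepts:
            strictly_below = lo[0] < hi[0]  # proper subset of extents
            if strictly_below and not any(lo[0] < mid[0] < hi[0] for mid in concepts):
                edges.add((lo, hi))
    return edges


if __name__ == "__main__":
    found = all_concepts()
    for ext, itt in sorted(found, key=lambda c: len(c[0])):
        print(sorted(ext), sorted(itt))
    print(len(lattice_edges(found)), "edges in the lattice digraph")
```

Each concept is a pair of an object set and an attribute set that determine one another, and the edges connect a concept to its immediate super-concepts, which is the digraph the abstract refers to.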


