Mining Biclusters of Similar Values with Triadic Concept Analysis

11/14/2011
by   Mehdi Kaytoue, et al.
0

Biclustering numerical data became a popular data-mining task in the beginning of 2000's, especially for analysing gene expression data. A bicluster reflects a strong association between a subset of objects and a subset of attributes in a numerical object/attribute data-table. So called biclusters of similar values can be thought as maximal sub-tables with close values. Only few methods address a complete, correct and non redundant enumeration of such patterns, which is a well-known intractable problem, while no formal framework exists. In this paper, we introduce important links between biclustering and formal concept analysis. More specifically, we originally show that Triadic Concept Analysis (TCA), provides a nice mathematical framework for biclustering. Interestingly, existing algorithms of TCA, that usually apply on binary data, can be used (directly or with slight modifications) after a preprocessing step for extracting maximal biclusters of similar values.

READ FULL TEXT
research
11/24/2011

Revisiting Numerical Pattern Mining with Formal Concept Analysis

In this paper, we investigate the problem of mining numerical data in th...
research
11/23/2018

Contributions to Biclustering of Microarray Data Using Formal Concept Analysis

Biclustering is an unsupervised data mining technique that aims to unvei...
research
02/17/2017

Towards a Unified Taxonomy of Biclustering Methods

Being an unsupervised machine learning and data mining technique, biclus...
research
10/09/2017

Efficient mining of maximal biclusters in mixed-attribute datasets

This paper presents a novel enumerative biclustering algorithm to direct...
research
03/07/2020

New advances in enumerative biclustering algorithms with online partitioning

This paper further extends RIn-Close_CVC, a biclustering algorithm capab...
research
10/17/2018

RIn-Close_CVC2: an even more efficient enumerative algorithm for biclustering of numerical datasets

RIn-Close_CVC is an efficient (take polynomial time per bicluster), comp...
research
09/11/2018

Knowledge extraction, modeling and formalization: EEG case study

Formal Concept Analysis (FCA) is a well-established method for data anal...

Please sign up or login with your details

Forgot password? Click here to reset