FCA2VEC: Embedding Techniques for Formal Concept Analysis

11/26/2019
by   Dominik Dürrschnabel, et al.
0

Embedding large and high dimensional data into low dimensional vector spaces is a necessary task to computationally cope with contemporary data sets. Superseding latent semantic analysis recent approaches like word2vec or node2vec are well established tools in this realm. In the present paper we add to this line of research by introducing fca2vec, a family of embedding techniques for formal concept analysis (FCA). Our investigation contributes to two distinct lines of research. First, we enable the application of FCA notions to large data sets. In particular, we demonstrate how the cover relation of a concept lattice can be retrieved from a computational feasible embedding. Secondly, we show an enhancement for the classical node2vec approach in low dimension. For both directions the overall constraint of FCA of explainable results is preserved. We evaluate our novel procedures by computing fca2vec on different data sets like, wiki44 (a dense part of the Wikidata knowledge graph), the Mushroom data set and a publication network derived from the FCA community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/26/2020

Knowledge Cores in Large Formal Contexts

Knowledge computation tasks are often infeasible for large data sets. Th...
research
08/05/2018

Too many secants: a hierarchical approach to secant-based dimensionality reduction on large data sets

A fundamental question in many data analysis settings is the problem of ...
research
01/24/2018

Intrinsic dimension of concept lattices

Geometric analysis is a very capable theory to understand the influence ...
research
08/29/2023

Tuning the perplexity for and computing sampling-based t-SNE embeddings

Widely used pipelines for the analysis of high-dimensional data utilize ...
research
01/23/2021

ReliefE: Feature Ranking in High-dimensional Spaces via Manifold Embeddings

Feature ranking has been widely adopted in machine learning applications...
research
06/03/2021

Statistical embedding: Beyond principal components

There has been an intense recent activity in embedding of very high dime...
research
07/10/2018

A GPU-Oriented Algorithm Design for Secant-Based Dimensionality Reduction

Dimensionality-reduction techniques are a fundamental tool for extractin...

Please sign up or login with your details

Forgot password? Click here to reset