Greedy Discovery of Ordinal Factors

02/19/2023
by   Dominik Dürrschnabel, et al.
0

In large datasets, it is hard to discover and analyze structure. It is thus common to introduce tags or keywords for the items. In applications, such datasets are then filtered based on these tags. Still, even medium-sized datasets with a few tags result in complex and for humans hard-to-navigate systems. In this work, we adopt the method of ordinal factor analysis to address this problem. An ordinal factor arranges a subset of the tags in a linear order based on their underlying structure. A complete ordinal factorization, which consists of such ordinal factors, precisely represents the original dataset. Based on such an ordinal factorization, we provide a way to discover and explain relationships between different items and attributes in the dataset. However, computing even just one ordinal factor of high cardinality is computationally complex. We thus propose the greedy algorithm in this work. This algorithm extracts ordinal factors using already existing fast algorithms developed in formal concept analysis. Then, we leverage to propose a comprehensive way to discover relationships in the dataset. We furthermore introduce a distance measure based on the representation emerging from the ordinal factorization to discover similar items. To evaluate the method, we conduct a case study on different datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2023

Maximal Ordinal Two-Factorizations

Given a formal context, an ordinal factor is a subset of its incidence r...
research
03/14/2018

Multi-objective Analysis of MAP-Elites Performance

In certain complex optimization tasks, it becomes necessary to use multi...
research
11/14/2019

Explainable Ordinal Factorization Model: Deciphering the Effects of Attributes by Piece-wise Linear Approximation

Ordinal regression predicts the objects' labels that exhibit a natural o...
research
04/10/2023

Ordinal Motifs in Lattices

Lattices are a commonly used structure for the representation and analys...
research
08/10/2021

Heywood cases in unidimensional factor models and item response models for binary data

Heywood cases are known from linear factor analysis literature as variab...
research
09/12/2022

Ordinal Graph Gamma Belief Network for Social Recommender Systems

To build recommender systems that not only consider user-item interactio...
research
12/18/2014

Tag-Aware Ordinal Sparse Factor Analysis for Learning and Content Analytics

Machine learning offers novel ways and means to design personalized lear...

Please sign up or login with your details

Forgot password? Click here to reset