Unsupervised Feature Ranking via Attribute Networks

11/25/2021
by   Urh Primožič, et al.
0

The need for learning from unlabeled data is increasing in contemporary machine learning. Methods for unsupervised feature ranking, which identify the most important features in such data are thus gaining attention, and so are their applications in studying high throughput biological experiments or user bases for recommender systems. We propose FRANe (Feature Ranking via Attribute Networks), an unsupervised algorithm capable of finding key features in given unlabeled data set. FRANe is based on ideas from network reconstruction and network analysis. FRANe performs better than state-of-the-art competitors, as we empirically demonstrate on a large collection of benchmarks. Moreover, we provide the time complexity analysis of FRANe further demonstrating its scalability. Finally, FRANe offers as the result the interpretable relational structures used to derive the feature importances.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/06/2023

On the Learnability of Multilabel Ranking

Multilabel ranking is a central task in machine learning with widespread...
research
11/12/2018

Learning From Positive and Unlabeled Data: A Survey

Learning from positive and unlabeled data or PU learning is the setting ...
research
02/12/2022

Robust Deep Semi-Supervised Learning: A Brief Introduction

Semi-supervised learning (SSL) is the branch of machine learning that ai...
research
07/31/2023

Pretrained deep models outperform GBDTs in Learning-To-Rank under label scarcity

While deep learning (DL) models are state-of-the-art in text and image d...
research
02/19/2014

Unsupervised Ranking of Multi-Attribute Objects Based on Principal Curves

Unsupervised ranking faces one critical challenge in evaluation applicat...
research
04/06/2014

Sparse Coding: A Deep Learning using Unlabeled Data for High - Level Representation

Sparse coding algorithm is an learning algorithm mainly for unsupervised...
research
10/14/2021

Interpretable transformed ANOVA approximation on the example of the prevention of forest fires

The distribution of data points is a key component in machine learning. ...

Please sign up or login with your details

Forgot password? Click here to reset