Feature Relevance Determination for Ordinal Regression in the Context of Feature Redundancies and Privileged Information

12/10/2019
by Lukas Pfannschmidt, et al.

Advances in machine learning technologies have led to increasingly powerful models, in particular in the context of big data. Yet many application scenarios demand robustly interpretable models rather than optimum model accuracy; this is the case, for example, if potential biomarkers or causal factors are to be discovered from a set of given measurements. In this contribution, we focus on feature selection paradigms, which enable us to uncover the relevant factors of a given regularity based on a sparse model. We focus on the important specific setting of linear ordinal regression, i.e. data have to be ranked into one of a finite number of ordered categories by a linear projection. Unlike previous work, we consider the case that features are potentially redundant, such that no unique minimum set of relevant features exists. We aim to identify all strongly and all weakly relevant features as well as their type of relevance (strong or weak); we achieve this goal by determining feature relevance bounds, which correspond to the minimum and maximum feature relevance, respectively, when searching over all equivalent models. In addition, we discuss how this setting enables us to substitute some of the features, e.g. due to their semantics, and how to extend the framework of feature relevance intervals to the setting of privileged information, i.e. potentially relevant information is available for training purposes only, but cannot be used for the prediction itself.
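
To make the setting concrete, the following is a minimal sketch of linear ordinal regression at prediction time, and of why redundant features leave the set of relevant features non-unique. It is an illustration only, not the authors' implementation: the function name `predict_ordinal`, the weight vectors, the thresholds, and the toy data are all assumptions chosen for this example. Each sample is projected onto a weight vector, and the resulting score is placed into one of the ordered categories by a sorted list of thresholds.

```python
import numpy as np

def predict_ordinal(X, w, thresholds):
    """Rank each row of X into an ordered category 0..K-1.

    The score is the linear projection X @ w; the category is the
    number of (sorted) thresholds that the score reaches or exceeds.
    """
    scores = X @ w
    return np.searchsorted(thresholds, scores, side="right")

# Toy data (hypothetical): columns 0 and 1 are exact copies of each
# other, column 2 is noise that the model ignores.
X = np.array([
    [0.25, 0.25,  9.0],
    [1.00, 1.00, -4.0],
    [2.00, 2.00,  0.0],
])
thresholds = np.array([1.0, 3.0])  # two thresholds -> three categories

# Three weight vectors that produce identical scores on this data.
w_shared = np.array([1.0, 1.0, 0.0])
w_first = np.array([2.0, 0.0, 0.0])
w_second = np.array([0.0, 2.0, 0.0])

labels = predict_ordinal(X, w_shared, thresholds)
labels_first = predict_ordinal(X, w_first, thresholds)
labels_second = predict_ordinal(X, w_second, thresholds)
print(labels, labels_first, labels_second)
```

All three weight vectors yield the same predictions, so the weight of either duplicated feature can range from 0 to 2 across equivalent models. In the terminology of the abstract, each of the two is weakly relevant: it can be replaced by the other, and a single sparse solution would report only one of them. The noise column receives weight 0 in every one of these models.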

research
02/20/2019

Feature Relevance Bounds for Ordinal Regression

The increasing occurrence of ordinal data, mainly sociodemographic, led ...
research
03/02/2019

FRI - Feature Relevance Intervals for Interpretable and Interactive Data Exploration

Most existing feature selection methods are insufficient for analytic pu...
research
04/01/2020

Sequential Feature Classification in the Context of Redundancies

The problem of all-relevant feature selection is concerned with finding ...
research
12/22/2017

Relevance Scoring of Triples Using Ordinal Logistic Classification - The Celosia Triple Scorer at WSDM Cup 2017

In this paper, we report our participation in the Task 2: Triple Scoring...
research
05/12/2016

Context-dependent feature analysis with random forests

In many cases, feature selection is often more complicated than identify...
research
05/18/2020

Sparse Methods for Automatic Relevance Determination

This work considers methods for imposing sparsity in Bayesian regression...
research
07/29/2019

FDive: Learning Relevance Models using Pattern-based Similarity Measures

The detection of interesting patterns in large high-dimensional datasets...