Label Noise Filtering Techniques to Improve Monotonic Classification

10/21/2018
by   José-Ramón Cano, et al.
0

The monotonic ordinal classification has increased the interest of researchers and practitioners within machine learning community in the last years. In real applications, the problems with monotonicity constraints are very frequent. To construct predictive monotone models from those problems, many classifiers require as input a data set satisfying the monotonicity relationships among all samples. Changing the class labels of the data set (relabelling) is useful for this. Relabelling is assumed to be an important building block for the construction of monotone classifiers and it is proved that it can improve the predictive performance. In this paper, we will address the construction of monotone datasets considering as noise the cases that do not meet the monotonicity restrictions. For the first time in the specialized literature, we propose the use of noise filtering algorithms in a preprocessing stage with a double goal: to increase both the monotonicity index of the models and the accuracy of the predictions for different monotonic classifiers. The experiments are performed over 12 datasets coming from classification and regression problems and show that our scheme improves the prediction capabilities of the monotonic classifiers instead of being applied to original and relabeled datasets. In addition, we have included the analysis of noise filtering process in the particular case of wine quality classification to understand its effect in the predictive models generated.

READ FULL TEXT
research
11/17/2018

Monotonic classification: an overview on algorithms, performance measures and data sets

Currently, knowledge discovery in databases is an essential step to iden...
research
01/12/2021

Improving Classification Accuracy with Graph Filtering

In machine learning, classifiers are typically susceptible to noise in t...
research
10/23/2018

OCAPIS: R package for Ordinal Classification And Preprocessing In Scala

Ordinal Data are those where a natural order exist between the labels. T...
research
03/05/2020

Fuzzy k-Nearest Neighbors with monotonicity constraints: Moving towards the robustness of monotonic noise

This paper proposes a new model based on Fuzzy k-Nearest Neighbors for c...
research
06/01/2021

Explanations for Monotonic Classifiers

In many classification tasks there is a requirement of monotonicity. Con...
research
10/14/2022

Monotonicity and Double Descent in Uncertainty Estimation with Gaussian Processes

The quality of many modern machine learning models improves as model com...
research
04/10/2018

Monotonic models for real-time dynamic malware detection

In dynamic malware analysis, programs are classified as malware or benig...

Please sign up or login with your details

Forgot password? Click here to reset