Probabilistic Diagnostic Tests for Degradation Problems in Supervised Learning

04/06/2020
by   Gustavo A Valencia-Zapata, et al.
8

Several studies point out different causes of performance degradation in supervised machine learning. Problems such as class imbalance, overlapping, small-disjuncts, noisy labels, and sparseness limit accuracy in classification algorithms. Even though a number of approaches either in the form of a methodology or an algorithm try to minimize performance degradation, they have been isolated efforts with limited scope. Most of these approaches focus on remediation of one among many problems, with experimental results coming from few datasets and classification algorithms, insufficient measures of prediction power, and lack of statistical validation for testing the real benefit of the proposed approach. This paper consists of two main parts: In the first part, a novel probabilistic diagnostic model based on identifying signs and symptoms of each problem is presented. Thereby, early and correct diagnosis of these problems is to be achieved in order to select not only the most convenient remediation treatment but also unbiased performance metrics. Secondly, the behavior and performance of several supervised algorithms are studied when training sets have such problems. Therefore, prediction of success for treatments can be estimated across classifiers.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 18

02/27/2013

A Probabilistic Approach to Hierarchical Model-based Diagnosis

Model-based diagnosis reasons backwards from a functional schematic of a...
11/29/2018

A snapshot on nonstandard supervised learning problems: taxonomy, relationships and methods

Machine learning is a field which studies how machines can alter and ada...
11/29/2017

Causality Refined Diagnostic Prediction

Applying machine learning in the health care domain has shown promising ...
02/11/2021

Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels

Deep learning approaches often require huge datasets to achieve good gen...
07/17/2020

Dealing with Nuisance Parameters using Machine Learning in High Energy Physics: a Review

In this work we discuss the impact of nuisance parameters on the effecti...
02/02/2019

Supervised classification via minimax probabilistic transformations

One of the most common and studied problem in machine learning is classi...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.