Explaining the Performance of Multi-label Classification Methods with Data Set Properties

06/28/2021
by   Jasmin Bogatinovski, et al.

Meta-learning generalizes the empirical experience gained across different learning tasks and holds promise for providing important empirical insight into the behaviour of machine learning algorithms. In this paper, we present a comprehensive meta-learning study of data sets and methods for multi-label classification (MLC). MLC is a practically relevant machine learning task in which each example is labelled with multiple labels simultaneously. Here, we analyze 40 MLC data sets by using 50 meta features describing different properties of the data. The main findings of this study are as follows. First, the most prominent meta features that describe the space of MLC data sets are the ones assessing different aspects of the label space. Second, the meta models show that the most important meta features describe the label space, and the meta features describing the relationships among the labels tend to occur a bit more often than the meta features describing the distributions between and within the individual labels. Third, the optimization of the hyperparameters can improve the predictive performance; however, quite often the extent of the improvements does not justify the resource utilization.
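To make the notion of a label-space meta feature concrete, the following is a minimal sketch that computes three measures commonly used in the MLC literature: label cardinality, label density, and the number of distinct label sets. The abstract does not list the 50 meta features used in the study, so the choice of these three, the function name `label_space_meta_features`, and the toy label matrix are assumptions made purely for illustration.

```python
def label_space_meta_features(Y):
    """Compute simple label-space meta features (illustrative choice only).

    Y is a list of rows, one per example; each row is a list of 0/1
    indicators, one per label.
    """
    n_examples = len(Y)
    n_labels = len(Y[0])

    # Label cardinality: average number of labels assigned per example.
    cardinality = sum(sum(row) for row in Y) / n_examples

    # Label density: cardinality normalized by the total number of labels.
    density = cardinality / n_labels

    # Number of distinct label combinations observed in the data set.
    distinct_label_sets = len({tuple(row) for row in Y})

    return {
        "label_cardinality": cardinality,
        "label_density": density,
        "distinct_label_sets": distinct_label_sets,
    }


# Hypothetical toy data set: 4 examples, 3 labels.
Y = [
    [1, 0, 1],
    [0, 1, 1],
    [1, 0, 1],
    [0, 0, 1],
]
features = label_space_meta_features(Y)
print(features)
```

Measures of this kind summarize the label space of a single data set as a few numbers, which is what allows data sets to be compared with each other and related to the performance of MLC methods in a meta-learning study.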


