Convex Calibrated Surrogates for the Multi-Label F-Measure

09/16/2020
by   Mingyuan Zhang, et al.
0

The F-measure is a widely used performance measure for multi-label classification, where multiple labels can be active in an instance simultaneously (e.g. in image tagging, multiple tags can be active in any image). In particular, the F-measure explicitly balances recall (fraction of active labels predicted to be active) and precision (fraction of labels predicted to be active that are actually so), both of which are important in evaluating the overall performance of a multi-label classifier. As with most discrete prediction problems, however, directly optimizing the F-measure is computationally hard. In this paper, we explore the question of designing convex surrogate losses that are calibrated for the F-measure – specifically, that have the property that minimizing the surrogate loss yields (in the limit of sufficient data) a Bayes optimal multi-label classifier for the F-measure. We show that the F-measure for an s-label problem, when viewed as a 2^s × 2^s loss matrix, has rank at most s^2+1, and apply a result of Ramaswamy et al. (2014) to design a family of convex calibrated surrogates for the F-measure. The resulting surrogate risk minimization algorithms can be viewed as decomposing the multi-label F-measure learning problem into s^2+1 binary class probability estimation problems. We also provide a quantitative regret transfer bound for our surrogates, which allows any regret guarantees for the binary problems to be transferred to regret guarantees for the overall F-measure problem, and discuss a connection with the algorithm of Dembczynski et al. (2013). Our experiments confirm our theoretical findings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2010

Calibrated Surrogate Losses for Classification with Label-Dependent Costs

We present surrogate regret bounds for arbitrary surrogate losses in the...
research
11/16/2020

Multi-label classification: do Hamming loss and subset accuracy really conflict with each other?

Various evaluation measures have been developed for multi-label classifi...
research
01/13/2015

On Generalizing the C-Bound to the Multiclass and Multi-label Settings

The C-bound, introduced in Lacasse et al., gives a tight upper bound on ...
research
12/04/2021

Adaptive label thresholding methods for online multi-label classification

Existing online multi-label classification works cannot well handle the ...
research
11/24/2014

Iteratively Reweighted Graph Cut for Multi-label MRFs with Non-convex Priors

While widely acknowledged as highly effective in computer vision, multi-...
research
05/30/2023

Hierarchical Multi-Instance Multi-Label Learning for Detecting Propaganda Techniques

Since the introduction of the SemEval 2020 Task 11 (Martino et al., 2020...
research
03/01/2021

Fast threshold optimization for multi-label audio tagging using Surrogate gradient learning

Multi-label audio tagging consists of assigning sets of tags to audio re...

Please sign up or login with your details

Forgot password? Click here to reset