Elucidating image-to-set prediction: An analysis of models, losses and datasets

04/11/2019
by   Luis Pineda, et al.
0

In recent years, we have experienced a flurry of contributions in the multi-label classification literature. This problem has been framed under different perspectives, from predicting independent labels, to modeling label co-occurrences via architectural and/or loss function design. Despite great progress, it is still unclear which modeling choices are best suited to address this task, partially due to the lack of well defined benchmarks. Therefore, in this paper, we provide an in-depth analysis on five different computer vision datasets of increasing task complexity that are suitable for multi-label clasification (VOC, COCO, NUS-WIDE, ADE20k and Recipe1M). Our results show that (1) modeling label co-occurrences and predicting the number of labels that appear in the image is important, especially in high-dimensional output spaces; (2) carefully tuning hyper-parameters for very simple baselines leads to significant improvements, comparable to previously reported results; and (3) as a consequence of our analysis, we achieve state-of-the-art results on 3 datasets for which a fair comparison to previously published methods is feasible.

READ FULL TEXT

page 1

page 7

page 15

page 16

page 17

page 19

page 20

research
12/07/2018

LNEMLC: Label Network Embeddings for Multi-Label Classifiation

Multi-label classification aims to classify instances with discrete non-...
research
04/11/2017

Improving Pairwise Ranking for Multi-label Image Classification

Learning to rank has recently emerged as an attractive technique to trai...
research
11/22/2019

Orderless Recurrent Models for Multi-label Classification

Recurrent neural networks (RNN) are popular for many computer vision tas...
research
10/24/2022

An Effective Approach for Multi-label Classification with Missing Labels

Compared with multi-class classification, multi-label classification tha...
research
05/09/2022

When does dough become a bagel? Analyzing the remaining mistakes on ImageNet

Image classification accuracy on the ImageNet dataset has been a baromet...
research
07/08/2021

Parameter Selection: Why We Should Pay More Attention to It

The importance of parameter selection in supervised learning is well kno...
research
07/14/2020

Emoji Prediction: Extensions and Benchmarking

Emojis are a succinct form of language which can express concrete meanin...

Please sign up or login with your details

Forgot password? Click here to reset