Tencent ML-Images: A Large-Scale Multi-Label Image Database for Visual Representation Learning

01/07/2019
by   Baoyuan Wu, et al.
0

In existing visual representation learning tasks, deep convolutional neural networks (CNNs) are often trained on images annotated with single tags, such as ImageNet. However, a single tag cannot describe all important contents of one image, and some useful visual information may be wasted during training. In this work, we propose to train CNNs from images annotated with multiple tags, to enhance the quality of visual representation of the trained CNN model. To this end, we build a large-scale multi-label image database with 18M images and 11K categories, dubbed Tencent ML-Images. We efficiently train the ResNet-101 model with multi-label outputs on Tencent ML-Images, taking 90 hours for 60 epochs, based on a large-scale distributed deep learning framework,i.e.,TFplus. The good quality of the visual representation of the Tencent ML-Images checkpoint is verified through three transfer learning tasks, including single-label image classification on ImageNet and Caltech-256, object detection on PASCAL VOC 2007, and semantic segmentation on PASCAL VOC 2012. The Tencent ML-Images database, the checkpoints of ResNet-101, and all the training codehave been released at https://github.com/Tencent/tencent-ml-images. It is expected to promote other vision tasks in the research and industry community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2019

The iMaterialist Fashion Attribute Dataset

Large-scale image databases such as ImageNet have significantly advanced...
research
06/20/2020

Unsupervised Image Classification for Deep Representation Learning

Deep clustering against self-supervised learning is a very important and...
research
04/01/2020

Adversarial Learning for Personalized Tag Recommendation

We have recently seen great progress in image classification due to the ...
research
10/30/2014

DeepSentiBank: Visual Sentiment Concept Classification with Deep Convolutional Neural Networks

This paper introduces a visual sentiment concept classification method b...
research
06/05/2021

GLSD: The Global Large-Scale Ship Database and Baseline Evaluations

In this paper, we introduce a challenging global large-scale ship databa...
research
07/10/2017

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

The success of deep learning in vision can be attributed to: (a) models ...
research
06/22/2014

CNN: Single-label to Multi-label

Convolutional Neural Network (CNN) has demonstrated promising performanc...

Please sign up or login with your details

Forgot password? Click here to reset