Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs

10/04/2016
by   Limin Wang, et al.
0

Convolutional Neural Networks (CNNs) have made remarkable progress on scene recognition, partially due to these recent large-scale scene datasets, such as the Places and Places2. Scene categories are often defined by multi-level information, including local objects, global layout, and background environment, thus leading to large intra-class variations. In addition, with the increasing number of scene categories, label ambiguity has become another crucial issue in large-scale classification. This paper focuses on large-scale scene recognition and makes two major contributions to tackle these issues. First, we propose a multi-resolution CNN architecture that captures visual content and structure at multiple levels. The multi-resolution CNNs are composed of coarse resolution CNNs and fine resolution CNNs, which are complementary to each other. Second, we design two knowledge guided disambiguation techniques to deal with the problem of label ambiguity. (i) We exploit the knowledge from the confusion matrix computed on validation data to merge ambiguous classes into a super category. (ii) We utilize the knowledge of extra networks to produce a soft label for each image. Then the super categories or soft labels are employed to guide CNN training on the Places2. We conduct extensive experiments on three large-scale image datasets (ImageNet, Places, and Places2), demonstrating the effectiveness of our approach. Furthermore, our method takes part in two major scene recognition challenges, and achieves the second place at the Places2 challenge in ILSVRC 2015, and the first place at the LSUN challenge in CVPR 2016. Finally, we directly test the learned representations on other scene benchmarks, and obtain the new state-of-the-art results on the MIT Indoor67 (86.7%) and SUN397 (72.0%). We release the code and models at <https://github.com/wanglimin/MRCNN-Scene-Recognition>.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 9

page 13

research
10/03/2014

HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual Recognition

In image classification, visual separability between different object ca...
research
06/13/2022

2nd Place Solution for ICCV 2021 VIPriors Image Classification Challenge: An Attract-and-Repulse Learning Approach

Convolutional neural networks (CNNs) have achieved significant success i...
research
10/14/2015

Better Exploiting OS-CNNs for Better Event Recognition in Images

Event recognition from still images is one of the most important problem...
research
01/21/2019

Understanding the Impact of Label Granularity on CNN-based Image Classification

In recent years, supervised learning using Convolutional Neural Networks...
research
10/06/2016

Places: An Image Database for Deep Scene Understanding

The rise of multi-million-item dataset initiatives has enabled data-hung...
research
04/20/2016

Scene Parsing with Integration of Parametric and Non-parametric Models

We adopt Convolutional Neural Networks (CNNs) to be our parametric model...
research
12/13/2018

ELASTIC: Improving CNNs with Instance Specific Scaling Policies

Scale variation has been a challenge from traditional to modern approach...

Please sign up or login with your details

Forgot password? Click here to reset