Network of Experts for Large-Scale Image Categorization

04/20/2016
by   Karim Ahmed, et al.
0

We present a tree-structured network architecture for large scale image classification. The trunk of the network contains convolutional layers optimized over all classes. At a given depth, the trunk splits into separate branches, each dedicated to discriminate a different subset of classes. Each branch acts as an expert classifying a set of categories that are difficult to tell apart, while the trunk provides common knowledge to all experts in the form of shared features. The training of our "network of experts" is completely end-to-end: the partition of categories into disjoint subsets is learned simultaneously with the parameters of the network trunk and the experts are trained jointly by minimizing a single learning objective over all classes. The proposed structure can be built from any existing convolutional neural network (CNN). We demonstrate its generality by adapting 4 popular CNNs for image categorization into the form of networks of experts. Our experiments on CIFAR100 and ImageNet show that in every case our method yields a substantial improvement in accuracy over the base CNN, and gives the best result achieved so far on CIFAR100. Finally, the improvement in accuracy comes at little additional cost: compared to the base network, the training time is only moderately increased and the number of parameters is comparable or in some cases even lower.

READ FULL TEXT

page 16

page 17

research
11/19/2015

Mediated Experts for Deep Convolutional Networks

We present a new supervised architecture termed Mediated Mixture-of-Expe...
research
10/03/2014

HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual Recognition

In image classification, visual separability between different object ca...
research
09/28/2017

B-CNN: Branch Convolutional Neural Network for Hierarchical Classification

Convolutional Neural Network (CNN) image classifiers are traditionally d...
research
06/24/2017

Deep Mixture of Diverse Experts for Large-Scale Visual Recognition

In this paper, a deep mixture of diverse experts algorithm is developed ...
research
12/07/2017

Incremental Learning in Deep Convolutional Neural Networks Using Partial Network Sharing

Deep convolutional neural network (DCNN) based supervised learning is a ...
research
08/20/2018

Class2Str: End to End Latent Hierarchy Learning

Deep neural networks for image classification typically consists of a co...

Please sign up or login with your details

Forgot password? Click here to reset