B-CNN: Branch Convolutional Neural Network for Hierarchical Classification

09/28/2017
by Xinqi Zhu, et al.

Convolutional Neural Network (CNN) image classifiers are traditionally designed as a sequence of convolutional layers with a single output layer. This design assumes that all target classes should be treated equally and exclusively. However, some classes are more difficult to distinguish than others, and classes may be organized in a hierarchy of categories. At the same time, a CNN learns internal representations that abstract from the input data through its hierarchical layered structure. It is therefore natural to ask whether an inverse of this idea can be applied to learn a model that predicts over a classification hierarchy, using multiple output layers in decreasing order of class abstraction. In this paper, we introduce a variant of the traditional CNN model named the Branch Convolutional Neural Network (B-CNN). A B-CNN model outputs multiple predictions, ordered from coarse to fine, along the concatenated convolutional layers, corresponding to the hierarchical structure of the target classes, which can be regarded as a form of prior knowledge on the output. To train B-CNNs, we introduce a novel training strategy, named the Branch Training strategy (BT-strategy), which balances the strictness of the prior with the freedom to adjust parameters on the output layers to minimize the loss. In this way, we show that CNN-based models can be forced to learn successively coarse-to-fine concepts in the internal layers at the output stage, and that hierarchical prior knowledge can be used to boost the classification performance of CNN models. Our evaluation shows that the B-CNN extensions improve over the corresponding baseline CNNs on the benchmark datasets MNIST, CIFAR-10 and CIFAR-100.
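
The following is a minimal sketch of how a weighted, multi-branch loss of the kind the abstract describes might be computed, assuming one softmax output per hierarchy level and one scalar weight per branch. The function names (`branch_loss`, `loss_weights`), the two-level hierarchy, and the schedule values in `SCHEDULE` are illustrative assumptions, not values or code taken from the paper.

```python
import math

def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    """Cross-entropy of one example given raw logits and a target class index."""
    return -math.log(softmax(logits)[target])

def branch_loss(branch_logits, branch_targets, weights):
    """Weighted sum of per-branch cross-entropy losses, ordered coarse to fine."""
    assert len(branch_logits) == len(branch_targets) == len(weights)
    return sum(
        w * cross_entropy(logits, target)
        for logits, target, w in zip(branch_logits, branch_targets, weights)
    )

def loss_weights(epoch, schedule):
    """Return the weights of the last schedule entry whose start epoch <= epoch."""
    current = schedule[0][1]
    for start, weights in schedule:
        if epoch >= start:
            current = weights
    return current

# Hypothetical schedule (assumption): emphasis moves from the coarse branch
# to the fine branch as training progresses. Entries are (start_epoch, weights).
SCHEDULE = [(0, (0.9, 0.1)), (10, (0.5, 0.5)), (20, (0.0, 1.0))]

# Example logits for one image: 2 coarse classes, 4 fine classes (made up).
coarse_logits = [2.0, 0.5]
fine_logits = [0.1, 1.5, 0.3, 0.2]

early = branch_loss([coarse_logits, fine_logits], [0, 1], loss_weights(0, SCHEDULE))
late = branch_loss([coarse_logits, fine_logits], [0, 1], loss_weights(25, SCHEDULE))
```

With weights that start concentrated on the coarse branch and end on the fine branch, the early loss is dominated by the coarse prediction and the late loss reduces to the ordinary fine-level cross-entropy. This is one plausible way to trade off the strictness of the hierarchical prior against freedom on the final output layer.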


research
10/03/2014

HD-CNN: Hierarchical Deep Convolutional Neural Network for Large Scale Visual Recognition

In image classification, visual separability between different object ca...
research
06/03/2019

Hierarchical Auxiliary Learning

Conventional application of convolutional neural networks (CNNs) for ima...
research
03/19/2020

Multilayer Dense Connections for Hierarchical Concept Classification

Classification is a pivotal function for many computer vision tasks such...
research
07/07/2020

Hierarchical nucleation in deep neural networks

Deep convolutional networks (DCNs) learn meaningful representations wher...
research
03/01/2022

SMTNet: Hierarchical cavitation intensity recognition based on sub-main transfer network

With the rapid development of smart manufacturing, data-driven machinery...
research
10/11/2021

Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules

Training from scratch is the most common way to build a Convolutional Ne...
research
04/20/2016

Network of Experts for Large-Scale Image Categorization

We present a tree-structured network architecture for large scale image ...
