Basic Level Categorization Facilitates Visual Object Recognition

11/12/2015
by   Panqu Wang, et al.
0

Recent advances in deep learning have led to significant progress in the computer vision field, especially for visual object recognition tasks. The features useful for object classification are learned by feed-forward deep convolutional neural networks (CNNs) automatically, and they are shown to be able to predict and decode neural representations in the ventral visual pathway of humans and monkeys. However, despite the huge amount of work on optimizing CNNs, there has not been much research focused on linking CNNs with guiding principles from the human visual cortex. In this work, we propose a network optimization strategy inspired by both of the developmental trajectory of children's visual object recognition capabilities, and Bar (2003), who hypothesized that basic level information is carried in the fast magnocellular pathway through the prefrontal cortex (PFC) and then projected back to inferior temporal cortex (IT), where subordinate level categorization is achieved. We instantiate this idea by training a deep CNN to perform basic level object categorization first, and then train it on subordinate level categorization. We apply this idea to training AlexNet (Krizhevsky et al., 2012) on the ILSVRC 2012 dataset and show that the top-5 accuracy increases from 80.13 demonstrating the effectiveness of the method. We also show that subsequent transfer learning on smaller datasets gives superior results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2019

Training neural networks to have brain-like representations improves object recognition performance

The current state-of-the-art object recognition algorithms, deep convolu...
research
06/15/2018

Seeing Neural Networks Through a Box of Toys: The Toybox Dataset of Visual Object Transformations

Deep convolutional neural networks (CNNs) have enjoyed tremendous succes...
research
06/08/2017

CortexNet: a Generic Network Family for Robust Visual Temporal Representations

In the past five years we have observed the rise of incredibly well perf...
research
11/16/2019

Sensory Optimization: Neural Networks as a Model for Understanding and Creating Art

This article is about the cognitive science of visual art. Artists creat...
research
04/26/2016

Are Face and Object Recognition Independent? A Neurocomputational Modeling Exploration

Are face and object recognition abilities independent? Although it is co...
research
05/19/2018

Learning Hierarchical Visual Representations in Deep Neural Networks Using Hierarchical Linguistic Labels

Modern convolutional neural networks (CNNs) are able to achieve human-le...
research
03/19/2017

Multilevel Context Representation for Improving Object Recognition

In this work, we propose the combined usage of low- and high-level block...

Please sign up or login with your details

Forgot password? Click here to reset