Seeing Neural Networks Through a Box of Toys: The Toybox Dataset of Visual Object Transformations

06/15/2018
by   Xiaohan Wang, et al.

Deep convolutional neural networks (CNNs) have enjoyed tremendous success in computer vision in the past several years, particularly for visual object recognition. However, how CNNs work remains poorly understood, and the training of deep CNNs is still considered more art than science. To better characterize deep CNNs and the training process, we introduce a new video dataset called Toybox. Images in Toybox come from first-person, wearable camera recordings of common household objects and toys being manually manipulated to undergo structured transformations like rotations and translations. We also present results from initial experiments using deep CNNs that begin to examine how different distributions of training data can affect visual object recognition performance, and how visual object concepts are represented within a trained network.


