Capturing the objects of vision with neural networks

09/07/2021
by   Benjamin Peters, et al.
0

Human visual perception carves a scene at its physical joints, decomposing the world into objects, which are selectively attended, tracked, and predicted as we engage our surroundings. Object representations emancipate perception from the sensory input, enabling us to keep in mind that which is out of sight and to use perceptual content as a basis for action and symbolic cognition. Human behavioral studies have documented how object representations emerge through grouping, amodal completion, proto-objects, and object files. Deep neural network (DNN) models of visual object recognition, by contrast, remain largely tethered to the sensory input, despite achieving human-level performance at labeling objects. Here, we review related work in both fields and examine how these fields can help each other. The cognitive literature provides a starting point for the development of new experimental tasks that reveal mechanisms of human object perception and serve as benchmarks driving development of deep neural network models that will put the object into object recognition.

READ FULL TEXT

page 2

page 3

page 6

page 8

page 10

page 11

page 12

page 15

research
01/12/2016

Deep Neural Networks predict Hierarchical Spatio-temporal Cortical Dynamics of Human Visual Object Recognition

The complex multi-stage architecture of cortical visual pathways provide...
research
03/04/2019

Optimizing Object-based Perception and Control by Free-Energy Principle

One of the well-known formulations of human perception is a hierarchical...
research
01/09/2023

3D Shape Perception Integrates Intuitive Physics and Analysis-by-Synthesis

Many surface cues support three-dimensional shape perception, but people...
research
10/06/2021

Learning a Metacognition for Object Detection

In contrast to object recognition models, humans do not blindly trust th...
research
07/21/2019

ImageNet-trained deep neural network exhibits illusion-like response to the Scintillating Grid

Deep neural network (DNN) models for computer vision are now capable of ...
research
06/14/2021

A Novel mapping for visual to auditory sensory substitution

visual information can be converted into audio stream via sensory substi...
research
02/17/2021

Grid Cell Path Integration For Movement-Based Visual Object Recognition

Grid cells enable the brain to model the physical space of the world and...

Please sign up or login with your details

Forgot password? Click here to reset