Pixels to Voxels: Modeling Visual Representation in the Human Brain

07/18/2014
by   Pulkit Agrawal, et al.
0

The human brain is adept at solving difficult high-level visual processing problems such as image interpretation and object recognition in natural scenes. Over the past few years neuroscientists have made remarkable progress in understanding how the human brain represents categories of objects and actions in natural scenes. However, all current models of high-level human vision operate on hand annotated images in which the objects and actions have been assigned semantic tags by a human operator. No current models can account for high-level visual function directly in terms of low-level visual input (i.e., pixels). To overcome this fundamental limitation we sought to develop a new class of models that can predict human brain activity directly from low-level visual input (i.e., pixels). We explored two classes of models drawn from computer vision and machine learning. The first class of models was based on Fisher Vectors (FV) and the second was based on Convolutional Neural Networks (ConvNets). We find that both classes of models accurately predict brain activity in high-level visual areas, directly from pixels and without the need for any semantic tags or hand annotation of images. This is the first time that such a mapping has been obtained. The fit models provide a new platform for exploring the functional principles of human vision, and they show that modern methods of computer vision and machine learning provide important tools for characterizing brain function.

READ FULL TEXT

page 6

page 7

page 8

page 14

page 15

research
04/30/2023

Reconstructing seen images from human brain activity via guided stochastic search

Visual reconstruction algorithms are an interpretive tool that map brain...
research
09/21/2018

Unsupervised Image to Sequence Translation with Canvas-Drawer Networks

Encoding images as a series of high-level constructs, such as brush stro...
research
06/01/2023

Second Sight: Using brain-optimized encoding models to align image distributions with human brain activity

Two recent developments have accelerated progress in image reconstructio...
research
09/23/2022

Semantic scene descriptions as an objective of human vision

Interpreting the meaning of a visual scene requires not only identificat...
research
04/01/2021

Memorability: An image-computable measure of information utility

The pixels in an image, and the objects, scenes, and actions that they c...
research
03/16/2017

Using Human Brain Activity to Guide Machine Learning

Machine learning is a field of computer science that builds algorithms t...
research
04/29/2015

Anticipating Visual Representations from Unlabeled Video

Anticipating actions and objects before they start or appear is a diffic...

Please sign up or login with your details

Forgot password? Click here to reset