Second Sight: Using brain-optimized encoding models to align image distributions with human brain activity

by Reese Kneeland, et al.

Two recent developments have accelerated progress in image reconstruction from human brain activity: large datasets that offer samples of brain activity in response to many thousands of natural scenes, and the open-sourcing of powerful stochastic image-generators that accept both low- and high-level guidance. Most work in this space has focused on obtaining point estimates of the target image, with the ultimate goal of approximating literal pixel-wise reconstructions of target images from the brain activity patterns they evoke. This emphasis belies the fact that there is always a family of images that are equally compatible with any evoked brain activity pattern, and the fact that many image-generators are inherently stochastic and do not by themselves offer a method for selecting the single best reconstruction from among the samples they generate. We introduce a novel reconstruction procedure (Second Sight) that iteratively refines an image distribution to explicitly maximize the alignment between the predictions of a voxel-wise encoding model and the brain activity patterns evoked by any target image. We show that our process converges on a distribution of high-quality reconstructions by refining both semantic content and low-level image details across iterations. Images sampled from these converged image distributions are competitive with state-of-the-art reconstruction algorithms. Interestingly, the time-to-convergence varies systematically across visual cortex, with earlier visual areas generally taking longer and converging on narrower image distributions, relative to higher-level brain areas. Second Sight thus offers a succinct and novel method for exploring the diversity of representations across visual brain areas.
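The iterative refinement described above can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify the generator, the encoding model, the alignment score, or the update rule, so everything below is an assumption made for illustration. Here an "image" is a short list of numbers, the encoding model is a random linear map (`encode`), the stochastic image generator is replaced by Gaussian sampling around the current best guess, alignment is scored with Pearson correlation, and the distribution is narrowed on a fixed decay schedule. All function and parameter names (`alignment`, `refine`, `n_keep`, `spread`, `decay`) are hypothetical.

```python
import math
import random


def alignment(predicted, measured):
    """Pearson correlation between predicted and measured voxel activity."""
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_m = sum(measured) / n
    cov = sum((p - mean_p) * (m - mean_m) for p, m in zip(predicted, measured))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_m = sum((m - mean_m) ** 2 for m in measured)
    norm = math.sqrt(var_p * var_m)
    return cov / norm if norm > 0 else 0.0


def refine(measured, encode, center, n_iter=20, n_samples=64, n_keep=8,
           spread=1.0, decay=0.85, seed=0):
    """Iteratively narrow a sampling distribution toward better alignment.

    Assumption: a Gaussian around `center` stands in for a guided
    stochastic image generator; `spread` shrinks on a fixed schedule.
    """
    rng = random.Random(seed)
    history = []
    for _ in range(n_iter):
        # Draw candidate "images" from the current distribution.
        samples = [[c + spread * rng.gauss(0.0, 1.0) for c in center]
                   for _ in range(n_samples)]
        # Score each candidate by how well its predicted activity
        # matches the measured activity, best first.
        scored = sorted(((alignment(encode(s), measured), s) for s in samples),
                        key=lambda pair: pair[0], reverse=True)
        kept = scored[:n_keep]
        history.append(sum(score for score, _ in kept) / n_keep)
        # Re-center the distribution on the best candidates and narrow it.
        center = [sum(col) / n_keep for col in zip(*(s for _, s in kept))]
        spread *= decay
    return center, spread, history


# Toy setup (all values are illustrative assumptions).
_rng = random.Random(1)
N_PIXELS, N_VOXELS = 16, 100
WEIGHTS = [[_rng.gauss(0.0, 1.0) for _ in range(N_PIXELS)]
           for _ in range(N_VOXELS)]


def encode(image):
    """Hypothetical linear voxel-wise encoding model."""
    return [sum(w * x for w, x in zip(row, image)) for row in WEIGHTS]


target = [_rng.gauss(0.0, 1.0) for _ in range(N_PIXELS)]
measured = [a + 0.1 * _rng.gauss(0.0, 1.0) for a in encode(target)]
center, final_spread, history = refine(measured, encode, [0.0] * N_PIXELS)
```

In this sketch, `history` holds the mean alignment of the retained samples at each iteration, and `final_spread` is the width of the distribution after the last iteration. The actual procedure presumably replaces the Gaussian sampler with a guided image generator and may adapt the distribution differently, for example per brain area.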




