Semantic scene descriptions as an objective of human vision

09/23/2022
by   Adrien Doerig, et al.
30

Interpreting the meaning of a visual scene requires not only identification of its constituent objects, but also a rich semantic characterization of object interrelations. Here, we study the neural mechanisms underlying visuo-semantic transformations by applying modern computational techniques to a large-scale 7T fMRI dataset of human brain responses elicited by complex natural scenes. Using semantic embeddings obtained by applying linguistic deep learning models to human-generated scene descriptions, we identify a widely distributed network of brain regions that encode semantic scene descriptions. Importantly, these semantic embeddings better explain activity in these regions than traditional object category labels. In addition, they are effective predictors of activity despite the fact that the participants did not actively engage in a semantic task, suggesting that visuo-semantic transformations are a default mode of vision. In support of this view, we then show that highly accurate reconstructions of scene captions can be directly linearly decoded from patterns of brain activity. Finally, a recurrent convolutional neural network trained on semantic embeddings further outperforms semantic embeddings in predicting brain activity, providing a mechanistic model of the brain's visuo-semantic transformations. Together, these experimental and computational results suggest that transforming visual input into rich semantic scene descriptions may be a central objective of the visual system, and that focusing efforts on this new objective may lead to improved models of visual information processing in the human brain.

READ FULL TEXT

page 3

page 5

page 7

page 9

page 21

page 22

page 23

research
09/04/2023

3D View Prediction Models of the Dorsal Visual Stream

Deep neural network representations align well with brain activity in th...
research
07/18/2014

Pixels to Voxels: Modeling Visual Representation in the Human Brain

The human brain is adept at solving difficult high-level visual processi...
research
04/30/2023

Reconstructing seen images from human brain activity via guided stochastic search

Visual reconstruction algorithms are an interpretive tool that map brain...
research
09/26/2019

ExpertoCoder: Capturing Divergent Brain Regions Using Mixture of Regression Experts

fMRI semantic category understanding using linguistic encoding models at...
research
07/20/2023

Brain2Music: Reconstructing Music from Human Brain Activity

The process of reconstructing experiences from human brain activity offe...
research
04/29/2021

Comparing Visual Reasoning in Humans and AI

Recent advances in natural language processing and computer vision have ...
research
03/14/2019

Recurrence required to capture the dynamic computations of the human ventral visual stream

The visual system is an intricate network of brain regions that enables ...

Please sign up or login with your details

Forgot password? Click here to reset