Cinematic Mindscapes: High-quality Video Reconstruction from Brain Activity

05/19/2023
by   Zijiao Chen, et al.

Reconstructing human vision from brain activity has been an appealing task that helps us understand our cognitive processes. Even though recent research has seen great success in reconstructing static images from non-invasive brain recordings, work on recovering continuous visual experiences in the form of videos is limited. In this work, we propose Mind-Video, which learns spatiotemporal information from continuous fMRI data of the cerebral cortex progressively through masked brain modeling, multimodal contrastive learning with spatiotemporal attention, and co-training with an augmented Stable Diffusion model that incorporates network temporal inflation. We show that high-quality videos of arbitrary frame rates can be reconstructed with Mind-Video using adversarial guidance. The recovered videos were evaluated with various semantic and pixel-level metrics. We achieved an average accuracy of 85% in semantic classification tasks and 0.19 in structural similarity index (SSIM), outperforming the previous state of the art by 45%. We also show that our model is biologically plausible and interpretable, reflecting established physiological processes.
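The abstract names SSIM as one of its pixel-level metrics. As a rough illustration of what that score measures, the sketch below computes a simplified, single-window SSIM between two flattened grayscale frames. It is not the authors' evaluation code: the function name `global_ssim` and the toy frames are hypothetical, the constants `k1` and `k2` are the commonly used defaults, and standard SSIM implementations average the same formula over local sliding windows rather than over the whole frame.

```python
from statistics import fmean


def global_ssim(x, y, data_range=1.0, k1=0.01, k2=0.03):
    """Simplified SSIM computed over one window covering the whole frame.

    x, y: equally sized, flattened grayscale frames (sequences of floats).
    data_range: difference between the largest and smallest possible pixel value.
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("frames must be non-empty and the same size")

    # Small constants that keep the ratio stable when means or variances are near zero.
    c1 = (k1 * data_range) ** 2
    c2 = (k2 * data_range) ** 2

    mu_x, mu_y = fmean(x), fmean(y)
    var_x = fmean([(a - mu_x) ** 2 for a in x])
    var_y = fmean([(b - mu_y) ** 2 for b in y])
    cov_xy = fmean([(a - mu_x) * (b - mu_y) for a, b in zip(x, y)])

    numerator = (2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)
    denominator = (mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2)
    return numerator / denominator


# Toy 2x2 frames, flattened; pixel values assumed to lie in [0, 1].
frame = [0.0, 0.25, 0.5, 1.0]
inverted = [1.0 - p for p in frame]

same_score = global_ssim(frame, frame)        # identical frames score 1
inverted_score = global_ssim(frame, inverted)  # anti-correlated frames score below 0
```

For a video, one plausible convention is to compute the score per frame pair and average over frames; whether the paper does exactly this is not stated in the abstract.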


research
11/13/2022

Seeing Beyond the Brain: Conditional Diffusion Model with Sparse Masked Modeling for Vision Decoding

Decoding visual stimuli from brain recordings aims to deepen our underst...
research
06/07/2022

A Penny for Your (visual) Thoughts: Self-Supervised Reconstruction of Natural Movies from Brain Activity

Reconstructing natural videos from fMRI brain recordings is very challen...
research
05/17/2023

Controllable Mind Visual Diffusion Model

Brain signal visualization has emerged as an active research area, servi...
research
04/30/2023

Reconstructing seen images from human brain activity via guided stochastic search

Visual reconstruction algorithms are an interpretive tool that map brain...
research
05/29/2023

Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors

We present MindEye, a novel fMRI-to-image approach to retrieve and recon...
research
06/20/2023

Sound reconstruction from human brain activity via a generative model with brain-like auditory features

The successful reconstruction of perceptual experiences from human brain...
research
03/25/2023

Selective Structured State-Spaces for Long-Form Video Understanding

Effective modeling of complex spatiotemporal dependencies in long-form v...
