Spatial-Temporal Transformer for Video Snapshot Compressive Imaging

09/04/2022
by   Lishun Wang, et al.
25

Video snapshot compressive imaging (SCI) captures multiple sequential video frames by a single measurement using the idea of computational imaging. The underlying principle is to modulate high-speed frames through different masks and these modulated frames are summed to a single measurement captured by a low-speed 2D sensor (dubbed optical encoder); following this, algorithms are employed to reconstruct the desired high-speed frames (dubbed software decoder) if needed. In this paper, we consider the reconstruction algorithm in video SCI, i.e., recovering a series of video frames from a compressed measurement. Specifically, we propose a Spatial-Temporal transFormer (STFormer) to exploit the correlation in both spatial and temporal domains. STFormer network is composed of a token generation block, a video reconstruction block, and these two blocks are connected by a series of STFormer blocks. Each STFormer block consists of a spatial self-attention branch, a temporal self-attention branch and the outputs of these two branches are integrated by a fusion network. Extensive results on both simulated and real data demonstrate the state-of-the-art performance of STFormer. The code and models are publicly available at https://github.com/ucaswangls/STFormer.git

READ FULL TEXT

page 4

page 7

page 9

page 10

page 12

page 13

page 14

page 15

research
05/17/2023

EfficientSCI: Densely Connected Network with Space-time Factorization for Large-scale Video Snapshot Compressive Imaging

Video snapshot compressive imaging (SCI) uses a two-dimensional detector...
research
06/20/2023

Unfolding Framework with Prior of Convolution-Transformer Mixture and Uncertainty Estimation for Video Snapshot Compressive Imaging

We consider the problem of video snapshot compressive imaging (SCI), whe...
research
03/01/2022

Motion-aware Dynamic Graph Neural Network for Video Compressive Sensing

Video snapshot compressive imaging (SCI) utilizes a 2D detector to captu...
research
09/11/2021

Dual-view Snapshot Compressive Imaging via Optical Flow Aided Recurrent Neural Network

Dual-view snapshot compressive imaging (SCI) aims to capture videos from...
research
04/01/2021

Distributed Video Adaptive Block Compressive Sensing

Video block compressive sensing has been studied for use in resource con...
research
06/13/2018

Convolutional sparse coding for capturing high speed video content

Video capture is limited by the trade-off between spatial and temporal r...
research
04/30/2021

BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification

In this paper, we present an efficient spatial-temporal representation f...

Please sign up or login with your details

Forgot password? Click here to reset