Learning Physical-Spatio-Temporal Features for Video Shadow Removal

03/16/2023
by   Zhihao Chen, et al.
9

Shadow removal in a single image has received increasing attention in recent years. However, removing shadows over dynamic scenes remains largely under-explored. In this paper, we propose the first data-driven video shadow removal model, termed PSTNet, by exploiting three essential characteristics of video shadows, i.e., physical property, spatio relation, and temporal coherence. Specifically, a dedicated physical branch was established to conduct local illumination estimation, which is more applicable for scenes with complex lighting and textures, and then enhance the physical features via a mask-guided attention strategy. Then, we develop a progressive aggregation module to enhance the spatio and temporal characteristics of features maps, and effectively integrate the three kinds of features. Furthermore, to tackle the lack of datasets of paired shadow videos, we synthesize a dataset (SVSRD-85) with aid of the popular game GTAV by controlling the switch of the shadow renderer. Experiments against 9 state-of-the-art models, including image shadow removers and image/video restoration methods, show that our method improves the best SOTA in terms of RMSE error for the shadow area by 14.7. In addition, we develop a lightweight model adaptation strategy to make our synthetic-driven model effective in real world scenes. The visual comparison on the public SBU-TimeLapse dataset verifies the generalization ability of our model in real scenes.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

page 9

page 10

page 12

research
06/24/2023

Real-World Video for Zoom Enhancement based on Spatio-Temporal Coupling

In recent years, single-frame image super-resolution (SR) has become mor...
research
03/11/2021

Triple-cooperative Video Shadow Detection

Shadow detection in a single image has received significant research int...
research
09/18/2019

A Survey on Rain Removal from Video and Single Image

Rain streaks might severely degenerate the performance of video/image pr...
research
05/19/2023

PastNet: Introducing Physical Inductive Biases for Spatio-temporal Video Prediction

In this paper, we investigate the challenge of spatio-temporal video pre...
research
10/16/2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes

Detecting and recognizing human action in videos with crowded scenes is ...
research
06/30/2021

Efficient Spatio-Temporal Recurrent Neural Network for Video Deblurring

Real-time video deblurring still remains a challenging task due to the c...
research
09/29/2022

Mask-Guided Image Person Removal with Data Synthesis

As a special case of common object removal, image person removal is play...

Please sign up or login with your details

Forgot password? Click here to reset