Controllable Attention for Structured Layered Video Decomposition

10/24/2019
by Jean-Baptiste Alayrac, et al.

The objective of this paper is to separate a video into its natural layers and to control which of the separated layers to attend to, for example to separate reflections, transparency, or object motion. We make three contributions: (i) we introduce a new structured neural network architecture that explicitly incorporates layers (as spatial masks) into its design, which improves separation performance over previous general-purpose networks for this task; (ii) we demonstrate that the architecture can be augmented to leverage external cues such as audio, both for controllability and to help disambiguation; and (iii) we experimentally demonstrate the effectiveness of our approach and training procedure with controlled experiments, and show that the proposed model can be successfully applied to real-world applications such as reflection removal and action recognition in cluttered scenes.
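To make the phrase "layers (as spatial masks)" concrete, here is a minimal NumPy sketch of one way per-pixel soft masks can split a video into layers. It is an illustration under stated assumptions, not the architecture from the paper: the function names `soft_masks` and `split_into_layers` are hypothetical, the mask logits are random stand-ins for what a network would predict, and each layer is assumed to be a mask-weighted copy of the input.

```python
import numpy as np

def soft_masks(logits):
    """Convert per-layer logits of shape (N, T, H, W) into masks that sum to 1 across the N layers."""
    # Subtract the per-pixel maximum for numerical stability before exponentiating.
    shifted = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(shifted)
    return weights / weights.sum(axis=0, keepdims=True)

def split_into_layers(video, logits):
    """Split a video (T, H, W, C) into N mask-weighted layers of shape (N, T, H, W, C)."""
    masks = soft_masks(logits)
    # Broadcast each mask over the channel axis and the video over the layer axis.
    return masks[..., None] * video[None]

# Hypothetical inputs: a random 4-frame RGB clip and random logits for 2 layers.
rng = np.random.default_rng(0)
video = rng.random((4, 8, 8, 3))
logits = rng.normal(size=(2, 4, 8, 8))
layers = split_into_layers(video, logits)
```

Because the masks sum to one at every pixel, the layers in this sketch add back up to the input video. That property belongs to this simplified setup and is not a claim about the paper's model.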


