Xp-GAN: Unsupervised Multi-object Controllable Video Generation

11/19/2021
by   Bahman Rouhani, et al.
9

Video Generation is a relatively new and yet popular subject in machine learning due to its vast variety of potential applications and its numerous challenges. Current methods in Video Generation provide the user with little or no control over the exact specification of how the objects in the generate video are to be moved and located at each frame, that is, the user can't explicitly control how each object in the video should move. In this paper we propose a novel method that allows the user to move any number of objects of a single initial frame just by drawing bounding boxes over those objects and then moving those boxes in the desired path. Our model utilizes two Autoencoders to fully decompose the motion and content information in a video and achieves results comparable to well-known baseline and state of the art methods.

READ FULL TEXT

page 2

page 3

page 5

page 6

page 7

research
06/06/2023

Learn the Force We Can: Multi-Object Video Generation from Pixel-Level Interactions

We propose a novel unsupervised method to autoregressively generate vide...
research
11/24/2021

Layered Controllable Video Generation

We introduce layered controllable video generation, where we, without an...
research
05/06/2023

Multi-object Video Generation from Single Frame Layouts

In this paper, we study video synthesis with emphasis on simplifying the...
research
08/19/2021

Click to Move: Controlling Video Generation with Sparse Motion

This paper introduces Click to Move (C2M), a novel framework for video g...
research
02/19/2023

Accelerated Video Annotation driven by Deep Detector and Tracker

Annotating object ground truth in videos is vital for several downstream...
research
02/02/2017

YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video

We introduce a new large-scale data set of video URLs with densely-sampl...
research
10/16/2015

You-Do, I-Learn: Unsupervised Multi-User egocentric Approach Towards Video-Based Guidance

This paper presents an unsupervised approach towards automatically extra...

Please sign up or login with your details

Forgot password? Click here to reset