Make-It-4D: Synthesizing a Consistent Long-Term Dynamic Scene Video from a Single Image

08/20/2023
by   Liao Shen, et al.
0

We study the problem of synthesizing a long-term dynamic video from only a single image. This is challenging since it requires consistent visual content movements given large camera motions. Existing methods either hallucinate inconsistent perpetual views or struggle with long camera trajectories. To address these issues, it is essential to estimate the underlying 4D (including 3D geometry and scene motion) and fill in the occluded regions. To this end, we present Make-It-4D, a novel method that can generate a consistent long-term dynamic video from a single image. On the one hand, we utilize layered depth images (LDIs) to represent a scene, and they are then unprojected to form a feature point cloud. To animate the visual content, the feature point cloud is displaced based on the scene flow derived from motion estimation and the corresponding camera pose. Such 4D representation enables our method to maintain the global consistency of the generated dynamic video. On the other hand, we fill in the occluded regions by using a pretrained diffusion model to inpaint and outpaint the input image. This enables our method to work under large camera motions. Benefiting from our design, our method can be training-free which saves a significant amount of training time. Experimental results demonstrate the effectiveness of our approach, which showcases compelling rendering results.

READ FULL TEXT

page 2

page 3

page 5

page 6

page 7

page 8

research
03/10/2023

3D Cinemagraphy from a Single Image

We present 3D Cinemagraphy, a new technique that marries 2D image animat...
research
03/30/2023

Consistent View Synthesis with Pose-Guided Diffusion Models

Novel view synthesis from a single image has been a cornerstone problem ...
research
09/12/2019

3D Ken Burns Effect from a Single Image

The Ken Burns effect allows animating still images with a virtual camera...
research
12/11/2020

Street-view Panoramic Video Synthesis from a Single Satellite Image

We present a novel method for synthesizing both temporally and geometric...
research
09/14/2023

Generative Image Dynamics

We present an approach to modeling an image-space prior on scene dynamic...
research
05/12/2022

3D Moments from Near-Duplicate Photos

We introduce 3D Moments, a new computational photography effect. As inpu...
research
04/21/2023

Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

Novel view synthesis from a single input image is a challenging task, wh...

Please sign up or login with your details

Forgot password? Click here to reset