Towards Real-Time Text2Video via CLIP-Guided, Pixel-Level Optimization

10/23/2022
by   Peter Schaldenbrand, et al.
0

We introduce an approach to generating videos based on a series of given language descriptions. Frames of the video are generated sequentially and optimized by guidance from the CLIP image-text encoder; iterating through language descriptions, weighting the current description higher than others. As opposed to optimizing through an image generator model itself, which tends to be computationally heavy, the proposed approach computes the CLIP loss directly at the pixel level, achieving general content at a speed suitable for near real-time systems. The approach can generate videos in up to 720p resolution, variable frame-rates, and arbitrary aspect ratios at a rate of 1-2 frames per second. Please visit our website to view videos and access our open-source code: https://pschaldenbrand.github.io/text2video/ .

READ FULL TEXT

page 1

page 2

page 4

research
12/29/2021

StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2

Videos show continuous events, yet most - if not all - video synthesis f...
research
10/16/2020

Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation

Video generation models often operate under the assumption of fixed fram...
research
06/25/2018

EAST Real-Time VOD System Based on MDSplus

As with EAST (Experimental Advanced Superconducting Tokamak) experimenta...
research
08/23/2022

Wavelet-Based Fast Decoding of 360-Degree Videos

In this paper, we propose a wavelet-based video codec specifically desig...
research
10/19/2021

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

It is hard to generate an image at target view well for previous cross-v...
research
06/04/2021

Hierarchical Video Generation for Complex Data

Videos can often be created by first outlining a global description of t...
research
09/14/2023

Judging a video by its bitstream cover

Classifying videos into distinct categories, such as Sport and Music Vid...

Please sign up or login with your details

Forgot password? Click here to reset