Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

05/15/2023
by   Yuyang Zhao, et al.
0

The text-driven image and video diffusion models have achieved unprecedented success in generating realistic and diverse content. Recently, the editing and variation of existing images and videos in diffusion-based generative models have garnered significant attention. However, previous works are limited to editing content with text or providing coarse personalization using a single visual clue, rendering them unsuitable for indescribable content that requires fine-grained and detailed control. In this regard, we propose a generic video editing framework called Make-A-Protagonist, which utilizes textual and visual clues to edit videos with the goal of empowering individuals to become the protagonists. Specifically, we leverage multiple experts to parse source video, target visual and textual clues, and propose a visual-textual-based video generation model that employs mask-guided denoising sampling to generate the desired output. Extensive results demonstrate the versatile and remarkable editing capabilities of Make-A-Protagonist.

READ FULL TEXT

page 1

page 3

page 8

page 9

page 11

page 12

page 13

research
02/06/2023

Structure and Content-Guided Video Synthesis with Diffusion Models

Text-guided generative diffusion models unlock powerful image creation a...
research
02/19/2021

Clarification of Video Retrieval Query Results by the Automated Insertion of Supporting Shots

Computational Video Editing Systems output video generally follows a par...
research
03/19/2023

SKED: Sketch-guided Text-based 3D Editing

Text-to-image diffusion models are gradually introduced into computer gr...
research
05/15/2023

Edit As You Wish: Video Description Editing with Multi-grained Commands

Automatically narrating a video with natural language can assist people ...
research
08/09/2021

Learning to Cut by Watching Movies

Video content creation keeps growing at an incredible pace; yet, creatin...
research
11/21/2020

Iterative Text-based Editing of Talking-heads Using Neural Retargeting

We present a text-based tool for editing talking-head video that enables...
research
02/02/2023

Dreamix: Video Diffusion Models are General Video Editors

Text-driven image and video diffusion models have recently achieved unpr...

Please sign up or login with your details

Forgot password? Click here to reset