UPGPT: Universal Diffusion Model for Person Image Generation, Editing and Pose Transfer

04/18/2023
by Soon Yau Cheong, et al.

Existing person image generative models can perform either image generation or pose transfer, but not both. We propose UPGPT, a unified diffusion model that provides a universal solution to all person image tasks: generation, pose transfer, and editing. With multimodality and disentanglement capabilities, our approach offers fine-grained control over the generation and editing of images using a combination of pose, text, and image, all without needing a semantic segmentation mask, which can be challenging to obtain or edit. We also pioneer the use of the parameterized SMPL body model in pose-guided person image generation to demonstrate a new capability: simultaneous pose and camera view interpolation while maintaining a person's appearance. Results on the benchmark DeepFashion dataset show that UPGPT is the new state of the art while simultaneously pioneering new capabilities of editing and pose transfer in human image generation.

research
11/11/2022

HumanDiffusion: a Coarse-to-Fine Alignment Diffusion Framework for Controllable Text-Driven Person Image Generation

Text-driven person image generation is an emerging and challenging task ...
research
04/14/2021

Dressing in Order: Recurrent Person Image Generation for Pose Transfer, Virtual Try-on and Outfit Editing

This paper proposes a flexible person generation framework called Dressi...
research
03/09/2022

Pose Guided Multi-person Image Generation From Text

Transformers have recently been shown to generate high quality images fr...
research
03/30/2023

PAIR-Diffusion: Object-Level Image Editing with Structure-and-Appearance Paired Diffusion Models

Image editing using diffusion models has witnessed extremely fast-paced ...
research
07/24/2022

TIPS: Text-Induced Pose Synthesis

In computer vision, human pose synthesis and transfer deal with probabil...
research
04/10/2019

Text Guided Person Image Synthesis

This paper presents a novel method to manipulate the visual appearance (...
research
01/31/2022

Third Time's the Charm? Image and Video Editing with StyleGAN3

StyleGAN is arguably one of the most intriguing and well-studied generat...
