UMFuse: Unified Multi View Fusion for Human Editing applications

11/17/2022
by   Rishabh Jain, et al.
0

The vision community has explored numerous pose guided human editing methods due to their extensive practical applications. Most of these methods still use an image-to-image formulation in which a single image is given as input to produce an edited image as output. However, the problem is ill-defined in cases when the target pose is significantly different from the input pose. Existing methods then resort to in-painting or style transfer to handle occlusions and preserve content. In this paper, we explore the utilization of multiple views to minimize the issue of missing information and generate an accurate representation of the underlying human model. To fuse the knowledge from multiple viewpoints, we design a selector network that takes the pose keypoints and texture from images and generates an interpretable per-pixel selection map. After that, the encodings from a separate network (trained on a single image human reposing task) are merged in the latent space. This enables us to generate accurate, precise, and visually coherent images for different editing tasks. We show the application of our network on 2 newly proposed tasks - Multi-view human reposing, and Mix-and-match human image generation. Additionally, we study the limitations of single-view editing and scenarios in which multi-view provides a much better alternative.

READ FULL TEXT

page 1

page 3

page 5

page 6

page 7

page 9

page 10

research
03/25/2022

3D GAN Inversion for Controllable Portrait Image Animation

Millions of images of human faces are captured every single day; but the...
research
03/14/2022

Texture Generation Using Dual-Domain Feature Flow with Multi-View Hallucinations

We propose a dual-domain generative model to estimate a texture map from...
research
04/12/2021

Multi-View Image-to-Image Translation Supervised by 3D Pose

We address the task of multi-view image-to-image translation for person ...
research
04/26/2023

Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation

Multi-view image generation attracts particular attention these days due...
research
08/16/2022

Style Your Hair: Latent Optimization for Pose-Invariant Hairstyle Transfer via Local-Style-Aware Hair Alignment

Editing hairstyle is unique and challenging due to the complexity and de...
research
04/11/2018

View Extrapolation of Human Body from a Single Image

We study how to synthesize novel views of human body from a single image...
research
04/05/2023

StyleGAN Salon: Multi-View Latent Optimization for Pose-Invariant Hairstyle Transfer

Our paper seeks to transfer the hairstyle of a reference image to an inp...

Please sign up or login with your details

Forgot password? Click here to reset