Learning Modulated Transformation in GANs

08/29/2023
by   Ceyuan Yang, et al.
0

The success of style-based generators largely benefits from style modulation, which helps take care of the cross-instance variation within data. However, the instance-wise stochasticity is typically introduced via regular convolution, where kernels interact with features at some fixed locations, limiting its capacity for modeling geometric variation. To alleviate this problem, we equip the generator in generative adversarial networks (GANs) with a plug-and-play module, termed as modulated transformation module (MTM). This module predicts spatial offsets under the control of latent codes, based on which the convolution operation can be applied at variable locations for different instances, and hence offers the model an additional degree of freedom to handle geometry deformation. Extensive experiments suggest that our approach can be faithfully generalized to various generative tasks, including image generation, 3D-aware image synthesis, and video generation, and get compatible with state-of-the-art frameworks without any hyper-parameter tuning. It is noteworthy that, towards human generation on the challenging TaiChi dataset, we improve the FID of StyleGAN3 from 21.36 to 13.60, demonstrating the efficacy of learning modulated geometry transformation.

READ FULL TEXT

page 6

page 8

page 9

research
05/18/2021

Decorating Your Own Bedroom: Locally Controlling Image Generation with Generative Adversarial Networks

Generative Adversarial Networks (GANs) have made great success in synthe...
research
04/02/2022

StyleWaveGAN: Style-based synthesis of drum sounds with extensive controls using generative adversarial networks

In this paper we introduce StyleWaveGAN, a style-based drum sound genera...
research
03/13/2021

Unsupervised Image Transformation Learning via Generative Adversarial Networks

In this work, we study the image transformation problem by learning the ...
research
01/25/2022

GIU-GANs: Global Information Utilization for Generative Adversarial Networks

In recent years, with the rapid development of artificial intelligence, ...
research
04/01/2021

Improved Image Generation via Sparse Modeling

The interest of the deep learning community in image synthesis has grown...
research
12/14/2022

Towards Smooth Video Composition

Video generation requires synthesizing consistent and persistent frames ...

Please sign up or login with your details

Forgot password? Click here to reset