Grid Partitioned Attention: Efficient Transformer Approximation with Inductive Bias for High Resolution Detail Generation

07/08/2021
by   Nikolay Jetchev, et al.

Attention is a general reasoning mechanism that can flexibly deal with image information, but its memory requirements have so far made it impractical for high-resolution image generation. We present Grid Partitioned Attention (GPA), a new approximate attention algorithm that leverages a sparse inductive bias for higher computational and memory efficiency in image domains: queries attend only to a few keys, and spatially close queries attend to close keys due to correlations. Our paper introduces the new attention layer and analyzes its complexity and how the trade-off between memory usage and model power can be tuned by the hyper-parameters. We show how such attention enables novel deep learning architectures with copying modules that are especially useful for conditional image generation tasks like pose morphing. Our contributions are (i) the algorithm and code of the novel GPA layer, (ii) a novel deep attention-copying architecture, and (iii) new state-of-the-art experimental results on human pose morphing generation benchmarks.
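To make the sparsity idea concrete, here is a minimal NumPy sketch of attention in which all queries inside one grid cell share a small subset of keys. It is an illustration under stated assumptions, not the GPA layer itself: the abstract does not describe how keys are selected, so the mean-query top-k selection, the function name `grid_sparse_attention`, and the parameters `cell` and `top_k` are all hypothetical choices made for this example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grid_sparse_attention(q, k, v, cell, top_k):
    """Illustrative sparse attention (hypothetical, not the paper's algorithm).

    q: (H, W, d) queries on a spatial grid, H and W divisible by `cell`
    k: (N, d) keys, v: (N, dv) values, 1 <= top_k <= N
    """
    H, W, d = q.shape
    out = np.empty((H, W, v.shape[1]))
    for i in range(0, H, cell):
        for j in range(0, W, cell):
            block = q[i:i + cell, j:j + cell].reshape(-1, d)  # (cell*cell, d)
            # Assumption: pick the keys shared by the whole cell by scoring
            # every key against the cell's mean query.
            scores = k @ block.mean(axis=0)                   # (N,)
            idx = np.argpartition(scores, -top_k)[-top_k:]    # top_k key indices
            # Each query in the cell attends only to those top_k keys.
            att = softmax(block @ k[idx].T / np.sqrt(d))      # (cell*cell, top_k)
            out[i:i + cell, j:j + cell] = (att @ v[idx]).reshape(cell, cell, -1)
    return out
```

In this sketch the attention weights occupy H·W·top_k entries instead of H·W·N, which is the kind of memory versus model-power trade-off the abstract says can be tuned by hyper-parameters; setting `top_k` equal to the number of keys recovers ordinary dense attention.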


