Increasing diversity of omni-directional images generated from single image using cGAN based on MLPMixer

09/15/2023
by   Atsuya Nakata, et al.
0

This paper proposes a novel approach to generating omni-directional images from a single snapshot picture. The previous method has relied on the generative adversarial networks based on convolutional neural networks (CNN). Although this method has successfully generated omni-directional images, CNN has two drawbacks for this task. First, since a convolutional layer only processes a local area, it is difficult to propagate the information of an input snapshot picture embedded in the center of the omni-directional image to the edges of the image. Thus, the omni-directional images created by the CNN-based generator tend to have less diversity at the edges of the generated images, creating similar scene images. Second, the CNN-based model requires large video memory in graphics processing units due to the nature of the deep structure in CNN since shallow-layer networks only receives signals from a limited range of the receptive field. To solve these problems, MLPMixer-based method was proposed in this paper. The MLPMixer has been proposed as an alternative to the self-attention in the transformer, which captures long-range dependencies and contextual information. This enables to propagate information efficiently in the omni-directional image generation task. As a result, competitive performance has been achieved with reduced memory consumption and computational cost, in addition to increasing diversity of the generated omni-directional images.

READ FULL TEXT
research
02/09/2021

Diverse Single Image Generation with Controllable Global Structure through Self-Attention

Image generation from a single image using generative adversarial networ...
research
09/03/2020

SCG-Net: Self-Constructing Graph Neural Networks for Semantic Segmentation

Capturing global contextual representations by exploiting long-range pix...
research
04/03/2018

Bi-Directional Block Self-Attention for Fast and Memory-Efficient Sequence Modeling

Recurrent neural networks (RNN), convolutional neural networks (CNN) and...
research
07/06/2022

Is the U-Net Directional-Relationship Aware?

CNNs are often assumed to be capable of using contextual information abo...
research
11/20/2018

SpherePHD: Applying CNNs on a Spherical PolyHeDron Representation of 360 degree Images

Omni-directional cameras have many advantages over conventional cameras ...
research
06/30/2020

Rethinking CNN-Based Pansharpening: Guided Colorization of Panchromatic Images via GANs

Convolutional Neural Networks (CNN)-based approaches have shown promisin...
research
10/12/2020

Omni-Directional Image Generation from Single Snapshot Image

An omni-directional image (ODI) is the image that has a field of view co...

Please sign up or login with your details

Forgot password? Click here to reset