Autoregressive Omni-Aware Outpainting for Open-Vocabulary 360-Degree Image Generation

09/07/2023
by   Zhuqiang Lu, et al.
0

A 360-degree (omni-directional) image provides an all-encompassing spherical view of a scene. Recently, there has been an increasing interest in synthesising 360-degree images from conventional narrow field of view (NFoV) images captured by digital cameras and smartphones, for providing immersive experiences in various scenarios such as virtual reality. Yet, existing methods typically fall short in synthesizing intricate visual details or ensure the generated images align consistently with user-provided prompts. In this study, autoregressive omni-aware generative network (AOG-Net) is proposed for 360-degree image generation by out-painting an incomplete 360-degree image progressively with NFoV and text guidances joinly or individually. This autoregressive scheme not only allows for deriving finer-grained and text-consistent patterns by dynamically generating and adjusting the process but also offers users greater flexibility to edit their conditions throughout the generation process. A global-local conditioning mechanism is devised to comprehensively formulate the outpainting guidance in each autoregressive step. Text guidances, omni-visual cues, NFoV inputs and omni-geometry are encoded and further formulated with cross-attention based transformers into a global stream and a local stream into a conditioned generative backbone model. As AOG-Net is compatible to leverage large-scale models for the conditional encoder and the generative prior, it enables the generation to use extensive open-vocabulary text guidances. Comprehensive experiments on two commonly used 360-degree image datasets for both indoor and outdoor settings demonstrate the state-of-the-art performance of our proposed method. Our code will be made publicly available.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

research
05/12/2023

Better speech synthesis through scaling

In recent years, the field of image generation has been revolutionized b...
research
05/24/2023

LayoutGPT: Compositional Visual Planning and Generation with Large Language Models

Attaining a high degree of user controllability in visual generation oft...
research
05/23/2023

Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Recent text-to-image generation models have demonstrated impressive capa...
research
01/13/2020

180-degree Outpainting from a Single Image

Presenting context images to a viewer's peripheral vision is one of the ...
research
01/18/2020

Text-to-Image Generation with Attention Based Recurrent Neural Networks

Conditional image modeling based on textual descriptions is a relatively...
research
01/09/2020

Spherical Image Generation from a Single Normal Field of View Image by Considering Scene Symmetry

Spherical images taken in all directions (360 degrees) allow representin...
research
08/28/2023

360-Degree Panorama Generation from Few Unregistered NFoV Images

360^∘ panoramas are extensively utilized as environmental light sources ...

Please sign up or login with your details

Forgot password? Click here to reset