GBSD: Generative Bokeh with Stage Diffusion

06/14/2023
by   Jieren Deng, et al.
0

The bokeh effect is an artistic technique that blurs out-of-focus areas in a photograph and has gained interest due to recent developments in text-to-image synthesis and the ubiquity of smart-phone cameras and photo-sharing apps. Prior work on rendering bokeh effects have focused on post hoc image manipulation to produce similar blurring effects in existing photographs using classical computer graphics or neural rendering techniques, but have either depth discontinuity artifacts or are restricted to reproducing bokeh effects that are present in the training data. More recent diffusion based models can synthesize images with an artistic style, but either require the generation of high-dimensional masks, expensive fine-tuning, or affect global image characteristics. In this paper, we present GBSD, the first generative text-to-image model that synthesizes photorealistic images with a bokeh style. Motivated by how image synthesis occurs progressively in diffusion models, our approach combines latent diffusion models with a 2-stage conditioning algorithm to render bokeh effects on semantically defined objects. Since we can focus the effect on objects, this semantic bokeh effect is more versatile than classical rendering techniques. We evaluate GBSD both quantitatively and qualitatively and demonstrate its ability to be applied in both text-to-image and image-to-image settings.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

page 8

research
10/05/2022

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Research in vision-language models has seen rapid developments off-late,...
research
11/22/2022

Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation

Large-scale text-to-image generative models have been a revolutionary br...
research
11/15/2022

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model

The recent advances in diffusion models have set an impressive milestone...
research
06/01/2023

StyleDrop: Text-to-Image Generation in Any Style

Pre-trained large text-to-image models synthesize impressive images with...
research
06/10/2023

Image Vectorization: a Review

Nowadays, there are many diffusion and autoregressive models that show i...
research
06/06/2022

Blended Latent Diffusion

The tremendous progress in neural image generation, coupled with the eme...
research
11/04/2020

BGGAN: Bokeh-Glass Generative Adversarial Network for Rendering Realistic Bokeh

A photo captured with bokeh effect often means objects in focus are shar...

Please sign up or login with your details

Forgot password? Click here to reset