Perspective (In)consistency of Paint by Text

06/27/2022
by Hany Farid, et al.

Type "a sea otter with a pearl earring by Johannes Vermeer" or "a photo of a teddy bear on a skateboard in Times Square" into OpenAI's DALL-E-2 paint-by-text synthesis engine and you will not be disappointed by the delightful and eerily pertinent results. The ability to synthesize highly realistic images – with seemingly no limitation other than our imagination – is sure to yield many exciting and creative applications. These images are also likely to pose new challenges to the photo-forensic community. Motivated by the fact that paint by text is not based on explicit geometric modeling, and that the human visual system is often oblivious to even glaring geometric inconsistencies, we provide an initial exploration of the perspective consistency of DALL-E-2 synthesized images to determine whether geometry-based forensic analyses will prove fruitful in detecting this new breed of synthetic media.


