Jurassic World Remake: Bringing Ancient Fossils Back to Life via Zero-Shot Long Image-to-Image Translation

08/14/2023
by Alexander Martin, et al.

With a strong understanding of the target domain from natural language, we produce promising results in translating across large domain gaps and bringing skeletons back to life. In this work, we use text-guided latent diffusion models for zero-shot image-to-image translation (I2I) across large domain gaps (longI2I), where large amounts of new visual features and new geometry must be generated to enter the target domain. The ability to translate across large domain gaps has a wide variety of real-world applications in criminology, astrology, environmental conservation, and paleontology. We introduce a new task, Skull2Animal, for translating between skulls and living animals. On this task, we find that unguided Generative Adversarial Networks (GANs) are not capable of translating across large domain gaps. Instead of these traditional I2I methods, we explore guided diffusion and image editing models and provide a new benchmark model, Revive-2I, capable of performing zero-shot I2I via text-prompting latent diffusion models. We find that guidance is necessary for longI2I because bridging the large domain gap requires prior knowledge about the target domain. In addition, we find that prompting provides the best and most scalable information about the target domain, as classifier-guided diffusion models require retraining for specific use cases and lack stronger constraints on the target domain because of the wide variety of images they are trained on.


