Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search

02/02/2021
by   Federico A. Galatolo, et al.
5

In this research work we present CLIP-GLaSS, a novel zero-shot framework to generate an image (or a caption) corresponding to a given caption (or image). CLIP-GLaSS is based on the CLIP neural network, which, given an image and a descriptive caption, provides similar embeddings. Differently, CLIP-GLaSS takes a caption (or an image) as an input, and generates the image (or the caption) whose CLIP embedding is the most similar to the input one. This optimal image (or caption) is produced via a generative network, after an exploration by a genetic algorithm. Promising results are shown, based on the experimentation of the image Generators BigGAN and StyleGAN2, and of the text Generator GPT2

READ FULL TEXT

page 6

page 7

page 8

page 9

research
06/05/2023

ZIGNeRF: Zero-shot 3D Scene Representation with Invertible Generative Neural Radiance Fields

Generative Neural Radiance Fields (NeRFs) have demonstrated remarkable p...
research
07/23/2020

Zero-Shot Recognition through Image-Guided Semantic Classification

We present a new embedding-based framework for zero-shot learning (ZSL)....
research
03/11/2023

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

Natural Language Generation (NLG) accepts input data in the form of imag...
research
07/18/2022

Towards Diverse and Faithful One-shot Adaption of Generative Adversarial Networks

One-shot generative domain adaption aims to transfer a pre-trained gener...
research
12/02/2021

FuseDream: Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization

Generating images from natural language instructions is an intriguing ye...
research
07/17/2023

Zero-Shot Image Harmonization with Generative Model Prior

Recent image harmonization methods have demonstrated promising results. ...
research
03/19/2021

Paint by Word

We investigate the problem of zero-shot semantic image painting. Instead...

Please sign up or login with your details

Forgot password? Click here to reset