Gaudí: Conversational Interactions with Deep Representations to Generate Image Collections

12/05/2021
by   Victor S. Bursztyn, et al.
0

Based on recent advances in realistic language modeling (GPT-3) and cross-modal representations (CLIP), Gaudí was developed to help designers search for inspirational images using natural language. In the early stages of the design process, with the goal of eliciting a client's preferred creative direction, designers will typically create thematic collections of inspirational images called "mood-boards". Creating a mood-board involves sequential image searches which are currently performed using keywords or images. Gaudí transforms this process into a conversation where the user is gradually detailing the mood-board's theme. This representation allows our AI to generate new search queries from scratch, straight from a project briefing, following a theme hypothesized by GPT-3. Compared to previous computational approaches to mood-board creation, to the best of our knowledge, ours is the first attempt to represent mood-boards as the stories that designers tell when presenting a creative direction to a client.

READ FULL TEXT

page 1

page 2

research
07/26/2023

Neural-based Cross-modal Search and Retrieval of Artwork

Creating an intelligent search and retrieval system for artwork images, ...
research
03/05/2023

Composing Mood Board with User Feedback in Concept Space

We propose the Mood Board Composer (MBC), which supports concept designe...
research
03/07/2020

PathVQA: 30000+ Questions for Medical Visual Question Answering

Is it possible to develop an "AI Pathologist" to pass the board-certifie...
research
08/24/2021

A QuadTree Image Representation for Computational Pathology

The field of computational pathology presents many challenges for comput...
research
07/18/2023

PromptMagician: Interactive Prompt Engineering for Text-to-Image Creation

Generative text-to-image models have gained great popularity among the p...
research
08/12/2016

DeepDiary: Automatic Caption Generation for Lifelogging Image Streams

Lifelogging cameras capture everyday life from a first-person perspectiv...
research
06/23/2021

CharacterChat: Supporting the Creation of Fictional Characters through Conversation and Progressive Manifestation with a Chatbot

We present CharacterChat, a concept and chatbot to support writers in cr...

Please sign up or login with your details

Forgot password? Click here to reset