Discovering the Hidden Vocabulary of DALLE-2

06/01/2022

∙

We discover that DALLE-2 seems to have a hidden vocabulary that can be used to generate images with absurd prompts. For example, it seems that means birds and (sometimes) means bugs or pests. We find that these prompts are often consistent in isolation but also sometimes in combinations. We present our black-box method to discover words that seem random but have some correspondence to visual concepts. This creates important security and interpretability challenges.

READ FULL TEXT

Discovering the Hidden Vocabulary of DALLE-2

Sign in with Google

Consider DeepAI Pro