Discovering the Hidden Vocabulary of DALLE-2

06/01/2022
by   Giannis Daras, et al.

We discover that DALLE-2 seems to have a hidden vocabulary that can be used to generate images from absurd prompts. For example, one seemingly nonsensical string appears to mean birds, and another (sometimes) means bugs or pests. We find that these prompts are often consistent in isolation and sometimes also in combination. We present our black-box method for discovering words that seem random but correspond to some visual concept. This creates important security and interpretability challenges.
