Labeling neural network submodules with human-legible descriptions is us...
Text-to-image models suffer from various safety issues that may limit th...
Motivated by recent advancements in text-to-image diffusion, we study er...
The CLIP network measures the similarity between natural text and images...
Human action is naturally compositional: humans can easily recognize and...
Neural networks trained on datasets such as ImageNet have led to major
a...