ORES: Open-vocabulary Responsible Visual Synthesis

08/26/2023
by   Minheng Ni, et al.
0

Avoiding synthesizing specific visual concepts is an essential challenge in responsible visual synthesis. However, the visual concept that needs to be avoided for responsible visual synthesis tends to be diverse, depending on the region, context, and usage scenarios. In this work, we formalize a new task, Open-vocabulary Responsible Visual Synthesis (ORES), where the synthesis model is able to avoid forbidden visual concepts while allowing users to input any desired content. To address this problem, we present a Two-stage Intervention (TIN) framework. By introducing 1) rewriting with learnable instruction through a large-scale language model (LLM) and 2) synthesizing with prompt intervention on a diffusion synthesis model, it can effectively synthesize images avoiding any concepts but following the user's query as much as possible. To evaluate on ORES, we provide a publicly available dataset, baseline models, and benchmark. Experimental results demonstrate the effectiveness of our method in reducing risks of image generation. Our work highlights the potential of LLMs in responsible visual synthesis. Our code and dataset is public available.

READ FULL TEXT

page 2

page 4

page 5

page 7

page 10

page 11

research
06/17/2020

Synthesizing Tasks for Block-based Programming

Block-based visual programming environments play a critical role in intr...
research
06/01/2023

Intelligent Grimm – Open-ended Visual Storytelling via Latent Diffusion Models

Generative models have recently exhibited exceptional capabilities in va...
research
10/17/2022

Visual onoma-to-wave: environmental sound synthesis from visual onomatopoeias and sound-source images

We propose a method for synthesizing environmental sounds from visually ...
research
06/27/2017

Auto-Encoder Guided GAN for Chinese Calligraphy Synthesis

In this paper, we investigate the Chinese calligraphy synthesis problem:...
research
05/26/2023

Neural Task Synthesis for Visual Programming

Generative neural models hold great promise in enhancing programming edu...
research
05/27/2023

Synthesizing a Progression of Subtasks for Block-Based Visual Programming Tasks

Block-based visual programming environments play an increasingly importa...

Please sign up or login with your details

Forgot password? Click here to reset