StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation

09/04/2023
by   Zhouxia Wang, et al.
0

This paper presents a LoRA-free method for stylized image generation that takes a text prompt and style reference images as inputs and produces an output image in a single pass. Unlike existing methods that rely on training a separate LoRA for each style, our method can adapt to various styles with a unified model. However, this poses two challenges: 1) the prompt loses controllability over the generated content, and 2) the output image inherits both the semantic and style features of the style reference image, compromising its content fidelity. To address these challenges, we introduce StyleAdapter, a model that comprises two components: a two-path cross-attention module (TPCA) and three decoupling strategies. These components enable our model to process the prompt and style reference features separately and reduce the strong coupling between the semantic and style information in the style references. StyleAdapter can generate high-quality images that match the content of the prompts and adopt the style of the references (even for unseen styles) in a single pass, which is more flexible and efficient than previous methods. Experiments have been conducted to demonstrate the superiority of our method over previous works.

READ FULL TEXT
research
11/14/2022

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation

Diffusion-based text-to-image generation models like GLIDE and DALLE-2 h...
research
07/27/2023

PromptStyler: Prompt-driven Style Generation for Source-free Domain Generalization

In a joint vision-language space, a text feature (e.g., from "a photo of...
research
03/30/2021

Diagonal Attention and Style-based GAN for Content-Style Disentanglement in Image Generation and Translation

One of the important research topics in image generative models is to di...
research
07/22/2022

Few-shot Image Generation Using Discrete Content Representation

Few-shot image generation and few-shot image translation are two related...
research
06/22/2022

A Fast Text-Driven Approach for Generating Artistic Content

In this work, we propose a complete framework that generates visual art....
research
05/30/2023

AlteredAvatar: Stylizing Dynamic 3D Avatars with Fast Style Adaptation

This paper presents a method that can quickly adapt dynamic 3D avatars t...
research
09/13/2023

MagiCapture: High-Resolution Multi-Concept Portrait Customization

Large-scale text-to-image models including Stable Diffusion are capable ...

Please sign up or login with your details

Forgot password? Click here to reset