VQBB: Image-to-image Translation with Vector Quantized Brownian Bridge

05/16/2022
by   Bo Li, et al.
0

Image-to-image translation is an important and challenging problem in computer vision. Existing approaches like Pixel2Pixel, DualGAN suffer from the instability of GAN and fail to generate diverse outputs because they model the task as a one-to-one mapping. Although diffusion models can generate images with high quality and diversity, current conditional diffusion models still can not maintain high similarity with the condition image on image-to-image translation tasks due to the Gaussian noise added in the reverse process. To address these issues, a novel Vector Quantized Brownian Bridge(VQBB) diffusion model is proposed in this paper. On one hand, Brownian Bridge diffusion process can model the transformation between two domains more accurate and flexible than the existing Markov diffusion methods. As far as the authors know, it is the first work for Brownian Bridge diffusion process proposed for image-to-image translation. On the other hand, the proposed method improved the learning efficiency and translation accuracy by confining the diffusion process in the quantized latent space. Finally, numerical experimental results validated the performance of the proposed method.

READ FULL TEXT
research
05/24/2023

Unpaired Image-to-Image Translation via Neural Schrödinger Bridge

Diffusion models are a powerful class of generative models which simulat...
research
11/10/2021

Palette: Image-to-Image Diffusion Models

We introduce Palette, a simple and general framework for image-to-image ...
research
08/06/2023

Photorealistic and Identity-Preserving Image-Based Emotion Manipulation with Latent Diffusion Models

In this paper, we investigate the emotion manipulation capabilities of d...
research
04/06/2022

The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models

Recently, diffusion models were applied to a wide range of image analysi...
research
10/26/2019

Image to Image Translation based on Convolutional Neural Network Approach for Speech Declipping

Clipping, as a current nonlinear distortion, often occurs due to the lim...
research
08/04/2023

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation

Recent score-based diffusion models (SBDMs) show promising results in un...
research
08/08/2023

A Comparative Study of Image-to-Image Translation Using GANs for Synthetic Child Race Data

The lack of ethnic diversity in data has been a limiting factor of face ...

Please sign up or login with your details

Forgot password? Click here to reset