We present InstructDiffusion, a unifying and generic framework for align...
Existing face forgery detection models try to discriminate fake images b...
This paper introduces a new large-scale image restoration dataset, calle...
StableDiffusion is a revolutionary text-to-image generator that is causi...
Text-to-Image diffusion models have made tremendous progress over the pa...
This work focuses on sign language retrieval-a recently proposed task fo...
Denoising diffusion models have been a mainstream approach for image
gen...
Diffusion models have shown remarkable success in visual synthesis, but ...
Recent studies have shown that CLIP has achieved remarkable success in
p...
Copy-Paste is a simple and effective data augmentation strategy for inst...
In this work, we are dedicated to text-guided image generation and propo...
We present SinDiffusion, leveraging denoising diffusion models to captur...
This paper presents a simple yet effective framework MaskCLIP, which
inc...
We propose bootstrapped masked autoencoders (BootMAE), a new approach fo...
Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkabl...
In this paper, we present the Intra- and Inter-Human Relation Networks
(...
Vector quantized diffusion (VQ-Diffusion) is a powerful generative model...
Masked image modeling (MIM) learns representations with remarkably good
...
This paper aims to address the problem of pre-training for person
re-ide...
Recent image-to-image translation works have been transferred from super...
In this work we propose Identity Consistency Transformer, a novel face
f...
Despite the tantalizing success in a broad of vision tasks, transformers...
How to learn a universal facial representation that boosts all face anal...
We present the vector quantized diffusion (VQ-Diffusion) model for
text-...
This paper explores a better codebook for BERT pre-training of vision
tr...
This paper presents SimMIM, a simple framework for masked image modeling...
Although current face manipulation techniques achieve impressive perform...
Domain adaptation for semantic segmentation enables to alleviate the nee...
Contrastive learning shows great potential in unpaired image-to-image
tr...
We present CSWin Transformer, an efficient and effective Transformer-bas...
In this paper, we present Uformer, an effective and efficient
Transforme...
Cycle consistency is widely used for face editing. However, we observe t...
DeepFake detection has so far been dominated by “artifact-driven” method...
In this paper, we present a large scale unlabeled person re-identificati...
We present the full-resolution correspondence learning for cross-domain
...
A key challenge in video enhancement and action recognition is to fuse u...
Modern deep neural networks(DNNs) are vulnerable to adversarial samples....
Our impression about one person often updates after we see more aspects ...
Generative adversarial networks (GANs) have achieved rapid progress in
l...
Generative adversarial networks (GANs) have achieved impressive results
...
In this paper we propose a novel image representation called face X-ray ...
In this work, we propose a novel two-stage framework, called FaceShifter...
Portrait editing is a popular subject in photo manipulation. The Generat...
We propose a framework based on Generative Adversarial Networks to
disen...
We present variational generative adversarial networks, a general learni...