Jianmin Bao

research

∙ 09/07/2023

InstructDiffusion: A Generalist Modeling Interface for Vision Tasks

We present InstructDiffusion, a unifying and generic framework for align...

0 Zigang Geng, et al. ∙

research

∙ 07/17/2023

AltFreezing for More General Video Face Forgery Detection

Existing face forgery detection models try to discriminate fake images b...

0 Zhendong Wang, et al. ∙

research

∙ 06/08/2023

HQ-50K: A Large-scale, High-quality Dataset for Image Restoration

This paper introduces a new large-scale image restoration dataset, calle...

0 Qinhong Yang, et al. ∙

research

∙ 06/07/2023

Designing a Better Asymmetric VQGAN for StableDiffusion

StableDiffusion is a revolutionary text-to-image generator that is causi...

0 Zixin Zhu, et al. ∙

research

∙ 05/25/2023

Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Text-to-Image diffusion models have made tremendous progress over the pa...

0 Shihao Zhao, et al. ∙

research

∙ 03/22/2023

CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning

This work focuses on sign language retrieval-a recently proposed task fo...

0 Yiting Cheng, et al. ∙

research

∙ 03/16/2023

Efficient Diffusion Training via Min-SNR Weighting Strategy

Denoising diffusion models have been a mainstream approach for image gen...

0 Tiankai Hang, et al. ∙

research

∙ 03/16/2023

DIRE for Diffusion-Generated Image Detection

Diffusion models have shown remarkable success in visual synthesis, but ...

0 Zhendong Wang, et al. ∙

research

∙ 12/12/2022

CLIP Itself is a Strong Fine-tuner: Achieving 85.7 Accuracy with ViT-B and ViT-L on ImageNet

Recent studies have shown that CLIP has achieved remarkable success in p...

0 Xiaoyi Dong, et al. ∙

research

∙ 12/07/2022

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

Copy-Paste is a simple and effective data augmentation strategy for inst...

0 Hanqing Zhao, et al. ∙

research

∙ 11/28/2022

CLIP2GAN: Towards Bridging Text with the Latent Space of GANs

In this work, we are dedicated to text-guided image generation and propo...

0 Yixuan Wang, et al. ∙

research

∙ 11/22/2022

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

We present SinDiffusion, leveraging denoising diffusion models to captur...

0 Weilun Wang, et al. ∙

research

∙ 08/25/2022

MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining

This paper presents a simple yet effective framework MaskCLIP, which inc...

18 Xiaoyi Dong, et al. ∙

research

∙ 07/14/2022

Bootstrapped Masked Autoencoders for Vision BERT Pretraining

We propose bootstrapped masked autoencoders (BootMAE), a new approach fo...

21 Xiaoyi Dong, et al. ∙

research

∙ 06/30/2022

Semantic Image Synthesis via Diffusion Models

Denoising Diffusion Probabilistic Models (DDPMs) have achieved remarkabl...

6 Weilun Wang, et al. ∙

research

∙ 06/22/2022

I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation

In this paper, we present the Intra- and Inter-Human Relation Networks (...

8 Yiwei Ding, et al. ∙

research

∙ 05/31/2022

Improved Vector Quantized Diffusion Models

Vector quantized diffusion (VQ-Diffusion) is a powerful generative model...

22 Zhicong Tang, et al. ∙

research

∙ 05/27/2022

Contrastive Learning Rivals Masked Image Modeling in Fine-tuning via Feature Distillation

Masked image modeling (MIM) learns representations with remarkably good ...

4 Yixuan Wei, et al. ∙

research

∙ 03/30/2022

Large-Scale Pre-training for Person Re-identification with Noisy Labels

This paper aims to address the problem of pre-training for person re-ide...

4 Dengpan Fu, et al. ∙

research

∙ 03/29/2022

Semi-Supervised Image-to-Image Translation using Latent Space Mapping

Recent image-to-image translation works have been transferred from super...

7 Pan Zhang, et al. ∙

research

∙ 03/02/2022

Protecting Celebrities with Identity Consistency Transformer

In this work we propose Identity Consistency Transformer, a novel face f...

9 Xiaoyi Dong, et al. ∙

research

∙ 12/20/2021

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Despite the tantalizing success in a broad of vision tasks, transformers...

15 Bowen Zhang, et al. ∙

research

∙ 12/06/2021

General Facial Representation Learning in a Visual-Linguistic Manner

How to learn a universal facial representation that boosts all face anal...

7 Yinglin Zheng, et al. ∙

research

∙ 11/29/2021

Vector Quantized Diffusion Model for Text-to-Image Synthesis

We present the vector quantized diffusion (VQ-Diffusion) model for text-...

10 Shuyang Gu, et al. ∙

research

∙ 11/24/2021

PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers

This paper explores a better codebook for BERT pre-training of vision tr...

23 Xiaoyi Dong, et al. ∙

research

∙ 11/18/2021

SimMIM: A Simple Framework for Masked Image Modeling

This paper presents SimMIM, a simple framework for masked image modeling...

0 Zhenda Xie, et al. ∙

research

∙ 08/15/2021

Exploring Temporal Coherence for More General Video Face Forgery Detection

Although current face manipulation techniques achieve impressive perform...

2 Yinglin Zheng, et al. ∙

research

∙ 08/13/2021

Dual Path Learning for Domain Adaptation of Semantic Segmentation

Domain adaptation for semantic segmentation enables to alleviate the nee...

5 Yiting Cheng, et al. ∙

research

∙ 08/10/2021

Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation

Contrastive learning shows great potential in unpaired image-to-image tr...

5 Weilun Wang, et al. ∙

research

∙ 07/01/2021

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped Windows

We present CSWin Transformer, an efficient and effective Transformer-bas...

2 Xiaoyi Dong, et al. ∙

research

∙ 06/06/2021

Uformer: A General U-Shaped Transformer for Image Restoration

In this paper, we present Uformer, an effective and efficient Transforme...

0 Zhendong Wang, et al. ∙

research

∙ 03/29/2021

High-Fidelity and Arbitrary Face Editing

Cycle consistency is widely used for face editing. However, we observe t...

9 Yue Gao, et al. ∙

research

∙ 12/07/2020

Identity-Driven DeepFake Detection

DeepFake detection has so far been dominated by “artifact-driven” method...

9 Xiaoyi Dong, et al. ∙

research

∙ 12/07/2020

Unsupervised Pre-training for Person Re-identification

In this paper, we present a large scale unlabeled person re-identificati...

5 Dengpan Fu, et al. ∙

research

∙ 12/03/2020

Full-Resolution Correspondence Learning for Image Translation

We present the full-resolution correspondence learning for cross-domain ...

7 Xingran Zhou, et al. ∙

research

∙ 11/22/2020

Learnable Sampling 3D Convolution for Video Enhancement and Action Recognition

A key challenge in video enhancement and action recognition is to fuse u...

1 Shuyang Gu, et al. ∙

research

∙ 10/26/2020

GreedyFool: Distortion-Aware Sparse Adversarial Attack

Modern deep neural networks(DNNs) are vulnerable to adversarial samples....

0 Xiaoyi Dong, et al. ∙

research

∙ 09/21/2020

Improving Person Re-identification with Iterative Impression Aggregation

Our impression about one person often updates after we see more aspects ...

0 Dengpan Fu, et al. ∙

research

∙ 06/30/2020

PriorGAN: Real Data Prior for Generative Adversarial Nets

Generative adversarial networks (GANs) have achieved rapid progress in l...

0 Shuyang Gu, et al. ∙

research

∙ 03/19/2020

GIQA: Generated Image Quality Assessment

Generative adversarial networks (GANs) have achieved impressive results ...

0 Shuyang Gu, et al. ∙

research

∙ 12/31/2019

Face X-ray for More General Face Forgery Detection

In this paper we propose a novel image representation called face X-ray ...

14 Lingzhi Li, et al. ∙

research

∙ 12/31/2019

FaceShifter: Towards High Fidelity And Occlusion Aware Face Swapping

In this work, we propose a novel two-stage framework, called FaceShifter...

21 Lingzhi Li, et al. ∙

research

∙ 05/24/2019

Mask-Guided Portrait Editing with Conditional GANs

Portrait editing is a popular subject in photo manipulation. The Generat...

0 Shuyang Gu, et al. ∙

research

∙ 03/29/2018

Towards Open-Set Identity Preserving Face Synthesis

We propose a framework based on Generative Adversarial Networks to disen...

0 Jianmin Bao, et al. ∙

research

∙ 03/29/2017

CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

We present variational generative adversarial networks, a general learni...

0 Jianmin Bao, et al. ∙

Jianmin Bao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro