b'Qi Tian'

research

∙ 08/27/2023

Computation-efficient Deep Learning for Computer Vision: A Survey

Over the past decade, deep learning models have exhibited considerable a...

0 Yulin Wang, et al. ∙

research

∙ 08/08/2023

Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation

Transformers have become the primary backbone of the computer vision com...

0 Shuangrui Ding, et al. ∙

research

∙ 08/04/2023

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation

Recent score-based diffusion models (SBDMs) show promising results in un...

0 Shikun Sun, et al. ∙

research

∙ 07/20/2023

Human Motion Generation: A Survey

Human motion generation aims to generate natural human pose sequences an...

0 Wentao Zhu, et al. ∙

research

∙ 06/28/2023

Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

Representation learning has been evolving from traditional supervised tr...

0 Bowen Shi, et al. ∙

research

∙ 06/14/2023

Towards AGI in Computer Vision: Lessons Learned from GPT and Large Language Models

The AI community has been pursuing algorithms known as artificial genera...

0 Lingxi Xie, et al. ∙

research

∙ 06/09/2023

Exploring Effective Mask Sampling Modeling for Neural Image Compression

Image compression aims to reduce the information redundancy in images. M...

0 Lin Liu, et al. ∙

research

∙ 06/08/2023

Joint Channel Estimation and Feedback with Masked Token Transformers in Massive MIMO Systems

When the base station has downlink channel status information (CSI), the...

0 Mingming Zhao, et al. ∙

research

∙ 05/24/2023

Reasoning over Hierarchical Question Decomposition Tree for Explainable Question Answering

Explainable question answering (XQA) aims to answer a given question and...

0 Jiajie Zhang, et al. ∙

research

∙ 05/22/2023

ControlVideo: Training-free Controllable Text-to-Video Generation

Text-driven diffusion models have unlocked unprecedented abilities in im...

0 Yabo Zhang, et al. ∙

research

∙ 05/18/2023

Advancing Incremental Few-shot Semantic Segmentation via Semantic-guided Relation Alignment and Adaptation

Incremental few-shot semantic segmentation (IFSS) aims to incrementally ...

0 Yuan Zhou, et al. ∙

research

∙ 05/15/2023

Mode Approximation Makes Good Vision-Language Prompts

With the advance of large-scale model technologies, parameter-efficient ...

0 Haixin Wang, et al. ∙

research

∙ 05/11/2023

Continual Vision-Language Representation Learning with Off-Diagonal Information

This paper discusses the feasibility of continuously training the CLIP m...

0 Zixuan Ni, et al. ∙

research

∙ 05/10/2023

Visual Tuning

Fine-tuning visual models has been widely shown promising performance on...

0 Bruce X. B. Yu, et al. ∙

research

∙ 04/24/2023

Segment Anything in 3D with NeRFs

The Segment Anything Model (SAM) has demonstrated its effectiveness in s...

0 Jiazhong Cen, et al. ∙

research

∙ 04/22/2023

Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism

The Mixture of Experts (MoE) model becomes an important choice of large ...

0 Xin Chen, et al. ∙

research

∙ 04/22/2023

SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval

Legal case retrieval, which aims to find relevant cases for a query case...

0 Haitao Li, et al. ∙

research

∙ 04/12/2023

Learning Transferable Pedestrian Representation from Multimodal Information Supervision

Recent researches on unsupervised person re-identification (reID) have d...

0 Liping Bao, et al. ∙

research

∙ 04/07/2023

PSLT: A Light-weight Vision Transformer with Ladder Self-Attention and Progressive Shift

Vision Transformer (ViT) has shown great potential for various visual ta...

0 Gaojie Wu, et al. ∙

research

∙ 03/21/2023

Multi-modal Prompting for Low-Shot Temporal Action Localization

In this paper, we consider the problem of temporal action localization u...

1 Chen Ju, et al. ∙

research

∙ 03/17/2023

LION: Implicit Vision Prompt Tuning

Despite recent competitive performance across a range of vision tasks, v...

0 Haixin Wang, et al. ∙

research

∙ 03/16/2023

Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation

We study unsupervised domain adaptation (UDA) for semantic segmentation....

0 Xinyue Huo, et al. ∙

research

∙ 03/14/2023

USAGE: A Unified Seed Area Generation Paradigm for Weakly Supervised Semantic Segmentation

Seed area generation is usually the starting point of weakly supervised ...

0 Zelin Peng, et al. ∙

research

∙ 03/12/2023

Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models

Prompt tuning, a recently emerging paradigm, enables the powerful vision...

0 Juncheng Li, et al. ∙

research

∙ 03/09/2023

R-Tuning: Regularized Prompt Tuning in Open-Set Scenarios

In realistic open-set scenarios where labels of a part of testing data a...

0 Ning Liao, et al. ∙

research

∙ 03/09/2023

Rethinking Visual Prompt Learning as Masked Visual Token Modeling

Prompt learning has achieved great success in efficiently exploiting lar...

0 Ning Liao, et al. ∙

research

∙ 03/07/2023

Lformer: Text-to-Image Generation with L-shape Block Parallel Decoding

Generative transformers have shown their superiority in synthesizing hig...

0 Jiacheng Li, et al. ∙

research

∙ 02/20/2023

Constraint and Union for Partially-Supervised Temporal Sentence Grounding

Temporal sentence grounding aims to detect the event timestamps describe...

0 Chen Ju, et al. ∙

research

∙ 02/05/2023

ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories

Diffusion models have recently exhibited remarkable abilities to synthes...

0 Zijian Zhang, et al. ∙

research

∙ 12/26/2022

Prototype-guided Cross-task Knowledge Distillation for Large-scale Models

Recently, large-scale pre-trained models have shown their advantages in ...

0 Deng Li, et al. ∙

research

∙ 12/19/2022

Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

Weakly-supervised temporal action localization (WTAL) learns to detect a...

0 Chen Ju, et al. ∙

research

∙ 12/14/2022

FedSkip: Combatting Statistical Heterogeneity with Federated Skip Aggregation

The statistical heterogeneity of the non-independent and identically dis...

0 Ziqing Fan, et al. ∙

research

∙ 12/12/2022

Feature Calibration Network for Occluded Pedestrian Detection

Pedestrian detection in the wild remains a challenging problem especiall...

1 Tianliang Zhang, et al. ∙

research

∙ 12/04/2022

ConfounderGAN: Protecting Image Data Privacy with Causal Confounder

The success of deep learning is partly attributed to the availability of...

0 Qi Tian, et al. ∙

research

∙ 11/28/2022

Learning From Good Trajectories in Offline Multi-Agent Reinforcement Learning

Offline multi-agent reinforcement learning (MARL) aims to learn effectiv...

0 Qi Tian, et al. ∙

research

∙ 11/23/2022

Integrally Pre-Trained Transformer Pyramid Networks

In this paper, we present an integral pre-training framework based on ma...

0 Yunjie Tian, et al. ∙

research

∙ 11/03/2022

Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast

In this paper, we present Pangu-Weather, a deep learning based system fo...

1 Kaifeng Bi, et al. ∙

research

∙ 10/28/2022

OhMG: Zero-shot Open-vocabulary Human Motion Generation

Generating motion in line with text has attracted increasing attention n...

0 Junfan Lin, et al. ∙

research

∙ 10/14/2022

See Blue Sky: Deep Image Dehaze Using Paired and Unpaired Training Images

The issue of image haze removal has attracted wide attention in recent y...

0 Xiaoyan Zhang, et al. ∙

research

∙ 10/03/2022

Towards a Unified View on Visual Parameter-Efficient Transfer Learning

Since the release of various large-scale natural language processing (NL...

0 Bruce X. B. Yu, et al. ∙

research

∙ 10/01/2022

Learnable Distribution Calibration for Few-Shot Class-Incremental Learning

Few-shot class-incremental learning (FSCIL) faces challenges of memorizi...

0 Binghao Liu, et al. ∙

research

∙ 10/01/2022

Motion-inductive Self-supervised Object Discovery in Videos

In this paper, we consider the task of unsupervised object discovery in ...

7 Shuangrui Ding, et al. ∙

research

∙ 08/23/2022

Low-Light Video Enhancement with Synthetic Event Guidance

Low-light video enhancement (LLVE) is an important yet challenging task ...

1 Lin Liu, et al. ∙

research

∙ 08/22/2022

Prompt-Matched Semantic Segmentation

The objective of this work is to explore how to effectively and efficien...

0 Lingbo Liu, et al. ∙

research

∙ 08/04/2022

Fine-Grained Semantically Aligned Vision-Language Pre-Training

Large-scale vision-language pre-training has shown impressive advances i...

2 Juncheng Li, et al. ∙

research

∙ 08/03/2022

Dilated Context Integrated Network with Cross-Modal Consensus for Temporal Emotion Localization in Videos

Understanding human emotions is a crucial ability for intelligent robots...

2 Juncheng Li, et al. ∙

research

∙ 07/31/2022

SdAE: Self-distillated Masked Autoencoder

With the development of generative-based self-supervised learning (SSL) ...

0 Yabo Chen, et al. ∙

research

∙ 07/31/2022

Skeleton-Parted Graph Scattering Networks for 3D Human Motion Prediction

Graph convolutional network based methods that model the body-joints' re...

0 Maosen Li, et al. ∙

research

∙ 07/29/2022

Fine-grained Retrieval Prompt Tuning

Fine-grained object retrieval aims to learn discriminative representatio...

8 Shijie Wang, et al. ∙

research

∙ 07/28/2022

Pro-tuning: Unified Prompt Tuning for Vision Tasks

In computer vision, fine-tuning is the de-facto approach to leverage pre...

0 Xing Nie, et al. ∙

Qi Tian

Featured Co-authors

Sign in with Google

Consider DeepAI Pro