Xiaojun Chang

research

∙ 09/20/2023

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

Dominant Person Search methods aim to localize and recognize query perso...

1 Chengyou Jia, et al. ∙

research

∙ 08/20/2023

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

Despite significant progress in Text-to-Image (T2I) generative models, e...

1 Chengyou Jia, et al. ∙

research

∙ 07/31/2023

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Multi-modality fusion and multi-task learning are becoming trendy in 3D ...

1 Zhijian Huang, et al. ∙

research

∙ 07/22/2023

Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

As a fundamental aspect of human life, two-person interactions contain m...

1 Yao Liu, et al. ∙

research

∙ 05/04/2023

Toward the Automated Construction of Probabilistic Knowledge Graphs for the Maritime Domain

International maritime crime is becoming increasingly sophisticated, oft...

1 Fatemeh Shiri, et al. ∙

research

∙ 04/26/2023

Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining

Medical artificial general intelligence (MAGI) enables one foundation mo...

17 Bingqian Lin, et al. ∙

research

∙ 04/24/2023

A Benchmark for Cycling Close Pass Near Miss Event Detection from Video Streams

Cycling is a healthy and sustainable mode of transport. However, interac...

1 Mingjie Li, et al. ∙

research

∙ 03/18/2023

Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation

Automatic radiology reporting has great clinical potential to relieve ra...

1 Mingjie Li, et al. ∙

research

∙ 03/07/2023

Guided Image-to-Image Translation by Discriminator-Generator Communication

The goal of Image-to-image (I2I) translation is to transfer an image fro...

1 Yuanjiang Cao, et al. ∙

research

∙ 12/02/2022

3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation

Text-guided 3D object generation aims to generate 3D objects described b...

1 Zutao Jiang, et al. ∙

research

∙ 11/05/2022

Simple Primitives with Feasibility- and Contextuality-Dependence for Open-World Compositional Zero-shot Learning

The task of Compositional Zero-Shot Learning (CZSL) is to recognize imag...

1 Zhe Liu, et al. ∙

research

∙ 10/16/2022

Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers

Automatic data augmentation (AutoAugment) strategies are indispensable i...

3 Tao Tang, et al. ∙

research

∙ 10/15/2022

PAR: Political Actor Representation Learning with Social Context and Expert Knowledge

Modeling the ideological perspectives of political actors is an essentia...

9 Shangbin Feng, et al. ∙

research

∙ 10/11/2022

ViLPAct: A Benchmark for Compositional Generalization on Multimodal Human Activities

We introduce ViLPAct, a novel vision-language benchmark for human activi...

9 Terry Yue Zhuo, et al. ∙

research

∙ 09/28/2022

Prompt-driven efficient Open-set Semi-supervised Learning

Open-set semi-supervised learning (OSSL) has attracted growing interest,...

1 Haoran Li, et al. ∙

research

∙ 07/21/2022

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection

The task of action detection aims at deducing both the action category a...

1 Yuetian Weng, et al. ∙

research

∙ 07/16/2022

Generalizable Memory-driven Transformer for Multivariate Long Sequence Time-series Forecasting

Multivariate long sequence time-series forecasting (M-LSTF) is a practic...

1 Mingjie Li, et al. ∙

research

∙ 07/04/2022

Domain Adaptive Nuclei Instance Segmentation and Classification via Category-aware Feature Alignment and Pseudo-labelling

Unsupervised domain adaptation (UDA) methods have been broadly utilized ...

1 Canran Li, et al. ∙

research

∙ 06/04/2022

Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation

Automatic generation of ophthalmic reports using data-driven neural netw...

1 Mingjie Li, et al. ∙

research

∙ 06/01/2022

Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL

Cooperative multi-agent reinforcement learning (MARL) is making rapid pr...

1 Siyi Hu, et al. ∙

research

∙ 05/20/2022

Towards Explanation for Unsupervised Graph-Level Representation Learning

Due to the superior performance of Graph Neural Networks (GNNs) in vario...

1 Qinghua Zheng, et al. ∙

research

∙ 04/27/2022

PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search

Neural architecture search (NAS) aims to automate architecture engineeri...

1 Yameng Peng, et al. ∙

research

∙ 03/28/2022

Automated Progressive Learning for Efficient Training of Vision Transformers

Recent advances in vision Transformers (ViTs) have come with a voracious...

10 Changlin Li, et al. ∙

research

∙ 03/27/2022

CGUA: Context-Guided and Unpaired-Assisted Weakly Supervised Person Search

Recently, weakly supervised person search is proposed to discard human-a...

1 Chengyou Jia, et al. ∙

research

∙ 03/04/2022

Voice-Face Homogeneity Tells Deepfake

Detecting forgery videos is highly desirable due to the abuse of deepfak...

1 Harry Cheng, et al. ∙

research

∙ 02/08/2022

Exploring Inter-Channel Correlation for Diversity-preserved KnowledgeDistillation

Knowledge Distillation has shown very promising abil-ity in transferring...

1 Li Liu, et al. ∙

research

∙ 01/06/2022

Balancing Generalization and Specialization in Zero-shot Learning

Zero-Shot Learning (ZSL) aims to transfer classification capability from...

1 Yun Li, et al. ∙

research

∙ 11/25/2021

BaLeNAS: Differentiable Architecture Search via the Bayesian Learning Rule

Differentiable Architecture Search (DARTS) has received massive attentio...

1 Miao Zhang, et al. ∙

research

∙ 11/03/2021

An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning

Zero-Shot Learning (ZSL) aims to transfer learned knowledge from observe...

1 Yun Li, et al. ∙

research

∙ 10/22/2021

Signature-Graph Networks

We propose a novel approach for visual representation learning called Si...

0 Ali Hamdi, et al. ∙

research

∙ 10/17/2021

Dynamic Slimmable Denoising Network

Recently, tremendous human-designed and automatically searched neural ne...

0 Zutao Jiang, et al. ∙

research

∙ 10/12/2021

Reliable Shot Identification for Complex Event Detection via Visual-Semantic Embedding

Multimedia event detection is the task of detecting a specific event of ...

0 Minnan Luo, et al. ∙

research

∙ 09/21/2021

DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers

Dynamic networks have shown their promising capability in reducing theor...

1 Changlin Li, et al. ∙

research

∙ 08/09/2021

Encoding Heterogeneous Social and Political Context for Entity Stance Prediction

Political stance detection has become an important task due to the incre...

3 Shangbin Feng, et al. ∙

research

∙ 08/09/2021

Knowledge Graph Augmented Political Perspective Detection in News Media

Identifying political perspective in news media has become an important ...

0 Shangbin Feng, et al. ∙

research

∙ 06/22/2021

Differentiable Architecture Search Without Training Nor Labels: A Pruning Perspective

With leveraging the weight-sharing and continuous relaxation to enable g...

13 Miao Zhang, et al. ∙

research

∙ 06/15/2021

Vision-Language Navigation with Random Environmental Mixup

Vision-language Navigation (VLN) tasks require an agent to navigate step...

0 Chong Liu, et al. ∙

research

∙ 05/01/2021

Person Search Challenges and Solutions: A Survey

Person search has drawn increasing attention due to its real-world appli...

0 Xiangtan Lin, et al. ∙

research

∙ 03/31/2021

SOON: Scenario Oriented Object Navigation with Graph-based Exploration

The ability to navigate like a human towards a language-guided target fr...

8 Fengda Zhu, et al. ∙

research

∙ 03/24/2021

Dynamic Slimmable Network

Current dynamic networks and dynamic pruning methods have shown their pr...

1 Changlin Li, et al. ∙

research

∙ 03/23/2021

BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

A myriad of recent breakthroughs in hand-crafted neural architectures fo...

1 Changlin Li, et al. ∙

research

∙ 01/20/2021

UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers

Recent advances in multi-agent reinforcement learning have been largely ...

0 Siyi Hu, et al. ∙

research

∙ 10/26/2020

Hierarchical Neural Architecture Search for Deep Stereo Matching

To reduce the human efforts in neural network design, Neural Architectur...

0 Xuelian Cheng, et al. ∙

research

∙ 09/24/2020

Self-Weighted Robust LDA for Multiclass Classification with Edge Classes

Linear discriminant analysis (LDA) is a popular technique to learn the m...

12 Caixia Yan, et al. ∙

research

∙ 08/30/2020

A Survey of Deep Active Learning

Active learning (AL) attempts to maximize the performance gain of the mo...

148 Pengzhen Ren, et al. ∙

research

∙ 07/03/2020

Accurate Bounding-box Regression with Distance-IoU Loss for Visual Tracking

Most existing tracking methods are based on using a classifier and multi...

18 Di Yuan, et al. ∙

research

∙ 06/23/2020

Multi-view Drone-based Geo-localization via Style and Spatial Alignment

In this paper, we focus on the task of multi-view multi-source geo-local...

0 Siyi Hu, et al. ∙

research

∙ 06/19/2020

Melanoma Diagnosis with Spatio-Temporal Feature Learning on Sequential Dermoscopic Images

Existing studies for automated melanoma diagnosis are based on single-ti...

0 Zhen Yu, et al. ∙

research

∙ 06/06/2020

Auxiliary Signal-Guided Knowledge Encoder-Decoder for Medical Report Generation

Beyond the common difficulties faced in the natural image captioning, me...

8 Mingjie Li, et al. ∙

research

∙ 06/01/2020

A Comprehensive Survey of Neural Architecture Search: Challenges and Solutions

Deep learning has made major breakthroughs and progress in many fields. ...

76 Pengzhen Ren, et al. ∙

Xiaojun Chang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro