b'Xiangyang Xue'

research

∙ 09/09/2023

DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions

Multiple object tracking (MOT) tends to become more challenging when sev...

0 Teng Fu, et al. ∙

research

∙ 09/03/2023

Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning

Scene text recognition has been studied for decades due to its broad app...

0 Haiyang Yu, et al. ∙

research

∙ 09/03/2023

Orientation-Independent Chinese Text Recognition in Scene Images

Scene text recognition (STR) has attracted much attention due to its bro...

0 Haiyang Yu, et al. ∙

research

∙ 08/30/2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Enabling robots to understand language instructions and react accordingl...

0 Tianyu Wang, et al. ∙

research

∙ 08/21/2023

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective

Person Re-IDentification (Re-ID) as a retrieval task, has achieved treme...

0 Qizao Wang, et al. ∙

research

∙ 08/21/2023

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification

Cloth-changing person Re-IDentification (Re-ID) is a particularly challe...

0 Qizao Wang, et al. ∙

research

∙ 07/21/2023

PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring

Liquid perception is critical for robotic pouring tasks. It usually requ...

0 Haitao Lin, et al. ∙

research

∙ 07/15/2023

Abstracting Concept-Changing Rules for Solving Raven's Progressive Matrix Problems

The abstract visual reasoning ability in human intelligence benefits dis...

0 Fan Shi, et al. ∙

research

∙ 06/19/2023

Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection

Monocular 3D object detection aims to locate objects in different scenes...

0 Xianhui Cheng, et al. ∙

research

∙ 06/16/2023

OCTScenes: A Versatile Real-World Dataset of Tabletop Scenes for Object-Centric Learning

Humans possess the cognitive ability to comprehend scenes in a compositi...

0 Yinxuan Huang, et al. ∙

research

∙ 06/14/2023

Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis

Diffusion models (DMs) have recently gained attention with state-of-the-...

0 Zhiyu Jin, et al. ∙

research

∙ 05/09/2023

Privacy-Preserving Collaborative Chinese Text Recognition with Federated Learning

In Chinese text recognition, to compensate for the insufficient local da...

0 Shangchao Su, et al. ∙

research

∙ 05/06/2023

Exploring One-shot Semi-supervised Federated Learning with A Pre-trained Diffusion Model

Federated learning is a privacy-preserving collaborative learning approa...

0 Mingzhao Yang, et al. ∙

research

∙ 04/28/2023

Multi-to-Single Knowledge Distillation for Point Cloud Semantic Segmentation

3D point cloud semantic segmentation is one of the fundamental tasks for...

0 Shoumeng Qiu, et al. ∙

research

∙ 04/24/2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

Most existing point cloud upsampling methods have roughly three steps: f...

0 Yun He, et al. ∙

research

∙ 03/29/2023

GAT-COBO: Cost-Sensitive Graph Neural Network for Telecom Fraud Detection

Along with the rapid evolution of mobile communication technologies, suc...

0 Xinxin Hu, et al. ∙

research

∙ 03/28/2023

Cost Sensitive GNN-based Imbalanced Learning for Mobile Social Network Fraud Detection

With the rapid development of mobile networks, the people's social conta...

0 Xinxin Hu, et al. ∙

research

∙ 03/26/2023

Semantic Neural Decoding via Cross-Modal Generation

Semantic neural decoding aims to elucidate the cognitive processes of th...

0 Xuelin Qian, et al. ∙

research

∙ 03/26/2023

Learning Versatile 3D Shape Generation with Improved AR Models

Auto-Regressive (AR) models have achieved impressive results in 2D image...

0 Simian Luo, et al. ∙

research

∙ 03/11/2023

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation

GigaMVS presents several challenges to existing Multi-View Stereo (MVS) ...

0 Chenjie Cao, et al. ∙

research

∙ 02/25/2023

SUPS: A Simulated Underground Parking Scenario Dataset for Autonomous Driving

Automatic underground parking has attracted considerable attention as th...

0 Jiawei Hou, et al. ∙

research

∙ 01/06/2023

Exploring Efficient Few-shot Adaptation for Vision Transformers

The task of Few-shot Learning (FSL) aims to do the inference on novel ca...

0 Chengming Xu, et al. ∙

research

∙ 01/03/2023

Vocabulary-informed Zero-shot and Open-set Learning

Despite significant progress in object categorization, in recent years, ...

0 Yanwei Fu, et al. ∙

research

∙ 11/24/2022

Chinese Character Recognition with Radical-Structured Stroke Trees

The flourishing blossom of deep learning has witnessed the rapid develop...

0 Haiyang Yu, et al. ∙

research

∙ 11/21/2022

Compositional Scene Modeling with Global Object-Centric Representations

The appearance of the same object may vary in different scene images due...

0 Tonglin Chen, et al. ∙

research

∙ 11/15/2022

Cross-domain Federated Adaptive Prompt Tuning for CLIP

Federated learning (FL) allows multiple parties to collaboratively train...

0 Shangchao Su, et al. ∙

research

∙ 10/04/2022

Domain Discrepancy Aware Distillation for Model Aggregation in Federated Learning

Knowledge distillation has recently become popular as a method of model ...

0 Shangchao Su, et al. ∙

research

∙ 09/20/2022

Dynamic Graph Message Passing Networks for Visual Recognition

Modelling long-range dependencies is critical for scene understanding ta...

17 Li Zhang, et al. ∙

research

∙ 09/15/2022

Compositional Law Parsing with Latent Random Functions

Human cognition has compositionality. We understand a scene by decomposi...

0 Fan Shi, et al. ∙

research

∙ 08/24/2022

AGO-Net: Association-Guided 3D Point Cloud Object Detection Network

The human brain can effortlessly recognize and localize objects, whereas...

0 Liang Du, et al. ∙

research

∙ 08/18/2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling

Recent progress in 4D implicit representation focuses on globally contro...

5 Boyan Jiang, et al. ∙

research

∙ 08/12/2022

Style Spectroscope: Improve Interpretability and Controllability through Fourier Analysis

Universal style transfer (UST) infuses styles from arbitrary reference i...

19 Zhiyu Jin, et al. ∙

research

∙ 07/19/2022

RCLane: Relay Chain Prediction for Lane Detection

Lane detection is an important component of many real-world autonomous s...

7 Shenghua Xu, et al. ∙

research

∙ 06/30/2022

Cross-domain Federated Object Detection

Detection models trained by one party (server) may face severe performan...

0 Shangchao Su, et al. ∙

research

∙ 06/17/2022

Local Slot Attention for Vision-and-Language Navigation

Vision-and-language navigation (VLN), a frontier study aiming to pave th...

0 Yifeng Zhuang, et al. ∙

research

∙ 05/09/2022

Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions

This paper studies the task of any objects grasping from the known categ...

0 Chilam Cheang, et al. ∙

research

∙ 05/09/2022

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches

In this paper, we are interested in the problem of generating target gra...

0 Haitao Lin, et al. ∙

research

∙ 04/27/2022

Density-preserving Deep Point Cloud Compression

Local density of point clouds is crucial for representing local details,...

0 Yun He, et al. ∙

research

∙ 04/26/2022

One-shot Federated Learning without Server-side Training

Federated Learning (FL) has recently made significant progress as a new ...

0 Shangchao Su, et al. ∙

research

∙ 04/21/2022

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

We study the problem of shape generation in 3D mesh representation from ...

0 Chao Wen, et al. ∙

research

∙ 04/03/2022

DST: Dynamic Substitute Training for Data-free Black-box Attack

With the wide applications of deep neural network models in various comp...

0 Wenxuan Wang, et al. ∙

research

∙ 03/31/2022

ImpDet: Exploring Implicit Fields for 3D Object Detection

Conventional 3D object detection approaches concentrate on bounding boxe...

12 Xuelin Qian, et al. ∙

research

∙ 03/24/2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training

Multiple datasets and open challenges for object detection have been int...

5 Likun Cai, et al. ∙

research

∙ 03/22/2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation

This paper studies the task of conditional Human Motion Animation (cHMA)...

0 Yuxin Hong, et al. ∙

research

∙ 03/02/2022

H4D: Human 4D Modeling by Learning Neural Compositional Representation

Despite the impressive results achieved by deep learning based 3D recons...

8 Boyan Jiang, et al. ∙

research

∙ 02/15/2022

Compositional Scene Representation Learning via Reconstruction: A Survey

Visual scene representation learning is an important research problem in...

0 Jinyang Yuan, et al. ∙

research

∙ 12/30/2021

Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study

The flourishing blossom of deep learning has witnessed the rapid develop...

14 Jingye Chen, et al. ∙

research

∙ 12/28/2021

The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection

Low-cost monocular 3D object detection plays a fundamental role in auton...

12 Zhikang Zou, et al. ∙

research

∙ 12/13/2021

Text Gestalt: Stroke-Aware Scene Text Image Super-Resolution

In the last decade, the blossom of deep learning has witnessed the rapid...

0 Jingye Chen, et al. ∙

research

∙ 12/07/2021

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Visual scenes are extremely rich in diversity, not only because there ar...

0 Jinyang Yuan, et al. ∙

Xiangyang Xue

Featured Co-authors

Sign in with Google

Consider DeepAI Pro