b'Yanwei Fu'

research

∙ 09/18/2023

Unsupervised Open-Vocabulary Object Localization in Videos

In this paper, we show that recent advances in video representation lear...

0 Ke Fan, et al. ∙

research

∙ 09/01/2023

Object-Centric Multiple Object Tracking

Unsupervised object-centric learning methods allow the partitioning of s...

0 Zixu Zhao, et al. ∙

research

∙ 08/31/2023

Coarse-to-Fine Amodal Segmentation with Shape Prior

Amodal object segmentation is a challenging task that involves segmentin...

0 Jianxiong Gao, et al. ∙

research

∙ 08/30/2023

WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model

Enabling robots to understand language instructions and react accordingl...

0 Tianyu Wang, et al. ∙

research

∙ 08/21/2023

Rethinking Person Re-identification from a Projection-on-Prototypes Perspective

Person Re-IDentification (Re-ID) as a retrieval task, has achieved treme...

0 Qizao Wang, et al. ∙

research

∙ 08/21/2023

Exploring Fine-Grained Representation and Recomposition for Cloth-Changing Person Re-Identification

Cloth-changing person Re-IDentification (Re-ID) is a particularly challe...

0 Qizao Wang, et al. ∙

research

∙ 08/06/2023

Local Consensus Enhanced Siamese Network with Reciprocal Loss for Two-view Correspondence Learning

Recent studies of two-view correspondence learning usually establish an ...

0 Linbo Wang, et al. ∙

research

∙ 07/21/2023

PourIt!: Weakly-supervised Liquid Perception from a Single Image for Visual Closed-Loop Robotic Pouring

Liquid perception is critical for robotic pouring tasks. It usually requ...

0 Haitao Lin, et al. ∙

research

∙ 05/26/2023

GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot Attention for Vision-and-Language Navigation

Most existing works solving Room-to-Room VLN problem only utilize RGB im...

0 Jingyang Huo, et al. ∙

research

∙ 05/19/2023

A Unified Prompt-Guided In-Context Inpainting Framework for Reference-based Image Manipulations

Recent advancements in Text-to-Image (T2I) generative models have yielde...

0 Chenjie Cao, et al. ∙

research

∙ 04/24/2023

Grad-PU: Arbitrary-Scale Point Cloud Upsampling via Gradient Descent with Learned Distance Functions

Most existing point cloud upsampling methods have roughly three steps: f...

0 Yun He, et al. ∙

research

∙ 03/26/2023

Semantic Neural Decoding via Cross-Modal Generation

Semantic neural decoding aims to elucidate the cognitive processes of th...

0 Xuelin Qian, et al. ∙

research

∙ 03/26/2023

Learning Versatile 3D Shape Generation with Improved AR Models

Auto-Regressive (AR) models have achieved impressive results in 2D image...

0 Simian Luo, et al. ∙

research

∙ 03/15/2023

Rethinking Optical Flow from Geometric Matching Consistent Perspective

Optical flow estimation is a challenging problem remaining unsolved. Rec...

0 Qiaole Dong, et al. ∙

research

∙ 03/11/2023

Rethinking the Multi-view Stereo from the Perspective of Rendering-based Augmentation

GigaMVS presents several challenges to existing Multi-View Stereo (MVS) ...

0 Chenjie Cao, et al. ∙

research

∙ 03/06/2023

Improving Transformer-based Image Matching by Cascaded Capturing Spatially Informative Keypoints

Learning robust local image feature matching is a fundamental low-level ...

0 Chenjie Cao, et al. ∙

research

∙ 02/22/2023

Entity-Level Text-Guided Image Manipulation

Existing text-guided image manipulation methods aim to modify the appear...

0 Yikai Wang, et al. ∙

research

∙ 02/18/2023

Meta Style Adversarial Training for Cross-Domain Few-Shot Learning

Cross-Domain Few-Shot Learning (CD-FSL) is a recently emerging task that...

0 Yuqian Fu, et al. ∙

research

∙ 01/06/2023

Exploring Efficient Few-shot Adaptation for Vision Transformers

The task of Few-shot Learning (FSL) aims to do the inference on novel ca...

0 Chengming Xu, et al. ∙

research

∙ 01/03/2023

Vocabulary-informed Zero-shot and Open-set Learning

Despite significant progress in object categorization, in recent years, ...

0 Yanwei Fu, et al. ∙

research

∙ 01/02/2023

Knockoffs-SPR: Clean Sample Selection in Learning with Noisy Labels

A noisy training set usually leads to the degradation of the generalizat...

0 Yikai Wang, et al. ∙

research

∙ 11/30/2022

Split-PU: Hardness-aware Training Strategy for Positive-Unlabeled Learning

Positive-Unlabeled (PU) learning aims to learn a model with rare positiv...

0 Chengming Xu, et al. ∙

research

∙ 11/29/2022

PatchMix Augmentation to Identify Causal Features in Few-shot Learning

The task of Few-shot learning (FSL) aims to transfer the knowledge learn...

0 Chengming Xu, et al. ∙

research

∙ 11/28/2022

RankDNN: Learning to Rank for Few-shot Learning

This paper introduces a new few-shot learning pipeline that casts releva...

0 Qianyu Guo, et al. ∙

research

∙ 10/23/2022

Self-supervised Amodal Video Object Segmentation

Amodal perception requires inferring the full shape of an object that is...

0 Jian Yao, et al. ∙

research

∙ 10/12/2022

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors

The image inpainting task fills missing areas of a corrupted image. Desp...

0 Chenjie Cao, et al. ∙

research

∙ 10/11/2022

ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning

Recently, Cross-Domain Few-Shot Learning (CD-FSL) which aims at addressi...

0 Yuqian Fu, et al. ∙

research

∙ 10/07/2022

Specialized Re-Ranking: A Novel Retrieval-Verification Framework for Cloth Changing Person Re-Identification

Cloth changing person re-identification(Re-ID) can work under more compl...

0 Renjie Zhang, et al. ∙

research

∙ 08/18/2022

LoRD: Local 4D Implicit Representation for High-Fidelity Dynamic Human Modeling

Recent progress in 4D implicit representation focuses on globally contro...

5 Boyan Jiang, et al. ∙

research

∙ 08/04/2022

MVSFormer: Multi-View Stereo with Pre-trained Vision Transformers and Temperature-based Depth

Feature representation learning is the key recipe for learning-based Mul...

12 Chenjie Cao, et al. ∙

research

∙ 08/03/2022

Learning Prior Feature and Attention Enhanced Image Inpainting

Many recent inpainting works have achieved impressive results by leverag...

9 Chenjie Cao, et al. ∙

research

∙ 07/19/2022

RCLane: Relay Chain Prediction for Lane Detection

Lane detection is an important component of many real-world autonomous s...

7 Shenghua Xu, et al. ∙

research

∙ 07/19/2022

Visual Representation Learning with Transformer: A Sequence-to-Sequence Perspective

Visual representation learning is the key of solving various vision prob...

10 Li Zhang, et al. ∙

research

∙ 07/17/2022

A Simple Test-Time Method for Out-of-Distribution Detection

Neural networks are known to produce over-confident predictions on input...

0 Ke Fan, et al. ∙

research

∙ 06/17/2022

Local Slot Attention for Vision-and-Language Navigation

Vision-and-language navigation (VLN), a frontier study aiming to pave th...

0 Yifeng Zhuang, et al. ∙

research

∙ 06/07/2022

Wavelet Prior Attention Learning in Axial Inpainting Network

Image inpainting is the task of filling masked or unknown regions of an ...

0 Chenjie Cao, et al. ∙

research

∙ 05/09/2022

Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions

This paper studies the task of any objects grasping from the known categ...

0 Chilam Cheang, et al. ∙

research

∙ 05/09/2022

I Know What You Draw: Learning Grasp Detection Conditioned on a Few Freehand Sketches

In this paper, we are interested in the problem of generating target gra...

0 Haitao Lin, et al. ∙

research

∙ 04/30/2022

ONCE-3DLanes: Building Monocular 3D Lane Detection

We present ONCE-3DLanes, a real-world autonomous driving dataset with la...

10 Fan Yan, et al. ∙

research

∙ 04/27/2022

Density-preserving Deep Point Cloud Compression

Local density of point clouds is crucial for representing local details,...

0 Yun He, et al. ∙

research

∙ 04/22/2022

Reinforcing Generated Images via Meta-learning for One-Shot Fine-Grained Visual Recognition

One-shot fine-grained visual recognition often suffers from the problem ...

0 Satoshi Tsutsui, et al. ∙

research

∙ 04/21/2022

Pixel2Mesh++: 3D Mesh Generation and Refinement from Multi-View Images

We study the problem of shape generation in 3D mesh representation from ...

0 Chao Wen, et al. ∙

research

∙ 04/09/2022

ManiTrans: Entity-Level Text-Guided Image Manipulation via Token-wise Semantic Alignment and Generation

Existing text-guided image manipulation methods aim to modify the appear...

0 Jianan Wang, et al. ∙

research

∙ 04/03/2022

DST: Dynamic Substitute Training for Data-free Black-box Attack

With the wide applications of deep neural network models in various comp...

0 Wenxuan Wang, et al. ∙

research

∙ 03/31/2022

ImpDet: Exploring Implicit Fields for 3D Object Detection

Conventional 3D object detection approaches concentrate on bounding boxe...

12 Xuelin Qian, et al. ∙

research

∙ 03/28/2022

A Framework of Meta Functional Learning for Regularising Knowledge Transfer

Machine learning classifiers' capability is largely dependent on the sca...

7 Pan Li, et al. ∙

research

∙ 03/27/2022

An Empirical Study and Comparison of Recent Few-Shot Object Detection Algorithms

The generic object detection (GOD) task has been successfully tackled by...

0 Tianying Liu, et al. ∙

research

∙ 03/22/2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation

This paper studies the task of conditional Human Motion Animation (cHMA)...

0 Yuxin Hong, et al. ∙

research

∙ 03/15/2022

Scalable Penalized Regression for Noise Detection in Learning with Noisy Labels

Noisy training set usually leads to the degradation of generalization an...

0 Yikai Wang, et al. ∙

research

∙ 03/15/2022

Wave-SAN: Wavelet based Style Augmentation Network for Cross-Domain Few-Shot Learning

Previous few-shot learning (FSL) works mostly are limited to natural ima...

0 Yuqian Fu, et al. ∙

Yanwei Fu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro