Dongfang Liu

research

∙ 07/25/2023

E^2VPT: An Effective and Efficient Approach for Visual Prompt Tuning

As the size of transformer-based models continues to grow, fine-tuning t...

0 Cheng Han, et al. ∙

research

∙ 05/03/2023

CLUSTSEG: Clustering for Universal Segmentation

We present CLUSTSEG, a general, transformer-based framework that tackles...

0 James Liang, et al. ∙

research

∙ 04/28/2023

Fusion is Not Enough: Single-Modal Attacks to Compromise Fusion Models in Autonomous Driving

Multi-sensor fusion (MSF) is widely adopted for perception in autonomous...

3 Zhiyuan Cheng, et al. ∙

research

∙ 04/23/2023

TransFlow: Transformer as Flow Learner

Optical flow is an indispensable building block for various important co...

0 Yawen Lu, et al. ∙

research

∙ 04/12/2023

Exploiting Logic Locking for a Neural Trojan Attack on Machine Learning Accelerators

Logic locking has been proposed to safeguard intellectual property (IP) ...

0 Hongye Xu, et al. ∙

research

∙ 01/31/2023

Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks

Monocular Depth Estimation (MDE) is a critical component in applications...

1 Zhiyuan Cheng, et al. ∙

research

∙ 10/03/2022

Learning Equivariant Segmentation with Instance-Unique Querying

Prevalent state-of-the-art instance segmentation methods fall into a que...

0 Wenguan Wang, et al. ∙

research

∙ 09/15/2022

Visual Recognition with Deep Nearest Centroids

We devise deep nearest centroids (DNC), a conceptually elegant yet surpr...

6 Wenguan Wang, et al. ∙

research

∙ 08/19/2022

Towards Unbiased Label Distribution Learning for Facial Pose Estimation Using Anisotropic Spherical Gaussian

Facial pose estimation refers to the task of predicting face orientation...

4 Zhiwen Cao, et al. ∙

research

∙ 07/11/2022

Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches

Deep learning has substantially boosted the performance of Monocular Dep...

0 Zhiyuan Cheng, et al. ∙

research

∙ 05/22/2022

GL-RG: Global-Local Representation Granularity for Video Captioning

Video captioning is a challenging task as it needs to accurately transfo...

4 Liqi Yan, et al. ∙

research

∙ 03/05/2022

Deep Partial Multiplex Network Embedding

Network embedding is an effective technique to learn the low-dimensional...

5 Qifan Wang, et al. ∙

research

∙ 02/01/2022

WebFormer: The Web-page Transformer for Structure Information Extraction

Structure information extraction refers to the task of extracting struct...

16 Qifan Wang, et al. ∙

research

∙ 10/15/2021

DG-Labeler and DGL-MOTS Dataset: Boost the Autonomous Driving Perception

Multi-object tracking and segmentation (MOTS) is a critical task for aut...

0 Yiming Cui, et al. ∙

research

∙ 08/12/2021

TF-Blender: Temporal Feature Blender for Video Object Detection

Video objection detection is a challenging task because isolated video f...

0 Yiming Cui, et al. ∙

research

∙ 03/18/2021

SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation

Video instance segmentation (VIS) is a new and critical task in computer...

4 Dongfang Liu, et al. ∙

research

∙ 02/18/2021

Hierarchical Attention Fusion for Geo-Localization

Geo-localization is a critical task in computer vision. In this work, we...

0 Liqi Yan, et al. ∙

research

∙ 12/04/2020

DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation

In this work, we introduce a Denser Feature Network (DenserNet) for visu...

1 Dongfang Liu, et al. ∙

research

∙ 10/14/2020

A Vector-based Representation to Enhance Head Pose Estimation

This paper proposes to use the three vectors in a rotation matrix as the...

0 Zhiwen Cao, et al. ∙

research

∙ 09/01/2020

Multimodal Aggregation Approach for Memory Vision-Voice Indoor Navigation with Meta-Learning

Vision and voice are two vital keys for agents' interaction and learning...

0 Liqi Yan, et al. ∙

research

∙ 08/13/2020

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze

Accurate localization is a foundational capacity, required for autonomou...

0 Dongfang Liu, et al. ∙

Dongfang Liu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro