Yongming Rao

research

∙ 09/21/2023

TCOVIS: Temporally Consistent Online Video Instance Segmentation

In recent years, significant progress has been made in video instance se...

0 Junlong Li, et al. ∙

research

∙ 07/27/2023

Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

With the overwhelming trend of mask image modeling led by MAE, generativ...

0 Ziyi Wang, et al. ∙

research

∙ 03/03/2023

Unleashing Text-to-Image Diffusion Models for Visual Perception

Diffusion models (DMs) have become the new trend of generative models an...

0 Wenliang Zhao, et al. ∙

research

∙ 02/09/2023

UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Diffusion probabilistic models (DPMs) have demonstrated a very promising...

0 Wenliang Zhao, et al. ∙

research

∙ 01/11/2023

AdaPoinTr: Diverse Point Cloud Completion with Adaptive Geometry-Aware Transformers

In this paper, we present a new method that reformulates point cloud com...

0 Xumin Yu, et al. ∙

research

∙ 12/09/2022

FLAG3D: A 3D Fitness Activity Dataset with Language Instruction

With the continuously thriving popularity around the world, fitness acti...

0 Yansong Tang, et al. ∙

research

∙ 08/04/2022

P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting

Nowadays, pre-training big models on large-scale datasets has become a c...

0 Ziyi Wang, et al. ∙

research

∙ 07/28/2022

HorNet: Efficient High-Order Spatial Interactions with Recursive Gated Convolutions

Recent progress in vision Transformers exhibits great success in various...

10 Yongming Rao, et al. ∙

research

∙ 07/04/2022

Dynamic Spatial Sparsification for Efficient Vision Transformers and Convolutional Neural Networks

In this paper, we present a new approach for model acceleration by explo...

9 Yongming Rao, et al. ∙

research

∙ 05/26/2022

SemAffiNet: Semantic-Affine Transformation for Point Cloud Segmentation

Conventional point cloud semantic segmentation methods usually employ an...

0 Ziyi Wang, et al. ∙

research

∙ 04/07/2022

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment

Most existing action quality assessment methods rely on the deep feature...

2 Jinglin Xu, et al. ∙

research

∙ 04/07/2022

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation

Depth estimation from images serves as the fundamental step of 3D percep...

1 Yi Wei, et al. ∙

research

∙ 03/28/2022

LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection

In this paper, we propose the LiDAR Distillation to bridge the domain ga...

23 Yi Wei, et al. ∙

research

∙ 03/25/2022

Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion

Human behavior has the nature of indeterminacy, which requires the pedes...

24 Tianpei Gu, et al. ∙

research

∙ 12/22/2021

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results

As real-scanned point clouds are mostly partial due to occlusions and vi...

18 Liang Pan, et al. ∙

research

∙ 12/02/2021

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Recent progress has shown that large-scale pre-training using contrastiv...

0 Yongming Rao, et al. ∙

research

∙ 11/29/2021

Point-BERT: Pre-training 3D Point Cloud Transformers with Masked Point Modeling

We present Point-BERT, a new paradigm for learning Transformers to gener...

0 Xumin Yu, et al. ∙

research

∙ 09/26/2021

Structure-Preserving Image Super-Resolution

Structures matter in single image super-resolution (SISR). Benefiting fr...

0 Cheng Ma, et al. ∙

research

∙ 09/02/2021

NerfingMVS: Guided Optimization of Neural Radiance Fields for Indoor Multi-view Stereo

In this work, we present a new multi-view depth estimation method that u...

1 Yi Wei, et al. ∙

research

∙ 08/19/2021

PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

Point clouds captured in real-world applications are often incomplete du...

0 Xumin Yu, et al. ∙

research

∙ 08/19/2021

Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Attention mechanism has demonstrated great potential in fine-grained vis...

0 Yongming Rao, et al. ∙

research

∙ 08/17/2021

Group-aware Contrastive Regression for Action Quality Assessment

Assessing action quality is challenging due to the subtle differences be...

0 Xumin Yu, et al. ∙

research

∙ 08/17/2021

RandomRooms: Unsupervised Pre-training from Synthetic Shapes and Randomized Layouts for 3D Object Detection

3D point cloud understanding has made great progress in recent years. Ho...

12 Yongming Rao, et al. ∙

research

∙ 08/12/2021

Towards Interpretable Deep Metric Learning with Structural Matching

How do the neural networks distinguish two images? It is of critical imp...

52 Wenliang Zhao, et al. ∙

research

∙ 07/01/2021

Global Filter Networks for Image Classification

Recent advances in self-attention and pure multi-layer perceptrons (MLP)...

0 Yongming Rao, et al. ∙

research

∙ 06/03/2021

DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

Attention is sparse in vision transformers. We observe the final predict...

12 Yongming Rao, et al. ∙

research

∙ 12/02/2020

PV-RAFT: Point-Voxel Correlation Fields for Scene Flow Estimation of Point Clouds

In this paper, we propose Point-Voxel Recurrent All-Pairs Field Transfor...

6 Yi Wei, et al. ∙

research

∙ 08/27/2020

MetaDistiller: Network Self-Boosting via Meta-Learned Top-Down Distillation

Knowledge Distillation (KD) has been one of the most popu-lar methods to...

0 Benlin Liu, et al. ∙

research

∙ 03/29/2020

Structure-Preserving Super Resolution with Gradient Guidance

Structures matter in single image super resolution (SISR). Recent studie...

0 Cheng Ma, et al. ∙

research

∙ 03/29/2020

Deep Face Super-Resolution with Iterative Collaboration between Attentive Recovery and Landmark Estimation

Recent works based on deep learning and facial priors have succeeded in ...

21 Cheng Ma, et al. ∙

research

∙ 03/29/2020

Global-Local Bidirectional Reasoning for Unsupervised Representation Learning of 3D Point Clouds

Local and global patterns of an object are closely related. Although eac...

0 Yongming Rao, et al. ∙

research

∙ 12/19/2019

P^2GNet: Pose-Guided Point Cloud Generating Networks for 6-DoF Object Pose Estimation

Humans are able to perform fast and accurate object pose estimation even...

0 Peiyu Yu, et al. ∙

research

∙ 03/07/2019

COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis

There are substantial instructional videos on the Internet, which enable...

0 Yansong Tang, et al. ∙

Yongming Rao

Featured Co-authors

Sign in with Google

Consider DeepAI Pro