Point-, voxel-, and range-views are three representative forms of point ...
Recent advancements in vision foundation models (VFMs) have opened up ne...
Vision foundation models such as Contrastive Vision-Language Pre-trainin...
The recent advances in camera-based bird's eye view (BEV) representation...
The robustness of 3D perception systems under natural corruptions from e...
LiDAR segmentation is crucial for autonomous driving perception. Recent ...
Contrastive language-image pre-training (CLIP) achieves promising result...
Unsupervised video domain adaptation is a practical yet challenging task...
Densely annotating LiDAR point clouds is costly, which restrains the sca...
Transferring knowledge learned from the labeled source domain to the raw...
We unveil a long-standing problem in the prevailing co-saliency detectio...
Recently, a time-varying quadratic programming (QP) framework that descr...
The gesture-determined-dynamic function (GDDF) offers an effective way t...