Vision foundation models such as Contrastive Vision-Language Pre-trainin...
We investigate transductive zero-shot point cloud semantic segmentation ...
Creating 3D shapes from 2D drawings is an important problem with applica...
We propose a method for self-supervised image representation learning un...
Promising performance has been achieved for visual perception on the poi...
Personalized video highlight detection aims to shorten a long video to
i...
We study the problem of weakly supervised grounded image captioning. Tha...
Detecting 3D landmarks on cone-beam computed tomography (CBCT) is crucia...
We introduce CurveFusion, the first approach for high quality scanning o...
Well-annotated medical images are costly and sometimes even impossible t...
We introduce Point2Skeleton, an unsupervised method to learn skeletal
re...
We propose an unsupervised learning framework with the pretext task of
f...
Thin structures, such as wire-frame sculptures, fences, cables, power li...
Thin structures, such as wire-frame sculptures, fences, cables, power li...
Learning structures of 3D shapes is a fundamental problem in the field o...
Marking anatomical landmarks in cephalometric radiography is a critical
...