In recent years, significant progress has been made in video instance
se...
With the overwhelming trend of mask image modeling led by MAE, generativ...
Diffusion models (DMs) have become the new trend of generative models an...
Diffusion probabilistic models (DPMs) have demonstrated a very promising...
In this paper, we present a new method that reformulates point cloud
com...
With the continuously thriving popularity around the world, fitness acti...
Nowadays, pre-training big models on large-scale datasets has become a
c...
Recent progress in vision Transformers exhibits great success in various...
In this paper, we present a new approach for model acceleration by explo...
Conventional point cloud semantic segmentation methods usually employ an...
Most existing action quality assessment methods rely on the deep feature...
Depth estimation from images serves as the fundamental step of 3D percep...
In this paper, we propose the LiDAR Distillation to bridge the domain ga...
Human behavior has the nature of indeterminacy, which requires the pedes...
As real-scanned point clouds are mostly partial due to occlusions and
vi...
Recent progress has shown that large-scale pre-training using contrastiv...
We present Point-BERT, a new paradigm for learning Transformers to gener...
Structures matter in single image super-resolution (SISR). Benefiting fr...
In this work, we present a new multi-view depth estimation method that
u...
Point clouds captured in real-world applications are often incomplete du...
Attention mechanism has demonstrated great potential in fine-grained vis...
Assessing action quality is challenging due to the subtle differences be...
3D point cloud understanding has made great progress in recent years.
Ho...
How do the neural networks distinguish two images? It is of critical
imp...
Recent advances in self-attention and pure multi-layer perceptrons (MLP)...
Attention is sparse in vision transformers. We observe the final predict...
In this paper, we propose Point-Voxel Recurrent All-Pairs Field Transfor...
Knowledge Distillation (KD) has been one of the most popu-lar methods to...
Structures matter in single image super resolution (SISR). Recent studie...
Recent works based on deep learning and facial priors have succeeded in
...
Local and global patterns of an object are closely related. Although eac...
Humans are able to perform fast and accurate object pose estimation even...
There are substantial instructional videos on the Internet, which enable...