Masked image modeling (MIM) has shown great promise for self-supervised
...
Vision-language pre-training (VLP) relying on large-scale pre-training
d...
Skeleton-based action recognition task is entangled with complex
spatio-...
Gesture recognition is a challenging problem in the field of biometrics....