Transformer-based pretrained language models (PLMs) have achieved great
...
Deep Neural Networks (DNNs) have significantly improved the accuracy of
...
Reconstructing interacting hands from a single RGB image is a very
chall...
Temporal grounding aims to locate a target video moment that semanticall...
Recent researches in artificial intelligence have proposed versatile
con...