Vision-language pre-training (VLP) relying on large-scale pre-training d...
Historically, lower-level tasks such as automatic speech recognition (ASR...
Panoramic segmentation is a scene where image segmentation tasks are more...
A fine-grained provenance-based access control policy model is proposed ...
Face detection has witnessed significant progress due to the advances of...