Despite promising advances in deep learning-based MRI reconstruction met...
To bring digital avatars into people's lives, there is strong demand to e...
We present a simple yet effective end-to-end Video-language Pre-training...
Music is essential when editing videos, but selecting music manually is ...
Providing quality-constant streams can simultaneously guarantee user exp...
Given data on choices made by consumers for different assortments, a key...
In this report, we present the ReLER@ZJU-Alibaba submission to the Ego4D...
Skeleton extraction is a task focused on providing a simple representati...
In 3D face reconstruction, orthogonal projection has been widely employe...
The task of Human-Object Interaction (HOI) detection could be divided in...
The development of the online economy has raised demand for generating ima...
Two-stage methods have dominated Human-Object Interaction (HOI) detectio...
Video affective understanding, which aims to predict the evoked expressi...
This paper introduces a dual-critic reinforcement learning (RL) framewor...
This paper introduces the real image Super-Resolution (SR) challenge tha...
Few-shot Learning (FSL), which aims to learn from few labeled training da...
Visual secret sharing (VSS) is an encryption technique that utilizes hu...