Movie highlights stand out from the screenplay for efficient browsing and ...
Text-based person retrieval aims to find the query person based on a tex...
Deep convolutional neural network (CNN) based models are vulnerable to t...
A long-term video, such as a movie or TV show, is composed of various sc...
Generative Adversarial Networks (GANs) have been widely adopted in vario...
The vanilla convolutional neural network (CNN) is vulnerable to images w...