This work aims to improve the efficiency of vision transformers (ViT). W...
High-resolution images are widely adopted for high-performance object
de...
In this paper, we propose a conditional early exiting framework for effi...
Time-aware encoding of frame sequences in a video is a fundamental probl...
This paper strives for pixel-level segmentation of actors and their acti...
In this paper, a new method for generating object and action proposals i...
In online action detection, the goal is to detect the start of an action...
We propose a function-based temporal pooling method that captures the la...
In this paper we evaluate the quality of the activation layers of a
conv...