Class prototype construction and matching are core aspects of few-shot a...
Applying large-scale pre-trained visual models like CLIP to few-shot act...
Spatial and temporal modeling is one of the core aspects of few-sho...
The canonical approach to video action recognition dictates a neural mod...