Heavy occlusion and dense gathering in crowd scenes make pedestrian detec...
Feature distillation is an effective way to improve the performance for ...
The non-local module is designed for capturing long-range spatio-tempora...
For fine-grained categorization tasks, videos could serve as a better so...