Fine-grained Video Categorization with Redundancy Reduction Attention

10/26/2018
by Chen Zhu, et al.

For fine-grained categorization tasks, videos can be a better source than static images because they have a higher chance of containing discriminative patterns. However, a video sequence can also contain many redundant and irrelevant frames, and locating the critical information of interest is challenging. In this paper, we propose a new network structure, Redundancy Reduction Attention (RRA), which learns to focus on multiple discriminative patterns by suppressing redundant feature channels. Specifically, it first summarizes the video by taking a weighted sum of all feature vectors in the feature maps of selected frames, using a spatio-temporal soft attention, and then predicts which channels to suppress or enhance from this summary with a learned non-linear transform. Suppression is achieved by modulating the feature maps and thresholding out weak activations. The updated feature maps are then used in the next iteration. Finally, the video is classified based on the multiple summaries. The proposed method achieves outstanding performance on multiple video classification datasets. Furthermore, we have collected two large-scale video datasets, YouTube-Birds and YouTube-Cars, for future research on fine-grained video categorization. The datasets are available at http://www.cs.umd.edu/ chenzhu/fgvc.
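
The iteration described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the linear attention scorer `w_att`, the two-layer transform (`W1`, `W2`), the additive form of the channel modulation, the use of ReLU as the threshold, and the concatenation of summaries before a linear classifier `W_cls` are all assumptions made for illustration, and the function names are hypothetical.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over a 1-D array.
    e = np.exp(x - x.max())
    return e / e.sum()

def rra_step(feats, w_att, W1, W2):
    """One attention iteration on feats of shape (N, C),
    where N = frames * height * width and C = channels."""
    # Spatio-temporal soft attention: one weight per location over all frames.
    att = softmax(feats @ w_att)              # (N,)
    # Summary: attention-weighted sum of all feature vectors.
    summary = att @ feats                     # (C,)
    # Assumed non-linear transform predicting a per-channel offset
    # (negative values suppress a channel, positive values enhance it).
    offset = W2 @ np.tanh(W1 @ summary)       # (C,)
    # Modulate the feature maps, then threshold out weak activations.
    updated = np.maximum(feats + offset, 0.0) # (N, C)
    return summary, updated

def rra_classify(feats, params, W_cls):
    """Run several iterations and classify from all summaries."""
    summaries = []
    for w_att, W1, W2 in params:
        summary, feats = rra_step(feats, w_att, W1, W2)
        summaries.append(summary)
    # Assumption: summaries are concatenated and fed to a linear classifier.
    probs = softmax(W_cls @ np.concatenate(summaries))
    return probs, summaries, feats

# Toy usage with random weights: 4 frames of 3x3 feature maps, 8 channels.
rng = np.random.default_rng(0)
N, C, H, K, ITERS = 4 * 3 * 3, 8, 6, 5, 3
feats = np.maximum(rng.standard_normal((N, C)), 0.0)
params = [
    (rng.standard_normal(C),
     rng.standard_normal((H, C)),
     rng.standard_normal((C, H)))
    for _ in range(ITERS)
]
W_cls = rng.standard_normal((K, C * ITERS))
probs, summaries, final_feats = rra_classify(feats, params, W_cls)
```

Because each iteration receives feature maps in which some channels have already been suppressed, later attention steps are pushed toward patterns the earlier summaries did not capture, which is the intuition behind using multiple summaries.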


research
10/17/2022

Cross-layer Attention Network for Fine-grained Visual Categorization

Learning discriminative representations for subtle localized details pla...
research
02/15/2021

VA-RED^2: Video Adaptive Redundancy Reduction

Performing inference on deep learning models for videos remains a challe...
research
04/11/2021

Fine-Grained Attention for Weakly Supervised Object Localization

Although recent advances in deep learning accelerated an improvement in ...
research
05/06/2019

Fine-grained Attention-based Video Face Recognition

This paper aims to learn a compact representation of a video for video f...
research
11/27/2018

Generating Attention from Classifier Activations for Fine-grained Recognition

Recent advances in fine-grained recognition utilize attention maps to lo...
research
04/21/2022

R2-Trans:Fine-Grained Visual Categorization with Redundancy Reduction

Fine-grained visual categorization (FGVC) aims to discriminate similar s...
research
01/31/2023

Sport Task: Fine Grained Action Detection and Classification of Table Tennis Strokes from Videos for MediaEval 2022

Sports video analysis is a widespread research topic. Its applications a...
