TimeGate: Conditional Gating of Segments in Long-range Activities

04/03/2020
by   Noureldien Hussein, et al.
14

When recognizing a long-range activity, exploring the entire video is exhaustive and computationally expensive, as it can span up to a few minutes. Thus, it is of great importance to sample only the salient parts of the video. We propose TimeGate, along with a novel conditional gating module, for sampling the most representative segments from the long-range activity. TimeGate has two novelties that address the shortcomings of previous sampling methods, as SCSampler. First, it enables a differentiable sampling of segments. Thus, TimeGate can be fitted with modern CNNs and trained end-to-end as a single and unified model.Second, the sampling is conditioned on both the segments and their context. Consequently, TimeGate is better suited for long-range activities, where the importance of a segment heavily depends on the video context.TimeGate reduces the computation of existing CNNs on three benchmarks for long-range activities: Charades, Breakfast and MultiThumos. In particular, TimeGate reduces the computation of I3D by 50 classification accuracy.

READ FULL TEXT

page 1

page 9

research
03/18/2020

PIC: Permutation Invariant Convolution for Recognizing Long-range Activities

Neural operations as convolutions, self-attention, and vector aggregatio...
research
04/06/2022

ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound

We introduce an audiovisual method for long-range text-to-video retrieva...
research
04/22/2022

ChapterBreak: A Challenge Dataset for Long-Range Language Models

While numerous architectures for long-range language models (LRLMs) have...
research
09/08/2022

Lightweight Long-Range Generative Adversarial Networks

In this paper, we introduce novel lightweight generative adversarial net...
research
12/11/2017

Long-Range Correlation Underlying Childhood Language and Generative Models

Long-range correlation, a property of time series exhibiting long-term m...
research
12/04/2018

Timeception for Complex Action Recognition

This paper focuses on the temporal aspect for recognizing human activiti...
research
08/18/2023

Long-range Multimodal Pretraining for Movie Understanding

Learning computer vision models from (and for) movies has a long-standin...

Please sign up or login with your details

Forgot password? Click here to reset