LSTA: Long Short-Term Attention for Egocentric Action Recognition

11/26/2018
by Swathikiran Sudhakaran, et al.

Egocentric activity recognition is one of the most challenging tasks in video analysis. It requires fine-grained discrimination of small objects and their manipulation. While some methods rely on strong supervision and attention mechanisms, they are either annotation-intensive or do not take spatio-temporal patterns into account. In this paper we propose LSTA as a mechanism to focus on features from spatially relevant parts while attention is tracked smoothly across the video sequence. We demonstrate the effectiveness of LSTA on egocentric activity recognition with an end-to-end trainable two-stream architecture, achieving state-of-the-art performance on four standard benchmarks.

research
07/31/2018

Attention is All We Need: Nailing Down Object-centric Attention for Egocentric Activity Recognition

In this paper we propose an end-to-end trainable deep neural network mod...
research
09/16/2020

Multi-Label Activity Recognition using Activity-specific Features

We introduce an approach to multi-label activity recognition by extracti...
research
08/13/2019

Three Branches: Detecting Actions With Richer Features

We present our three branch solutions for International Challenge on Act...
research
08/19/2013

Seeing What You're Told: Sentence-Guided Activity Recognition In Video

We present a system that demonstrates how the compositional structure of...
research
05/11/2019

Follow the Attention: Combining Partial Pose and Object Motion for Fine-Grained Action Detection

Activity recognition in shopping environments is an important and challe...
research
12/01/2020

A compact sequence encoding scheme for online human activity recognition in HRI applications

Human activity recognition and analysis has always been one of the most ...
research
05/14/2023

Is end-to-end learning enough for fitness activity recognition?

End-to-end learning has taken hold of many computer vision tasks, in par...
