DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

12/07/2022
by   Santosh Kumar Yadav, et al.
0

Human activity recognition (HAR) using drone-mounted cameras has attracted considerable interest from the computer vision research community in recent years. A robust and efficient HAR system has a pivotal role in fields like video surveillance, crowd behavior analysis, sports analysis, and human-computer interaction. What makes it challenging are the complex poses, understanding different viewpoints, and the environmental scenarios where the action is taking place. To address such complexities, in this paper, we propose a novel Sparse Weighted Temporal Attention (SWTA) module to utilize sparsely sampled video frames for obtaining global weighted temporal attention. The proposed SWTA is comprised of two parts. First, temporal segment network that sparsely samples a given set of frames. Second, weighted temporal attention, which incorporates a fusion of attention maps derived from optical flow, with raw RGB images. This is followed by a basenet network, which comprises a convolutional neural network (CNN) module along with fully connected layers that provide us with activity recognition. The SWTA network can be used as a plug-in module to the existing deep CNN architectures, for optimizing them to learn temporal information by eliminating the need for a separate temporal stream. It has been evaluated on three publicly available benchmark datasets, namely Okutama, MOD20, and Drone-Action. The proposed model has received an accuracy of 72.76 surpassing the previous state-of-the-art performances by a margin of 25.26 18.56

READ FULL TEXT

page 3

page 4

page 12

page 23

page 24

research
11/10/2022

SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

Drone-camera based human activity recognition (HAR) has received signifi...
research
08/22/2017

Activity Recognition based on a Magnitude-Orientation Stream Network

The temporal component of videos provides an important clue for activity...
research
03/21/2022

Fourier Disentangled Space-Time Attention for Aerial Video Recognition

We present an algorithm, Fourier Activity Recognition (FAR), for UAV vid...
research
12/16/2018

Towards Robust Human Activity Recognition from RGB Video Stream with Limited Labeled Data

Human activity recognition based on video streams has received numerous ...
research
04/28/2015

Compact CNN for Indexing Egocentric Videos

While egocentric video is becoming increasingly popular, browsing it is ...
research
02/22/2018

Glimpse Clouds: Human Activity Recognition from Unstructured Feature Points

We propose a method for human activity recognition from RGB data which d...

Please sign up or login with your details

Forgot password? Click here to reset