Fourier Disentangled Space-Time Attention for Aerial Video Recognition

03/21/2022
by   Divya Kothandaraman, et al.
2

We present an algorithm, Fourier Activity Recognition (FAR), for UAV video activity recognition. Our formulation uses a novel Fourier object disentanglement method to innately separate out the human agent (which is typically small) from the background. Our disentanglement technique operates in the frequency domain to characterize the extent of temporal change of spatial pixels, and exploits convolution-multiplication properties of Fourier transform to map this representation to the corresponding object-background entangled features obtained from the network. To encapsulate contextual information and long-range space-time dependencies, we present a novel Fourier Attention algorithm, which emulates the benefits of self-attention by modeling the weighted outer product in the frequency domain. Our Fourier attention formulation uses much fewer computations than self-attention. We have evaluated our approach on multiple UAV datasets including UAV Human RGB, UAV Human Night, Drone Action, and NEC Drone. We demonstrate a relative improvement of 8.02 38.69

READ FULL TEXT

page 11

page 18

page 19

research
09/15/2022

Differentiable Frequency-based Disentanglement for Aerial Video Action Recognition

We present a learning algorithm for human activity recognition in videos...
research
12/07/2022

DroneAttention: Sparse Weighted Temporal Attention for Drone-Camera Based Activity Recognition

Human activity recognition (HAR) using drone-mounted cameras has attract...
research
05/27/2021

SSAN: Separable Self-Attention Network for Video Representation Learning

Self-attention has been successfully applied to video representation lea...
research
11/10/2022

SWTF: Sparse Weighted Temporal Fusion for Drone-Based Activity Recognition

Drone-camera based human activity recognition (HAR) has received signifi...
research
03/05/2023

MITFAS: Mutual Information based Temporal Feature Alignment and Sampling for Aerial Video Action Recognition

We present a novel approach for action recognition in UAV videos. Our fo...
research
12/06/2018

Tri-axial Self-Attention for Concurrent Activity Recognition

We present a system for concurrent activity recognition. To extract feat...
research
01/17/2021

Coarse Temporal Attention Network (CTA-Net) for Driver's Activity Recognition

There is significant progress in recognizing traditional human activitie...

Please sign up or login with your details

Forgot password? Click here to reset