ST-DETR: Spatio-Temporal Object Traces Attention Detection Transformer

07/13/2021
by   Eslam Mohamed, et al.
0

We propose ST-DETR, a Spatio-Temporal Transformer-based architecture for object detection from a sequence of temporal frames. We treat the temporal frames as sequences in both space and time and employ the full attention mechanisms to take advantage of the features correlations over both dimensions. This treatment enables us to deal with frames sequence as temporal object features traces over every location in the space. We explore two possible approaches; the early spatial features aggregation over the temporal dimension, and the late temporal aggregation of object query spatial features. Moreover, we propose a novel Temporal Positional Embedding technique to encode the time sequence information. To evaluate our approach, we choose the Moving Object Detection (MOD)task, since it is a perfect candidate to showcase the importance of the temporal dimension. Results show a significant 5 KITTI MOD dataset over the 1-step spatial baseline.

READ FULL TEXT

page 1

page 3

research
06/21/2021

Spatio-Temporal Multi-Task Learning Transformer for Joint Moving Object Detection and Segmentation

Moving objects have special importance for Autonomous Driving tasks. Det...
research
03/30/2022

TubeDETR: Spatio-Temporal Video Grounding with Transformers

We consider the problem of localizing a spatio-temporal tube in a video ...
research
06/23/2020

An Efficient Index for Contact Tracing Query in a Large Spatio-Temporal Database

In this paper, we study a novel contact tracing query (CTQ) that finds u...
research
10/02/2019

Object Parsing in Sequences Using CoordConv Gated Recurrent Networks

We present a monocular object parsing framework for consistent keypoint ...
research
06/16/2021

Grounding Spatio-Temporal Language with Transformers

Language is an interface to the outside world. In order for embodied age...
research
01/03/2023

Semi-Structured Object Sequence Encoders

In this paper we explore the task of modeling (semi) structured object s...
research
09/04/2022

Hierarchical Transformer with Spatio-Temporal Context Aggregation for Next Point-of-Interest Recommendation

Next point-of-interest (POI) recommendation is a critical task in locati...

Please sign up or login with your details

Forgot password? Click here to reset