Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer

11/09/2022
by   Siddharth Sagar Nijhawan, et al.
0

We propose a light-weight and highly efficient Joint Detection and Tracking pipeline for the task of Multi-Object Tracking using a fully-transformer architecture. It is a modified version of TransTrack, which overcomes the computational bottleneck associated with its design, and at the same time, achieves state-of-the-art MOTA score of 73.20 transformer based backbone instead of CNN, which is highly scalable with the input resolution. We also propose a drop-in replacement for Feed Forward Network of transformer encoder layer, by using Butterfly Transform Operation to perform channel fusion and depth-wise convolution to learn spatial context within the feature maps, otherwise missing within the attention maps of the transformer. As a result of our modifications, we reduce the overall model size of TransTrack by 58.73 design to provide novel perspectives for architecture optimization in future research related to multi-object tracking.

READ FULL TEXT

page 3

page 4

research
12/31/2020

TransTrack: Multiple-Object Tracking with Transformer

Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...
research
04/01/2021

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking

Tracking multiple objects in videos relies on modeling the spatial-tempo...
research
03/06/2023

Referring Multi-Object Tracking

Existing referring understanding tasks tend to involve the detection of ...
research
10/08/2022

Towards Light Weight Object Detection System

Transformers are a popular choice for classification tasks and as backbo...
research
08/10/2022

Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with Transformer

With the prevalence of LiDAR sensors in autonomous driving, 3D object tr...
research
11/26/2019

Multi-Object Portion Tracking in 4D Fluorescence Microscopy Imagery with Deep Feature Maps

3D fluorescence microscopy of living organisms has increasingly become a...
research
03/10/2022

Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking

Exploiting a general-purpose neural architecture to replace hand-wired d...

Please sign up or login with your details

Forgot password? Click here to reset