Towards Light Weight Object Detection System

10/08/2022
by   Dharma KC, et al.
0

Transformers are a popular choice for classification tasks and as backbones for object detection tasks. However, their high latency brings challenges in their adaptation to lightweight object detection systems. We present an approximation of the self-attention layers used in the transformer architecture. This approximation reduces the latency of the classification system while incurring minimal loss in accuracy. We also present a method that uses a transformer encoder layer for multi-resolution feature fusion. This feature fusion improves the accuracy of the state-of-the-art lightweight object detection system without significantly increasing the number of parameters. Finally, we provide an abstraction for the transformer architecture called Generalized Transformer (gFormer) that can guide the design of novel transformer-like architectures.

READ FULL TEXT

page 2

page 3

research
12/26/2021

Miti-DETR: Object Detection based on Transformers with Mitigatory Self-Attention Convergence

Object Detection with Transformers (DETR) and related works reach or eve...
research
03/29/2021

Monitoring Object Detection Abnormalities via Data-Label and Post-Algorithm Abstractions

While object detection modules are essential functionalities for any aut...
research
12/13/2022

CNN-transformer mixed model for object detection

Object detection, one of the three main tasks of computer vision, has be...
research
11/18/2020

End-to-End Object Detection with Adaptive Clustering Transformer

End-to-end Object Detection with Transformer (DETR)proposes to perform o...
research
06/14/2023

Unraveling the ARC Puzzle: Mimicking Human Solutions with Object-Centric Decision Transformer

In the pursuit of artificial general intelligence (AGI), we tackle Abstr...
research
11/09/2022

Efficient Joint Detection and Multiple Object Tracking with Spatially Aware Transformer

We propose a light-weight and highly efficient Joint Detection and Track...
research
11/21/2020

Rethinking Transformer-based Set Prediction for Object Detection

DETR is a recently proposed Transformer-based method which views object ...

Please sign up or login with your details

Forgot password? Click here to reset