TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios

08/26/2021
by   Xingkui Zhu, et al.
12

Object detection on drone-captured scenarios is a recent popular task. As drones always navigate in different altitudes, the object scale varies violently, which burdens the optimization of networks. Moreover, high-speed and low-altitude flight bring in the motion blur on the densely packed objects, which leads to great challenge of object distinction. To solve the two issues mentioned above, we propose TPH-YOLOv5. Based on YOLOv5, we add one more prediction head to detect different-scale objects. Then we replace the original prediction heads with Transformer Prediction Heads (TPH) to explore the prediction potential with self-attention mechanism. We also integrate convolutional block attention model (CBAM) to find attention region on scenarios with dense objects. To achieve more improvement of our proposed TPH-YOLOv5, we provide bags of useful strategies such as data augmentation, multiscale testing, multi-model integration and utilizing extra classifier. Extensive experiments on dataset VisDrone2021 show that TPH-YOLOv5 have good performance with impressive interpretability on drone-captured scenarios. On DET-test-challenge dataset, the AP result of TPH-YOLOv5 are 39.18 better than previous SOTA method (DPNetV3) by 1.81 2021, TPHYOLOv5 wins 5th place and achieves well-matched results with 1st place model (AP 39.43 about 7

READ FULL TEXT

page 1

page 4

page 6

page 8

research
07/03/2022

Dynamic boxes fusion strategy in object detection

Object detection on microscopic scenarios is a popular task. As microsco...
research
11/16/2020

Drone LAMS: A Drone-based Face Detection Dataset with Large Angles and Many Scenarios

This work presented a new drone-based face detection dataset Drone LAMS ...
research
11/21/2020

Rethinking Transformer-based Set Prediction for Object Detection

DETR is a recently proposed Transformer-based method which views object ...
research
07/17/2020

2nd Place Solution to ECCV 2020 VIPriors Object Detection Challenge

In this report, we descibe our approach to the ECCV 2020 VIPriors Object...
research
06/15/2021

Dynamic Head: Unifying Object Detection Heads with Attentions

The complex nature of combining localization and classification in objec...
research
12/19/2020

Dense Multiscale Feature Fusion Pyramid Networks for Object Detection in UAV-Captured Images

Although much significant progress has been made in the research field o...
research
10/21/2022

Automatic Cattle Identification using YOLOv5 and Mosaic Augmentation: A Comparative Analysis

You Only Look Once (YOLO) is a single-stage object detection model popul...

Please sign up or login with your details

Forgot password? Click here to reset