DEYOv3: DETR with YOLO for Real-time Object Detection

09/21/2023
by   Haodong Ouyang, et al.
0

Recently, end-to-end object detectors have gained significant attention from the research community due to their outstanding performance. However, DETR typically relies on supervised pretraining of the backbone on ImageNet, which limits the practical application of DETR and the design of the backbone, affecting the model's potential generalization ability. In this paper, we propose a new training method called step-by-step training. Specifically, in the first stage, the one-to-many pre-trained YOLO detector is used to initialize the end-to-end detector. In the second stage, the backbone and encoder are consistent with the DETR-like model, but only the detector needs to be trained from scratch. Due to this training method, the object detector does not need the additional dataset (ImageNet) to train the backbone, which makes the design of the backbone more flexible and dramatically reduces the training cost of the detector, which is helpful for the practical application of the object detector. At the same time, compared with the DETR-like model, the step-by-step training method can achieve higher accuracy than the traditional training method of the DETR-like model. With the aid of this novel training method, we propose a brand-new end-to-end real-time object detection model called DEYOv3. DEYOv3-N achieves 41.1 while DEYOv3-L achieves 51.3 training data, DEYOv3 surpasses all existing real-time object detectors in terms of both speed and accuracy. It is worth noting that for models of N, S, and M scales, the training on the COCO dataset can be completed using a single 24GB RTX3090 GPU.

READ FULL TEXT

page 3

page 6

page 7

page 8

research
03/26/2019

DetNAS: Neural Architecture Search on Object Detection

Object detectors are usually equipped with networks designed for image c...
research
09/24/2020

MimicDet: Bridging the Gap Between One-Stage and Two-Stage Object Detection

Modern object detection methods can be divided into one-stage approaches...
research
04/17/2023

DETRs Beat YOLOs on Real-time Object Detection

Recently, end-to-end transformer-based detectors (DETRs) have achieved r...
research
09/24/2019

Fast and Accurate Convolutional Object Detectors for Real-time Embedded Platforms

With the improvements in the object detection networks, several variatio...
research
04/01/2022

Proper Reuse of Image Classification Features Improves Object Detection

A common practice in transfer learning is to initialize the downstream m...
research
09/11/2023

An Effective Two-stage Training Paradigm Detector for Small Dataset

Learning from the limited amount of labeled data to the pre-train model ...
research
07/28/2023

SimDETR: Simplifying self-supervised pretraining for DETR

DETR-based object detectors have achieved remarkable performance but are...

Please sign up or login with your details

Forgot password? Click here to reset