You Only Look Once: Unified, Real-Time Object Detection

06/08/2015
by   Joseph Redmon, et al.
0

We present YOLO, a new approach to object detection. Prior work on object detection repurposes classifiers to perform detection. Instead, we frame object detection as a regression problem to spatially separated bounding boxes and associated class probabilities. A single neural network predicts bounding boxes and class probabilities directly from full images in one evaluation. Since the whole detection pipeline is a single network, it can be optimized end-to-end directly on detection performance. Our unified architecture is extremely fast. Our base YOLO model processes images in real-time at 45 frames per second. A smaller version of the network, Fast YOLO, processes an astounding 155 frames per second while still achieving double the mAP of other real-time detectors. Compared to state-of-the-art detection systems, YOLO makes more localization errors but is far less likely to predict false detections where nothing exists. Finally, YOLO learns very general representations of objects. It outperforms all other detection methods, including DPM and R-CNN, by a wide margin when generalizing from natural images to artwork on both the Picasso Dataset and the People-Art Dataset.

READ FULL TEXT

page 1

page 2

page 6

page 8

research
11/15/2019

You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

Spatiotemporal action localization requires incorporation of two sources...
research
09/16/2015

DenseBox: Unifying Landmark Localization with End to End Object Detection

How can a single fully convolutional neural network (FCN) perform on obj...
research
07/31/2017

Towards the Success Rate of One: Real-time Unconstrained Salient Object Detection

In this work, we propose an efficient and effective approach for unconst...
research
11/23/2016

Straight to Shapes: Real-time Detection of Encoded Shapes

Current object detection approaches predict bounding boxes, but these pr...
research
12/26/2018

Region Proposal Networks with Contextual Selective Attention for Real-Time Organ Detection

State-of-the-art methods for object detection use region proposal networ...
research
01/20/2020

Real-Time Object Detection and Recognition on Low-Compute Humanoid Robots using Deep Learning

We envision that in the near future, humanoid robots would share home sp...
research
07/14/2021

Diff-Net: Image Feature Difference based High-Definition Map Change Detection

Up-to-date High-Definition (HD) maps are essential for self-driving cars...

Please sign up or login with your details

Forgot password? Click here to reset