Deep Learning Based Automatic Video Annotation Tool for Self-Driving Car

04/19/2019
by   N. S. Manikandan, et al.
0

In a self-driving car, objection detection, object classification, lane detection and object tracking are considered to be the crucial modules. In recent times, using the real time video one wants to narrate the scene captured by the camera fitted in our vehicle. To effectively implement this task, deep learning techniques and automatic video annotation tools are widely used. In the present paper, we compare the various techniques that are available for each module and choose the best algorithm among them by using appropriate metrics. For object detection, YOLO and Retinanet-50 are considered and the best one is chosen based on mean Average Precision (mAP). For object classification, we consider VGG-19 and Resnet-50 and select the best algorithm based on low error rate and good accuracy. For lane detection, Udacity's 'Finding Lane Line' and deep learning based LaneNet algorithms are compared and the best one that can accurately identify the given lane is chosen for implementation. As far as object tracking is concerned, we compare Udacity's 'Object Detection and Tracking' algorithm and deep learning based Deep Sort algorithm. Based on the accuracy of tracking the same object in many frames and predicting the movement of objects, the best algorithm is chosen. Our automatic video annotation tool is found to be 83 annotator. We considered a video with 530 frames each of resolution 1035 x 1800 pixels. At an average each frame had about 15 objects. Our annotation tool consumed 43 minutes in a CPU based system and 2.58 minutes in a mid-level GPU based system to process all four modules. But the same video took nearly 3060 minutes for one human annotator to narrate the scene in the given video. Thus we claim that our proposed automatic video annotation tool is reasonably fast (about 1200 times in a GPU system) and accurate.

READ FULL TEXT
research
07/25/2022

Video object tracking based on YOLOv7 and DeepSORT

Multiple object tracking (MOT) is an important technology in the field o...
research
08/24/2022

Comparison of Object Detection Algorithms for Street-level Objects

Object detection for street-level objects can be applied to various use ...
research
05/31/2019

Driver Behavior Analysis Using Lane Departure Detection Under Challenging Conditions

In this paper, we present a novel model to detect lane regions and extra...
research
10/28/2018

Deep Affinity Network for Multiple Object Tracking

Multiple Object Tracking (MOT) plays an important role in solving many f...
research
02/22/2021

Phase Space Reconstruction Network for Lane Intrusion Action Recognition

In a complex road traffic scene, illegal lane intrusion of pedestrians o...
research
11/09/2019

A Proposed Artificial intelligence Model for Real-Time Human Action Localization and Tracking

In recent years, artificial intelligence (AI) based on deep learning (DL...
research
01/18/2021

Semi-Automatic Video Annotation For Object Detection

In this study, a semi-automatic video annotation method is proposed whic...

Please sign up or login with your details

Forgot password? Click here to reset