Real-Time Object Detection and Recognition on Low-Compute Humanoid Robots using Deep Learning

01/20/2020
by   Sayantan Chatterjee, et al.
0

We envision that in the near future, humanoid robots would share home space and assist us in our daily and routine activities through object manipulations. One of the fundamental technologies that need to be developed for robots is to enable them to detect objects and recognize them for effective manipulations and take real-time decisions involving those objects. In this paper, we describe a novel architecture that enables multiple low-compute NAO robots to perform real-time detection, recognition and localization of objects in its camera view and take programmable actions based on the detected objects. The proposed algorithm for object detection and localization is an empirical modification of YOLOv3, based on indoor experiments in multiple scenarios, with a smaller weight size and lesser computational requirements. Quantization of the weights and re-adjusting filter sizes and layer arrangements for convolutions improved the inference time for low-resolution images from the robot s camera feed. YOLOv3 was chosen after a comparative study of bounding box algorithms was performed with an objective to choose one that strikes the perfect balance among information retention, low inference time and high accuracy for real-time object detection and localization. The architecture also comprises of an effective end-to-end pipeline to feed the real-time frames from the camera feed to the neural net and use its results for guiding the robot with customizable actions corresponding to the detected class labels.

READ FULL TEXT
research
03/14/2017

Geometry-Based Region Proposals for Real-Time Robot Detection of Tabletop Objects

We present a novel object detection pipeline for localization and recogn...
research
06/08/2015

You Only Look Once: Unified, Real-Time Object Detection

We present YOLO, a new approach to object detection. Prior work on objec...
research
08/21/2020

Line-Circle-Square (LCS): A Multilayered Geometric Filter for Edge-Based Detection

This paper presents a state-of-the-art filter that reduces the complexit...
research
09/07/2022

Real Time Multi-Class Object Detection and Recognition Using Vision Augmentation Algorithm

The aim of this research is to detect small objects with low resolution ...
research
12/23/2019

FisheyeMultiNet: Real-time Multi-task Learning Architecture for Surround-view Automated Parking System

Automated Parking is a low speed manoeuvring scenario which is quite uns...
research
09/01/2021

From Movement Kinematics to Object Properties: Online Recognition of Human Carefulness

When manipulating objects, humans finely adapt their motions to the char...
research
01/18/2014

Modelling Observation Correlations for Active Exploration and Robust Object Detection

Today, mobile robots are expected to carry out increasingly complex task...

Please sign up or login with your details

Forgot password? Click here to reset