Real-world Object Recognition with Off-the-shelf Deep Conv Nets: How Many Objects can iCub Learn?

by Giulia Pasquale, et al.

The ability to visually recognize objects is a fundamental skill for robotic systems. Indeed, a large variety of tasks involving manipulation, navigation, or interaction with other agents depend heavily on an accurate understanding of the visual scene. Yet, at present, robots lack good visual perceptual systems, and this often becomes the main bottleneck preventing the use of autonomous agents in real-world applications. Recently in computer vision, systems that learn suitable visual representations and are based on multi-layer deep convolutional networks have shown remarkable performance in tasks such as large-scale visual recognition and image retrieval. In this regard, it is natural to ask whether such remarkable performance would also generalize to the robotic setting. In this paper we investigate this possibility, while taking further steps in developing a computational vision system to be embedded on a robotic platform, the iCub humanoid robot. In particular, we release a new dataset (iCubWorld28) that we use as a benchmark to address the question: how many objects can iCub recognize? Our study is developed in a learning framework that reflects the typical visual experience of a humanoid robot like the iCub. Experiments offer interesting insights into the strengths and weaknesses of current computer vision approaches applied in real robotic settings.




