Pushing the envelope in deep visual recognition for mobile platforms

10/16/2017
by Lorenzo Alvino, et al.

Image classification is the task of assigning an input image a label from a fixed set of categories. One of its most important application areas is robotics, where a robot needs to be aware of its surroundings and to exploit that information in carrying out its tasks. In this work we consider the problem of a robot that enters a new environment and wants to understand the visual data coming from its camera in order to extract knowledge from it. As the main novelty, we want to remove the need for a physical robot, which can be expensive and impractical, in the hope of improving, speeding up and easing research in this field. To this end we propose an application for a mobile platform that wraps several deep visual recognition tasks. First we deal with simple image classification, testing a model obtained from an AlexNet trained on the ILSVRC 2012 dataset. Several photo settings are considered to better understand which factors most affect the quality of classification. For the same purpose we are interested in integrating the classification task with an extra module that segments the object inside the image. In particular, we propose a technique for extracting the object shape and removing the background, so as to focus the classification only on the region occupied by the object. Another significant task included is object discovery. Its purpose is to simulate the situation in which the robot needs a certain object to complete one of its activities: it searches for the object by scanning the surrounding environment and trying to determine the object's location. Finally, we provide a tool for creating customized, task-specific databases, meant to better suit one's needs in a particular vision task.
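
As an illustration of the background-removal idea mentioned in the abstract, the following is a minimal sketch and not the authors' method. It assumes a binary object mask is already available (the abstract does not say how the mask is obtained), represents a grayscale image as nested lists, and uses a hypothetical helper name `mask_and_crop`. It blanks out background pixels and crops to the object's bounding box, so that a classifier would see only the region occupied by the object.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale image as rows of pixel intensities
Mask = List[List[int]]   # 1 where the object is, 0 for background (assumed given)


def mask_and_crop(image: Image, mask: Mask, fill: int = 0) -> Image:
    """Blank out background pixels, then crop to the object's bounding box.

    Hypothetical helper for illustration only; not taken from the paper.
    """
    if len(image) != len(mask) or any(
        len(r) != len(m) for r, m in zip(image, mask)
    ):
        raise ValueError("image and mask must have the same shape")

    # Coordinates of every pixel the mask marks as object.
    coords: List[Tuple[int, int]] = [
        (y, x) for y, row in enumerate(mask) for x, v in enumerate(row) if v
    ]
    if not coords:
        return []  # no object pixels, so there is nothing to classify

    top = min(y for y, _ in coords)
    bottom = max(y for y, _ in coords)
    left = min(x for _, x in coords)
    right = max(x for _, x in coords)

    # Keep object pixels; replace background inside the box with `fill`.
    return [
        [image[y][x] if mask[y][x] else fill for x in range(left, right + 1)]
        for y in range(top, bottom + 1)
    ]


if __name__ == "__main__":
    image = [
        [9, 9, 9, 9],
        [9, 5, 6, 9],
        [9, 9, 7, 9],
        [9, 9, 9, 9],
    ]
    mask = [
        [0, 0, 0, 0],
        [0, 1, 1, 0],
        [0, 0, 1, 0],
        [0, 0, 0, 0],
    ]
    print(mask_and_crop(image, mask))
```

In a real pipeline the cropped region would then be resized to the classifier's input size; that step is omitted here because the abstract does not specify it.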


