Semantically Meaningful View Selection

07/26/2018
by Joris Guérin, et al.

An understanding of the nature of objects could help robots both solve high-level abstract tasks and improve performance on lower-level concrete tasks. Although deep learning has facilitated progress in image understanding, a robot's performance on problems such as object recognition often depends on the angle from which the object is observed. Traditionally, robot sorting tasks rely on a fixed top-down view of an object. By changing its viewing angle, a robot can select a more semantically informative view, leading to better object recognition performance. In this paper, we introduce the problem of semantic view selection, which seeks to find good camera poses for gaining semantic knowledge about an observed object. We propose a conceptual formulation of the problem, together with a solvable relaxation based on clustering. We then present a new image dataset consisting of around 10k images representing various views of 144 objects under different poses. Finally, we use this dataset to propose a first solution to the problem by training a neural network to predict a "semantic score" from a top-view image and a camera pose. The views predicted to have higher scores are then shown to provide better clustering results than fixed top-down views.
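
The selection step described above (score each candidate camera pose from the top view, then move to the best-scoring one) might be sketched as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the `(azimuth, elevation)` pose representation, the names `select_view` and `toy_score`, and the scoring rule are all hypothetical, and `toy_score` merely stands in for the trained neural network mentioned in the abstract.

```python
from typing import Callable, Sequence, Tuple

# Assumption: a camera pose is an (azimuth_deg, elevation_deg) pair.
# The abstract does not specify the actual pose parameterization.
Pose = Tuple[float, float]


def select_view(
    top_view_image: object,
    candidate_poses: Sequence[Pose],
    score_fn: Callable[[object, Pose], float],
) -> Tuple[Pose, float]:
    """Return the candidate pose with the highest predicted score."""
    if not candidate_poses:
        raise ValueError("candidate_poses must not be empty")
    # Score every candidate pose using the same top-view image.
    scored = [(pose, score_fn(top_view_image, pose)) for pose in candidate_poses]
    # Keep the (pose, score) pair whose score is largest.
    return max(scored, key=lambda item: item[1])


def toy_score(image: object, pose: Pose) -> float:
    """Hypothetical placeholder scorer, NOT the paper's network.

    It ignores the image and simply prefers elevations near 45 degrees.
    """
    _, elevation = pose
    return -abs(elevation - 45.0)


# Three hypothetical candidates; (0.0, 90.0) plays the role of the top-down view.
poses = [(0.0, 90.0), (0.0, 45.0), (90.0, 30.0)]
best_pose, best_score = select_view(None, poses, toy_score)
print(best_pose, best_score)
```

Passing the scorer in as a function keeps the selection logic independent of how the score is produced, so a learned model could replace `toy_score` without changing `select_view`.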
