Learning the semantic structure of objects from Web supervision

07/05/2016
by David Novotny, et al.

While recent research in image understanding has often focused on recognizing more types of objects, understanding more about the objects is just as important. Recognizing object parts and attributes has been extensively studied before, yet learning a large space of such concepts remains elusive due to the high cost of providing detailed object annotations for supervision. The key contribution of this paper is an algorithm to learn the nameable parts of objects automatically, from images obtained by querying Web search engines. The key challenge is the high level of noise in the annotations; to address it, we propose a new unified embedding space where the appearance and geometry of objects and their semantic parts are represented uniformly. Geometric relationships are induced in a soft manner by a rich set of non-semantic mid-level anchors, bridging the gap between semantic and non-semantic parts. We also show that the resulting embedding provides a visually intuitive mechanism to navigate the learned concepts and their corresponding images.

research
12/15/2016

Beyond Holistic Object Recognition: Enriching Image Understanding with Part States

Important high-level vision tasks such as human-object interaction, imag...
research
08/18/2016

Semantic Understanding of Scenes through the ADE20K Dataset

Scene parsing, or recognizing and segmenting objects and stuff in an ima...
research
09/11/2016

Learning Semantic Part-Based Models from Google Images

We propose a technique to train semantic part-based models of object cla...
research
08/18/2018

Concept Mask: Large-Scale Segmentation from Semantic Concepts

Existing works on semantic segmentation typically consider a small numbe...
research
09/14/2015

Expanded Parts Model for Semantic Description of Humans in Still Images

We introduce an Expanded Parts Model (EPM) for recognizing human attribu...
research
12/09/2015

ShapeNet: An Information-Rich 3D Model Repository

We present ShapeNet: a richly-annotated, large-scale repository of shape...
research
05/11/2022

Identifying concept libraries from language about object structure

Our understanding of the visual world goes beyond naming objects, encomp...
