Information Pursuit: A Bayesian Framework for Sequential Scene Parsing

01/09/2017
by Ehsan Jahangiri, et al.

Despite enormous progress in object detection and classification, incorporating expected contextual relationships among object instances into modern recognition systems remains a key challenge. In this work we propose Information Pursuit, a Bayesian framework for scene parsing that combines prior models for the geometry of the scene and the spatial arrangement of object instances with a data model for the output of high-level image classifiers trained to answer specific questions about the scene. In the proposed framework, the scene interpretation is progressively refined as evidence accumulates from the answers to a sequence of questions. At each step, we choose the question that maximizes the mutual information between the new answer and the full interpretation, given the evidence obtained from previous inquiries. We also propose a method for learning the parameters of the model from synthesized, annotated scenes obtained by top-down sampling from an easy-to-learn generative scene model. Finally, we introduce a database of annotated indoor scenes of dining room tables, which we use to evaluate the proposed approach.
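The question-selection rule described in the abstract can be illustrated with a small toy sketch. This is not the authors' implementation: it assumes a finite set of candidate interpretations, binary answers, and answers that are conditionally independent given the interpretation. The names `mutual_information`, `bayes_update`, `information_pursuit`, and the toy numbers are all hypothetical, chosen only for illustration.

```python
import numpy as np

def binary_entropy(p):
    """Entropy in bits of a Bernoulli(p) variable, elementwise."""
    p = np.clip(p, 1e-12, 1 - 1e-12)
    return -(p * np.log2(p) + (1 - p) * np.log2(1 - p))

def mutual_information(posterior, likelihood):
    """I(interpretation; answer_q | evidence) for every question q.

    posterior:  shape (H,), current belief over interpretations.
    likelihood: shape (Q, H), assumed P(answer_q = 1 | interpretation h).
    """
    predictive = likelihood @ posterior                 # P(answer_q = 1 | evidence)
    conditional = binary_entropy(likelihood) @ posterior
    return binary_entropy(predictive) - conditional

def bayes_update(posterior, likelihood_row, answer):
    """Condition the belief on one observed binary answer."""
    like = likelihood_row if answer == 1 else 1.0 - likelihood_row
    unnormalized = posterior * like
    return unnormalized / unnormalized.sum()

def information_pursuit(prior, likelihood, answer_fn, n_steps):
    """Greedily ask the unasked question with the largest mutual information."""
    posterior = prior.copy()
    available = np.ones(likelihood.shape[0], dtype=bool)
    asked = []
    for _ in range(n_steps):
        mi = np.where(available, mutual_information(posterior, likelihood), -np.inf)
        q = int(np.argmax(mi))
        posterior = bayes_update(posterior, likelihood[q], answer_fn(q))
        available[q] = False
        asked.append(q)
    return posterior, asked

# Toy setup (hypothetical numbers): 4 interpretations, 3 questions.
likelihood = np.array([
    [0.95, 0.95, 0.05, 0.05],   # sharp question separating {0,1} from {2,3}
    [0.90, 0.10, 0.90, 0.10],   # separates {0,2} from {1,3}
    [0.60, 0.60, 0.60, 0.40],   # weak, nearly uninformative question
])
prior = np.full(4, 0.25)
true_h = 2

def answer_fn(q):
    # Stand-in for a classifier: returns the more likely answer under true_h.
    return int(likelihood[q, true_h] > 0.5)

posterior, asked = information_pursuit(prior, likelihood, answer_fn, n_steps=2)
print("questions asked:", asked)
print("posterior:", np.round(posterior, 3))
```

In this sketch the weak third question is never selected within two steps, because its answer changes the belief over interpretations very little. The framework in the paper works with structured scene priors and classifier outputs rather than a small enumerated hypothesis set, so this should be read only as an illustration of the greedy selection criterion.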


