Lifting GIS Maps into Strong Geometric Context for Scene Understanding

07/14/2015
by   Raúl Díaz, et al.
0

Contextual information can have a substantial impact on the performance of visual tasks such as semantic segmentation, object detection, and geometric estimation. Data stored in Geographic Information Systems (GIS) offers a rich source of contextual information that has been largely untapped by computer vision. We propose to leverage such information for scene understanding by combining GIS resources with large sets of unorganized photographs using Structure from Motion (SfM) techniques. We present a pipeline to quickly generate strong 3D geometric priors from 2D GIS data using SfM models aligned with minimal user input. Given an image resectioned against this model, we generate robust predictions of depth, surface normals, and semantic labels. We show that the precision of the predicted geometry is substantially more accurate other single-image depth estimation methods. We then demonstrate the utility of these contextual constraints for re-scoring pedestrian detections, and use these GIS contextual features alongside object detection score maps to improve a CRF-based semantic segmentation framework, boosting accuracy over baseline models.

READ FULL TEXT

page 2

page 3

page 5

page 7

page 8

research
07/29/2021

CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

Monocular depth estimation and semantic segmentation are two fundamental...
research
04/05/2023

Semantic Validation in Structure from Motion

The Structure from Motion (SfM) challenge in computer vision is the proc...
research
06/02/2023

Towards In-context Scene Understanding

In-context learningx2013the ability to configure a model's behavior with...
research
10/04/2022

FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions

In this work we present FreDSNet, a deep learning solution which obtains...
research
02/10/2023

Context Understanding in Computer Vision: A Survey

Contextual information plays an important role in many computer vision t...
research
09/13/2022

CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection

Existing deep learning-based 3D object detectors typically rely on the a...
research
01/06/2021

RethNet: Object-by-Object Learning for Detecting Facial Skin Problems

Semantic segmentation is a hot topic in computer vision where the most c...

Please sign up or login with your details

Forgot password? Click here to reset