DeepAI AI Chat
Log In Sign Up

Large-Scale 3D Scene Classification With Multi-View Volumetric CNN

by   Dror Aiger, et al.

We introduce a method to classify imagery using a convo- lutional neural network (CNN) on multi-view image pro- jections. The power of our method comes from using pro- jections of multiple images at multiple depth planes near the reconstructed surface. This enables classification of categories whose salient aspect is appearance change un- der different viewpoints, such as water, trees, and other materials with complex reflection/light response proper- ties. Our method does not require boundary labelling in images and works on pixel-level classification with a small (few pixels) context, which simplifies the cre- ation of a training set. We demonstrate this application on large-scale aerial imagery collections, and extend the per-pixel classification to robustly create a consistent 2D classification which can be used to fill the gaps in non- reconstructible water regions. We also apply our method to classify tree regions. In both cases, the training data can quickly be generated using a small number of manually- created polygons on a map. We show that even with a very simple and standard network our CNN outperforms the state-of-the-art image classification, the Inception-V3 model retrained from a large collection of aerial images.


page 2

page 5

page 6

page 9

page 10


AiRound and CV-BrCT: Novel Multi-View Datasets for Scene Classification

It is undeniable that aerial/satellite images can provide useful informa...

Learning CNN filters from user-drawn image markers for coconut-tree image classification

Identifying species of trees in aerial images is essential for land-use ...

Mapping industrial poultry operations at scale with deep learning and aerial imagery

Concentrated Animal Feeding Operations (CAFOs) pose serious risks to air...

Facing the Void: Overcoming Missing Data in Multi-View Imagery

In some scenarios, a single input image may not be enough to allow the o...

A Novel Recurrent Encoder-Decoder Structure for Large-Scale Multi-view Stereo Reconstruction from An Open Aerial Dataset

A great deal of research has demonstrated recently that multi-view stere...

Constrained Mutual Convex Cone Method for Image Set Based Recognition

In this paper, we propose a method for image-set classification based on...

Disaster Feature Classification on Aerial Photography to Explain Typhoon Damaged Region using Grad-CAM

Recent years, typhoon damages has become social problem owing to climate...