Beyond Fixed Grid: Learning Geometric Image Representation with a Deformable Grid

08/21/2020
by   Jun Gao, et al.

In modern computer vision, images are typically represented as a fixed uniform grid with some stride and processed via a deep convolutional neural network. We argue that deforming the grid to better align with the high-frequency image content is a more effective strategy. We introduce Deformable Grid (DefGrid), a learnable neural network module that predicts location offsets of the vertices of a two-dimensional triangular grid, such that the edges of the deformed grid align with image boundaries. We showcase DefGrid in a variety of use cases by inserting it as a module at various levels of processing. We utilize DefGrid as an end-to-end learnable geometric downsampling layer that replaces standard pooling methods for reducing feature resolution when feeding images into a deep CNN. On the task of semantic segmentation, we show significantly improved results at the same grid resolution compared to using CNNs on uniform grids. We also utilize DefGrid at the output layers for the task of object mask annotation, and show that reasoning about object boundaries on our predicted polygonal grid leads to more accurate results than existing pixel-wise and curve-based approaches. Finally, we showcase DefGrid as a standalone module for unsupervised image partitioning, showing superior performance over existing approaches. Project website: http://www.cs.toronto.edu/~jungao/def-grid
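
To make the geometry in the abstract concrete, the following is a minimal sketch of a uniform triangular grid whose vertices are shifted by per-vertex offsets. It is not the authors' implementation: the function names (`make_triangular_grid`, `deform`, `signed_area`), the unit-square coordinates, the diagonal used to split each cell, and the `max_shift` clamp are all assumptions made for illustration. In DefGrid the offsets are predicted by a neural network, whereas here they are supplied by hand.

```python
def make_triangular_grid(rows, cols):
    """Build a uniform triangular grid over the unit square (illustrative only).

    Returns (vertices, triangles): vertices is a list of (x, y) pairs in [0, 1],
    triangles is a list of vertex-index triples. Each rectangular cell is split
    into two triangles along one diagonal (an assumed convention).
    """
    vertices = [(c / cols, r / rows)
                for r in range(rows + 1) for c in range(cols + 1)]
    triangles = []
    for r in range(rows):
        for c in range(cols):
            tl = r * (cols + 1) + c   # top-left vertex of the cell
            tr = tl + 1               # top-right
            bl = tl + cols + 1        # bottom-left
            br = bl + 1               # bottom-right
            triangles.append((tl, tr, bl))
            triangles.append((tr, br, bl))
    return vertices, triangles


def deform(vertices, offsets, max_shift):
    """Add one (dx, dy) offset per vertex, clamping each component to
    [-max_shift, max_shift]. The connectivity (triangle list) is untouched."""
    def clamp(v):
        return max(-max_shift, min(max_shift, v))
    return [(x + clamp(dx), y + clamp(dy))
            for (x, y), (dx, dy) in zip(vertices, offsets)]


def signed_area(a, b, c):
    """Signed area of triangle (a, b, c); a sign change means the triangle flipped."""
    return 0.5 * ((b[0] - a[0]) * (c[1] - a[1]) - (c[0] - a[0]) * (b[1] - a[1]))


if __name__ == "__main__":
    verts, tris = make_triangular_grid(2, 3)
    # Hand-written stand-in for network-predicted offsets: move one vertex only.
    offsets = [(0.0, 0.0)] * len(verts)
    offsets[5] = (0.05, -0.03)
    moved = deform(verts, offsets, max_shift=0.1)
    flipped = [t for t in tris if signed_area(*(moved[i] for i in t)) <= 0]
    print(len(verts), len(tris), len(flipped))
```

Only the vertex positions change under deformation; the triangle list stays fixed, which is what allows the deformed grid to keep a regular topology while its edges move. The signed-area check is one simple way to detect a triangle that has inverted after an offset is applied.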


research
12/28/2020

Spectral Analysis for Semantic Segmentation with Applications on Feature Truncation and Weak Annotation

The current neural networks for semantic segmentation usually predict th...
research
12/29/2020

SALA: Soft Assignment Local Aggregation for 3D Semantic Segmentation

We introduce the idea of using learnable neighbor-to-grid soft assignmen...
research
03/13/2019

LPM: Learnable Pooling Module for Efficient Full-Face Gaze Estimation

Gaze tracking is an important technology in many domains. Techniques suc...
research
04/28/2023

Differentiable Sensor Layouts for End-to-End Learning of Task-Specific Camera Parameters

The success of deep learning is frequently described as the ability to t...
research
11/29/2019

Deep Object Co-segmentation via Spatial-Semantic Network Modulation

Object co-segmentation is to segment the shared objects in multiple rele...
research
01/26/2021

AINet: Association Implantation for Superpixel Segmentation

Recently, some approaches are proposed to harness deep convolutional net...
