Pyramid Point: A Multi-Level Focusing Network for Revisiting Feature Layers

11/17/2020
by   Nina Varney, et al.
0

We present a method to learn a diverse group of object categories from an unordered point set. We propose our Pyramid Point network, which uses a dense pyramid structure instead of the traditional 'U' shape, typically seen in semantic segmentation networks. This pyramid structure gives a second look, allowing the network to revisit different layers simultaneously, increasing the contextual information by creating additional layers with less noise. We introduce a Focused Kernel Point convolution (FKP Conv), which expands on the traditional point convolutions by adding an attention mechanism to the kernel outputs. This FKP Conv increases our feature quality and allows us to weigh the kernel outputs dynamically. These FKP Convs are the central part of our Recurrent FKP Bottleneck block, which makes up the backbone of our encoder. With this distinct network, we demonstrate competitive performance on three benchmark data sets. We also perform an ablation study to show the positive effects of each element in our FKP Conv.

READ FULL TEXT

page 6

page 7

page 8

research
06/03/2020

DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

Many modern object detectors demonstrate outstanding performances by usi...
research
05/06/2020

Scale-Equalizing Pyramid Convolution for Object Detection

Feature pyramid has been an efficient method to extract features at diff...
research
04/28/2021

Point Cloud Learning with Transformer

Remarkable performance from Transformer networks in Natural Language Pro...
research
12/02/2019

IPG-Net: Image Pyramid Guidance Network for Object Detection

For Convolutional Neural Network based object detection, there is a typi...
research
03/23/2020

Spatial Pyramid Based Graph Reasoning for Semantic Segmentation

The convolution operation suffers from a limited receptive filed, while ...
research
09/11/2021

DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Semantic segmentation in cataract surgery has a wide range of applicatio...
research
06/03/2022

YOLOv5s-GTB: light-weighted and improved YOLOv5s for bridge crack detection

In response to the situation that the conventional bridge crack manual d...

Please sign up or login with your details

Forgot password? Click here to reset