FaPN: Feature-aligned Pyramid Network for Dense Image Prediction

08/16/2021
by Shihua Huang, et al.

Recent advances in deep neural networks have driven remarkable progress in dense image prediction. However, most existing approaches neglect the issue of feature alignment for the sake of simplicity. Direct pixel-wise addition of upsampled and local features produces feature maps with misaligned contexts, which in turn lead to misclassifications in prediction, especially on object boundaries. In this paper, we propose a feature alignment module that learns transformation offsets of pixels to contextually align upsampled higher-level features, and a feature selection module that emphasizes the lower-level features with rich spatial details. We then integrate these two modules in a top-down pyramidal architecture and present the Feature-aligned Pyramid Network (FaPN). Extensive experimental evaluations on four dense prediction tasks and four datasets demonstrate the efficacy of FaPN, yielding an overall improvement of 1.2 to 2.6 points in AP / mIoU over FPN when paired with Faster / Mask R-CNN. In particular, FaPN achieves a state-of-the-art 56.7 when integrated within Mask-Former. The code is available from https://github.com/EMI-Group/FaPN.
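To make the alignment idea concrete, the following is a minimal pure-Python sketch of top-down fusion on a single-channel grid: the coarse map is upsampled, optionally resampled at per-pixel offsets, and then added to the finer map. It is an illustration under stated assumptions, not the authors' implementation. The offsets are supplied by hand here, whereas the abstract describes a module that learns them; the feature selection module is not modeled; and the function names (`upsample_nearest_2x`, `bilinear_sample`, `align`, `fuse`) are hypothetical.

```python
# Illustrative sketch only. Offsets are hand-supplied here; in the paper they
# are learned. All function names are hypothetical, not from the FaPN code.

def upsample_nearest_2x(feat):
    """Double the height and width of a 2D grid by repeating each value."""
    out = []
    for row in feat:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def bilinear_sample(feat, y, x):
    """Sample a 2D grid at a fractional (y, x), clamping to the border."""
    h, w = len(feat), len(feat[0])
    y = min(max(y, 0.0), h - 1.0)
    x = min(max(x, 0.0), w - 1.0)
    y0, x0 = int(y), int(x)
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feat[y0][x0] * (1 - dx) + feat[y0][x1] * dx
    bottom = feat[y1][x0] * (1 - dx) + feat[y1][x1] * dx
    return top * (1 - dy) + bottom * dy


def align(upsampled, offsets):
    """Resample every pixel at its own (dy, dx)-shifted location."""
    return [
        [bilinear_sample(upsampled, i + offsets[i][j][0], j + offsets[i][j][1])
         for j in range(len(upsampled[0]))]
        for i in range(len(upsampled))
    ]


def fuse(fine, coarse, offsets=None):
    """Upsample `coarse`, optionally align it, then add it to `fine`."""
    up = upsample_nearest_2x(coarse)
    if offsets is not None:
        up = align(up, offsets)
    return [[f + u for f, u in zip(f_row, u_row)]
            for f_row, u_row in zip(fine, up)]
```

Calling `fuse(fine, coarse)` corresponds to the direct pixel addition that the abstract criticizes, while `fuse(fine, coarse, offsets)` shifts where each upsampled value is read from before the addition. With all-zero offsets the two calls give the same result.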


research
06/28/2023

AFPN: Asymptotic Feature Pyramid Network for Object Detection

Multi-scale features are of great importance in encoding objects with sc...
research
02/24/2020

Semantic Flow for Fast and Accurate Scene Parsing

In this paper, we focus on effective methods for fast and accurate scene...
research
11/01/2022

Pixel-Wise Contrastive Distillation

We present the first pixel-level self-supervised distillation framework ...
research
07/21/2020

BorderDet: Border Feature for Dense Object Detection

Dense object detectors rely on the sliding-window paradigm that predicts...
research
11/08/2020

Adaptive Linear Span Network for Object Skeleton Detection

Conventional networks for object skeleton detection are usually hand-cra...
research
12/04/2022

CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation

Referring image segmentation aims at localizing all pixels of the visual...
research
05/10/2022

OTFPF: Optimal Transport-Based Feature Pyramid Fusion Network for Brain Age Estimation with 3D Overlapped ConvNeXt

Chronological age of healthy brain is able to be predicted using deep ne...
