Rethinking Learnable Tree Filter for Generic Feature Transform

12/07/2020
by   Lin Song, et al.
0

The Learnable Tree Filter presents a remarkable approach to model structure-preserving relations for semantic segmentation. Nevertheless, the intrinsic geometric constraint forces it to focus on the regions with close spatial distance, hindering the effective long-range interactions. To relax the geometric constraint, we give the analysis by reformulating it as a Markov Random Field and introduce a learnable unary term. Besides, we propose a learnable spanning tree algorithm to replace the original non-differentiable one, which further improves the flexibility and robustness. With the above improvements, our method can better capture long-range dependencies and preserve structural details with linear complexity, which is extended to several vision tasks for more generic feature transform. Extensive experiments on object detection/instance segmentation demonstrate the consistent improvements over the original version. For semantic segmentation, we achieve leading performance (82.1 bells-and-whistles. Code is available at https://github.com/StevenGrove/LearnableTreeFilterV2.

READ FULL TEXT

page 2

page 7

page 13

page 14

page 17

research
09/27/2019

Learnable Tree Filter for Structure-preserving Feature Transform

Learning discriminative global features plays a vital role in semantic s...
research
11/23/2022

EurNet: Efficient Multi-Range Relational Modeling of Spatial Multi-Relational Data

Modeling spatial relationship in the data remains critical across many d...
research
09/16/2019

Global Aggregation then Local Distribution in Fully Convolutional Networks

It has been widely proven that modelling long-range dependencies in full...
research
07/17/2023

On Point Affiliation in Feature Upsampling

We introduce the notion of point affiliation into feature upsampling. By...
research
10/03/2022

Masked Supervised Learning for Semantic Segmentation

Self-attention is of vital importance in semantic segmentation as it ena...
research
04/14/2021

HoughNet: Integrating near and long-range evidence for visual detection

This paper presents HoughNet, a one-stage, anchor-free, voting-based, bo...
research
12/17/2020

Semi-Global Shape-aware Network

Non-local operations are usually used to capture long-range dependencies...

Please sign up or login with your details

Forgot password? Click here to reset