Pyramid Grafting Network for One-Stage High Resolution Saliency Detection

04/11/2022
by   Chenxi Xie, et al.
2

Recent salient object detection (SOD) methods based on deep neural network have achieved remarkable performance. However, most of existing SOD models designed for low-resolution input perform poorly on high-resolution images due to the contradiction between the sampling depth and the receptive field size. Aiming at resolving this contradiction, we propose a novel one-stage framework called Pyramid Grafting Network (PGNet), using transformer and CNN backbone to extract features from different resolution images independently and then graft the features from transformer branch to CNN branch. An attention-based Cross-Model Grafting Module (CMGM) is proposed to enable CNN branch to combine broken detailed information more holistically, guided by different source feature during decoding process. Moreover, we design an Attention Guided Loss (AGL) to explicitly supervise the attention matrix generated by CMGM to help the network better interact with the attention from different models. We contribute a new Ultra-High-Resolution Saliency Detection dataset UHRSD, containing 5,920 images at 4K-8K resolutions. To our knowledge, it is the largest dataset in both quantity and resolution for high-resolution SOD task, which can be used for training and testing in future research. Sufficient experiments on UHRSD and widely-used SOD datasets demonstrate that our method achieves superior performance compared to the state-of-the-art methods.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

research
08/20/2019

Towards High-Resolution Salient Object Detection

Deep neural network based methods have made a significant breakthrough i...
research
07/20/2023

Hybrid Feature Embedding For Automatic Building Outline Extraction

Building outline extracted from high-resolution aerial images can be use...
research
12/01/2022

Concealed Object Detection for Passive Millimeter-Wave Security Imaging Based on Task-Aligned Detection Transformer

Passive millimeter-wave (PMMW) is a significant potential technique for ...
research
10/26/2020

P^2 Net: Augmented Parallel-Pyramid Net for Attention Guided Pose Estimation

We propose an augmented Parallel-Pyramid Net (P^2 Net) with feature refi...
research
06/08/2021

LocalTrans: A Multiscale Local Transformer Network for Cross-Resolution Homography Estimation

Cross-resolution image alignment is a key problem in multiscale gigapixe...
research
11/14/2019

Detecting cutaneous basal cell carcinomas in ultra-high resolution and weakly labelled histopathological images

Diagnosing basal cell carcinomas (BCC), one of the most common cutaneous...
research
02/17/2022

Single UHD Image Dehazing via Interpretable Pyramid Network

Currently, most single image dehazing models cannot run an ultra-high-re...

Please sign up or login with your details

Forgot password? Click here to reset