BARNet: Bilinear Attention Network with Adaptive Receptive Field for Surgical Instrument Segmentation

01/20/2020
by   Zhen-Liang Ni, et al.
20

Surgical instrument segmentation is extremely important for computer-assisted surgery. Different from common object segmentation, it is more challenging due to the large illumination and scale variation caused by the special surgical scenes. In this paper, we propose a novel bilinear attention network with adaptive receptive field to solve these two challenges. For the illumination variation, the bilinear attention module can capture second-order statistics to encode global contexts and semantic dependencies between local pixels. With them, semantic features in challenging areas can be inferred from their neighbors and the distinction of various semantics can be boosted. For the scale variation, our adaptive receptive field module aggregates multi-scale features and automatically fuses them with different weights. Specifically, it encodes the semantic relationship between channels to emphasize feature maps with appropriate scales, changing the receptive field of subsequent convolutions. The proposed network achieves the best performance 97.47 IOU on Cata7 and comes first place on EndoVis 2017 by 10.10 second-ranking method.

READ FULL TEXT

page 1

page 3

page 5

page 6

research
09/23/2019

RAUNet: Residual Attention U-Net for Semantic Segmentation of Cataract Surgical Instruments

Semantic segmentation of surgical instruments plays a crucial role in ro...
research
06/13/2021

DMSANet: Dual Multi Scale Attention Network

Attention mechanism of late has been quite popular in the computer visio...
research
11/17/2016

AutoScaler: Scale-Attention Networks for Visual Correspondence

Finding visual correspondence between local features is key to many comp...
research
09/21/2022

SDA-xNet: Selective Depth Attention Networks for Adaptive Multi-scale Feature Representation

Existing multi-scale solutions lead to a risk of just increasing the rec...
research
01/21/2020

VMRFANet:View-Specific Multi-Receptive Field Attention Network for Person Re-identification

Person re-identification (re-ID) aims to retrieve the same person across...
research
01/19/2020

Gated Path Selection Network for Semantic Segmentation

Semantic segmentation is a challenging task that needs to handle large s...
research
01/12/2020

Complementary Network with Adaptive Receptive Fields for Melanoma Segmentation

Automatic melanoma segmentation in dermoscopic images is essential in co...

Please sign up or login with your details

Forgot password? Click here to reset