ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos

09/25/2021
by   Negin Ghamsarian, et al.
0

Semantic segmentation in surgical videos is a prerequisite for a broad range of applications towards improving surgical outcomes and surgical video analysis. However, semantic segmentation in surgical videos involves many challenges. In particular, in cataract surgery, various features of the relevant objects such as blunt edges, color and context variation, reflection, transparency, and motion blur pose a challenge for semantic segmentation. In this paper, we propose a novel convolutional module termed as ReCal module, which can calibrate the feature maps by employing region intra-and-inter-dependencies and channel-region cross-dependencies. This calibration strategy can effectively enhance semantic representation by correlating different representations of the same semantic label, considering a multi-angle local view centering around each pixel. Thus the proposed module can deal with distant visual characteristics of unique objects as well as cross-similarities in the visual characteristics of different objects. Moreover, we propose a novel network architecture based on the proposed module termed as ReCal-Net. Experimental results confirm the superiority of ReCal-Net compared to rival state-of-the-art approaches for all relevant objects in cataract surgery. Moreover, ablation studies reveal the effectiveness of the ReCal module in boosting semantic segmentation accuracy.

READ FULL TEXT

page 9

page 10

research
09/11/2021

DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Semantic segmentation in cataract surgery has a wide range of applicatio...
research
07/04/2022

DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Semantic segmentation in cataract surgery has a wide range of applicatio...
research
09/23/2019

RAUNet: Residual Attention U-Net for Semantic Segmentation of Cataract Surgical Instruments

Semantic segmentation of surgical instruments plays a crucial role in ro...
research
06/19/2023

A spatio-temporal network for video semantic segmentation in surgical videos

Semantic segmentation in surgical videos has applications in intra-opera...
research
07/02/2021

LensID: A CNN-RNN-Based Framework Towards Lens Irregularity Detection in Cataract Surgery Videos

A critical complication after cataract surgery is the dislocation of the...
research
09/08/2019

Squeeze-and-Attention Networks for Semantic Segmentation

Squeeze-and-excitation (SE) module enhances the representational power o...
research
03/29/2022

Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation

Automatic surgical scene segmentation is fundamental for facilitating co...

Please sign up or login with your details

Forgot password? Click here to reset