DeepPyram: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

09/11/2021
by Negin Ghamsarian, et al.

Semantic segmentation in cataract surgery has a wide range of applications that contribute to surgical outcome enhancement and clinical risk reduction. However, the relevant instances pose different segmentation difficulties, which makes designing a single network for all of them quite challenging. This paper proposes a semantic segmentation network termed DeepPyram that achieves superior performance in segmenting relevant objects in cataract surgery videos despite these varying difficulties. This superiority mainly originates from three modules: (i) Pyramid View Fusion, which provides a varying-angle global view of the surrounding region centered at each pixel position in the input convolutional feature map; (ii) Deformable Pyramid Reception, which enables a wide deformable receptive field that can adapt to geometric transformations in the object of interest; and (iii) Pyramid Loss, which adaptively supervises multi-scale semantic feature maps. These modules can effectively boost semantic segmentation performance, especially in the case of transparency, deformability, scalability, and blunt edges in objects. The proposed approach is evaluated on four cataract surgery datasets covering objects with different contextual features and is compared with thirteen state-of-the-art segmentation networks. The experimental results confirm that DeepPyram outperforms the rival approaches without imposing additional trainable parameters. A comprehensive ablation study further confirms the effectiveness of the proposed modules.
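
To make the idea of a multi-scale view around each pixel more concrete, the following is a minimal sketch of generic pyramid-style context pooling. It is not the Pyramid View Fusion module from the paper: the function names `local_mean` and `pyramid_context`, the window sizes, and the use of plain averaging are all assumptions chosen for illustration. For every position in a 2D feature map, it collects the mean over square windows of several sizes centered at that position.

```python
def local_mean(feature, row, col, size):
    """Mean of the size x size window centered at (row, col), clipped at the borders."""
    half = size // 2
    height, width = len(feature), len(feature[0])
    r0, r1 = max(0, row - half), min(height, row + half + 1)
    c0, c1 = max(0, col - half), min(width, col + half + 1)
    values = [feature[r][c] for r in range(r0, r1) for c in range(c0, c1)]
    return sum(values) / len(values)


def pyramid_context(feature, sizes=(1, 3, 5)):
    """For every pixel, return one local mean per window size (hypothetical helper).

    The result has shape height x width x len(sizes). Small windows keep
    local detail, while large windows summarize the wider surroundings.
    """
    height, width = len(feature), len(feature[0])
    return [
        [[local_mean(feature, r, c, s) for s in sizes] for c in range(width)]
        for r in range(height)
    ]


if __name__ == "__main__":
    # A single bright pixel in the middle of a 3 x 3 map.
    feature = [
        [0, 0, 0],
        [0, 9, 0],
        [0, 0, 0],
    ]
    context = pyramid_context(feature, sizes=(1, 3))
    print(context[1][1])  # center pixel: window means for sizes 1 and 3
    print(context[0][0])  # corner pixel: the size-3 window is clipped to 2 x 2
```

In a trained network the fixed averaging would typically be replaced by learned layers operating on multi-channel feature maps; the sketch only shows how context at several window sizes can be attached to each pixel position.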


research
07/04/2022

DeepPyramid: Enabling Pyramid View and Deformable Pyramid Reception for Semantic Segmentation in Cataract Surgery Videos

Semantic segmentation in cataract surgery has a wide range of applicatio...
research
09/25/2021

ReCal-Net: Joint Region-Channel-Wise Calibrated Network for Semantic Segmentation in Cataract Surgery Videos

Semantic segmentation in surgical videos is a prerequisite for a broad r...
research
05/26/2022

Deep Sensor Fusion with Pyramid Fusion Networks for 3D Semantic Segmentation

Robust environment perception for autonomous vehicles is a tremendous ch...
research
06/27/2019

ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation

Semantic segmentation has been a hot topic across diverse research field...
research
01/02/2018

Restricted Deformable Convolution based Road Scene Semantic Segmentation Using Surround View Cameras

Understanding the surrounding environment of the vehicle is still one of...
research
03/17/2017

Deformable Convolutional Networks

Convolutional neural networks (CNNs) are inherently limited to model geo...
research
11/17/2020

Pyramid Point: A Multi-Level Focusing Network for Revisiting Feature Layers

We present a method to learn a diverse group of object categories from a...
