Robotic Scene Segmentation with Memory Network for Runtime Surgical Context Inference

08/24/2023
by   Zongyu Li, et al.
0

Surgical context inference has recently garnered significant attention in robot-assisted surgery as it can facilitate workflow analysis, skill assessment, and error detection. However, runtime context inference is challenging since it requires timely and accurate detection of the interactions among the tools and objects in the surgical scene based on the segmentation of video data. On the other hand, existing state-of-the-art video segmentation methods are often biased against infrequent classes and fail to provide temporal consistency for segmented masks. This can negatively impact the context inference and accurate detection of critical states. In this study, we propose a solution to these challenges using a Space Time Correspondence Network (STCN). STCN is a memory network that performs binary segmentation and minimizes the effects of class imbalance. The use of a memory bank in STCN allows for the utilization of past image and segmentation information, thereby ensuring consistency of the masks. Our experiments using the publicly available JIGSAWS dataset demonstrate that STCN achieves superior segmentation performance for objects that are difficult to segment, such as needle and thread, and improves context inference compared to the state-of-the-art. We also demonstrate that segmentation and context inference can be performed at runtime without compromising performance.

READ FULL TEXT

page 1

page 3

page 6

research
02/28/2023

Towards Surgical Context Inference and Translation to Gestures

Manual labeling of gestures in robot-assisted surgery is labor intensive...
research
02/07/2020

Temporal Segmentation of Surgical Sub-tasks through Deep Learning with Multiple Data Sources

Many tasks in robot-assisted surgeries (RAS) can be represented by finit...
research
08/22/2023

SAMSNeRF: Segment Anything Model (SAM) Guides Dynamic Surgical Scene Reconstruction by Neural Radiance Field (NeRF)

The accurate reconstruction of surgical scenes from surgical videos is c...
research
05/21/2019

RASNet: Segmentation for Tracking Surgical Instruments in Surgical Videos Using Refined Attention Segmentation Network

Segmentation for tracking surgical instruments plays an important role i...
research
09/01/2020

Aggregating Long-Term Context for Learning Surgical Workflows

Analyzing surgical workflow is crucial for computers to understand surge...
research
06/27/2019

CaDIS: Cataract Dataset for Image Segmentation

Video signals provide a wealth of information about surgical procedures ...
research
03/01/2022

Runtime Detection of Executional Errors in Robot-Assisted Surgery

Despite significant developments in the design of surgical robots and au...

Please sign up or login with your details

Forgot password? Click here to reset