Exploiting Segment-level Semantics for Online Phase Recognition from Surgical Videos

11/22/2021
by   Xinpeng Ding, et al.
6

Automatic surgical phase recognition plays an important role in robot-assisted surgeries. Existing methods ignored a pivotal problem that surgical phases should be classified by learning segment-level semantics instead of solely relying on frame-wise information. In this paper, we present a segment-attentive hierarchical consistency network (SAHC) for surgical phase recognition from videos. The key idea is to extract hierarchical high-level semantic-consistent segments and use them to refine the erroneous predictions caused by ambiguous frames. To achieve it, we design a temporal hierarchical network to generate hierarchical high-level segments. Then, we introduce a hierarchical segment-frame attention (SFA) module to capture relations between the low-level frames and high-level segments. By regularizing the predictions of frames and their corresponding segments via a consistency loss, the network can generate semantic-consistent segments and then rectify the misclassified predictions caused by ambiguous low-level frames. We validate SAHC on two public surgical video datasets, i.e., the M2CAI16 challenge dataset and the Cholec80 dataset. Experimental results show that our method outperforms previous state-of-the-arts by a large margin, notably reaches 4.1 on M2CAI16. Code will be released at GitHub upon acceptance.

READ FULL TEXT

page 1

page 3

page 6

page 7

research
06/15/2023

SF-TMN: SlowFast Temporal Modeling Network for Surgical Phase Recognition

Automatic surgical phase recognition is one of the key technologies to s...
research
07/07/2022

What Makes for Automatic Reconstruction of Pulmonary Segments

3D reconstruction of pulmonary segments plays an important role in surgi...
research
02/16/2022

Less is More: Surgical Phase Recognition from Timestamp Supervision

Surgical phase recognition is a fundamental task in computer-assisted su...
research
04/29/2021

Relevance Detection in Cataract Surgery Videos by Spatio-Temporal Action Localization

In cataract surgery, the operation is performed with the help of a micro...
research
03/05/2021

OperA: Attention-Regularized Transformers for Surgical Phase Recognition

In this paper we introduce OperA, a transformer-based model that accurat...
research
09/02/2022

ARST: Auto-Regressive Surgical Transformer for Phase Recognition from Laparoscopic Videos

Phase recognition plays an essential role for surgical workflow analysis...
research
06/21/2018

Deep Reinforcement Learning for Surgical Gesture Segmentation and Classification

Recognition of surgical gesture is crucial for surgical skill assessment...

Please sign up or login with your details

Forgot password? Click here to reset