Surgical Skill Assessment via Video Semantic Aggregation

08/04/2022
by   Zhenqiang Li, et al.
8

Automated video-based assessment of surgical skills is a promising task in assisting young surgical trainees, especially in poor-resource areas. Existing works often resort to a CNN-LSTM joint framework that models long-term relationships by LSTMs on spatially pooled short-term CNN features. However, this practice would inevitably neglect the difference among semantic concepts such as tools, tissues, and background in the spatial dimension, impeding the subsequent temporal relationship modeling. In this paper, we propose a novel skill assessment framework, Video Semantic Aggregation (ViSA), which discovers different semantic parts and aggregates them across spatiotemporal dimensions. The explicit discovery of semantic parts provides an explanatory visualization that helps understand the neural network's decisions. It also enables us to further incorporate auxiliary information such as the kinematic data to improve representation learning and performance. The experiments on two datasets show the competitiveness of ViSA compared to state-of-the-art methods. Source code is available at: bit.ly/MICCAI2022ViSA.

READ FULL TEXT

page 7

page 8

page 13

page 14

page 15

research
06/02/2021

Towards Unified Surgical Skill Assessment

Surgical skills have a great influence on surgical safety and patients' ...
research
07/05/2022

Video-based Surgical Skills Assessment using Long term Tool Tracking

Mastering the technical skills required to perform surgery is an extreme...
research
03/03/2021

Deep Neural Networks for the Assessment of Surgical Skills: A Systematic Review

Surgical training in medical school residency programs has followed the ...
research
07/26/2019

Using 3D Convolutional Neural Networks to Learn Spatiotemporal Features for Automatic Surgical Gesture Recognition in Video

Automatically recognizing surgical gestures is a crucial step towards a ...
research
06/07/2018

Evaluating surgical skills from kinematic data using convolutional neural networks

The need for automatic surgical skills assessment is increasing, especia...
research
02/24/2017

Video and Accelerometer-Based Motion Analysis for Automated Surgical Skills Assessment

Purpose: Basic surgical skills of suturing and knot tying are an essenti...
research
01/09/2019

Manipulation-skill Assessment from Videos with Spatial Attention Network

Recent advances in computer vision have made it possible to automaticall...

Please sign up or login with your details

Forgot password? Click here to reset