Activity Detection in Long Surgical Videos using Spatio-Temporal Models

05/05/2022
by Aidean Sharghi, et al.

Automatic activity detection is an important component in developing technologies for next-generation surgical devices and workflow monitoring systems. In many applications, the videos of interest are long and include several activities; hence, the deep models designed for such purposes consist of a backbone and a temporal sequence modeling architecture. In this paper, we investigate state-of-the-art activity recognition models and temporal models to find the architectures that yield the highest performance. We first benchmark these models on a large-scale activity recognition dataset in the operating room with over 800 full-length surgical videos. However, since most other medical applications lack such a large dataset, we further evaluate our models on the Cholec80 surgical phase segmentation dataset, consisting of only 40 training videos. For backbone architectures, we investigate both 3D ConvNets and the most recent transformer-based models; for temporal modeling, we include temporal ConvNets, RNNs, and transformer models for a comprehensive study. We show that even with limited labeled data, we can outperform existing work by benefiting from models pre-trained on other tasks.
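
The two-stage design described in the abstract (a backbone that encodes short clips, followed by a temporal model that reasons over the whole sequence of clip features) can be illustrated with a minimal sketch. Everything here is a hypothetical stand-in rather than the authors' implementation: `toy_backbone` averages frame vectors in place of a 3D ConvNet or video transformer, and `toy_temporal_model` applies a moving average followed by an argmax in place of a temporal ConvNet, RNN, or transformer. Frames are assumed to be plain lists of per-class scores purely to keep the example self-contained.

```python
from typing import Callable, List, Sequence

Vector = List[float]  # hypothetical stand-in for a frame or a feature vector


def split_into_clips(video: Sequence[Vector], clip_len: int) -> List[List[Vector]]:
    """Cut a long video into consecutive, non-overlapping clips."""
    return [list(video[i:i + clip_len]) for i in range(0, len(video), clip_len)]


def toy_backbone(clip: List[Vector]) -> Vector:
    """Stand-in for a clip encoder: average the frame vectors of one clip."""
    dim = len(clip[0])
    return [sum(frame[d] for frame in clip) / len(clip) for d in range(dim)]


def toy_temporal_model(features: List[Vector], window: int = 3) -> List[int]:
    """Stand-in for a sequence model: smooth features over neighbouring
    clips with a moving average, then pick the highest-scoring class."""
    half = window // 2
    labels = []
    for t in range(len(features)):
        lo, hi = max(0, t - half), min(len(features), t + half + 1)
        neighbours = features[lo:hi]
        dim = len(features[t])
        smoothed = [sum(f[d] for f in neighbours) / len(neighbours) for d in range(dim)]
        labels.append(max(range(dim), key=lambda d: smoothed[d]))
    return labels


def detect_activities(
    video: Sequence[Vector],
    clip_len: int,
    backbone: Callable[[List[Vector]], Vector],
    temporal_model: Callable[[List[Vector]], List[int]],
) -> List[int]:
    """Stage 1: encode each clip independently. Stage 2: label the clip
    sequence using temporal context. Returns one label per clip."""
    clips = split_into_clips(video, clip_len)
    features = [backbone(clip) for clip in clips]
    return temporal_model(features)


# Eight two-class "frames": activity 0 for the first half, activity 1 after.
video = [[1.0, 0.0]] * 4 + [[0.0, 1.0]] * 4
labels = detect_activities(video, 2, toy_backbone, toy_temporal_model)
```

Because the two stages only communicate through a list of per-clip feature vectors, either stand-in can be swapped for a different model without touching the other, which is the property that makes benchmarking backbone and temporal-model combinations separately possible.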

research
06/29/2020

Automatic Operating Room Surgical Activity Recognition for Robot-Assisted Surgery

Automatic recognition of surgical activities in the operating room (OR) ...
research
05/19/2023

SurgMAE: Masked Autoencoders for Long Surgical Video Analysis

There has been a growing interest in using deep learning models for proc...
research
07/07/2022

Adaptation of Surgical Activity Recognition Models Across Operating Rooms

Automatic surgical activity recognition enables more intelligent surgica...
research
03/09/2022

Using Human Gaze For Surgical Activity Recognition

Automatically recognizing surgical activities plays an important role in...
research
02/21/2023

Weakly Supervised Temporal Convolutional Networks for Fine-grained Surgical Activity Recognition

Automatic recognition of fine-grained surgical activities, called steps,...
research
09/02/2022

ARST: Auto-Regressive Surgical Transformer for Phase Recognition from Laparoscopic Videos

Phase recognition plays an essential role for surgical workflow analysis...
research
01/11/2020

Towards Generalizable Surgical Activity Recognition Using Spatial Temporal Graph Convolutional Networks

Modeling and recognition of surgical activities poses an interesting res...
