Towards Holistic Surgical Scene Understanding

12/08/2022
by Natalia Valderrama, et al.

Most benchmarks for studying surgical interventions focus on a specific challenge instead of leveraging the intrinsic complementarity among different tasks. In this work, we present a new experimental framework towards holistic surgical scene understanding. First, we introduce the Phase, Step, Instrument, and Atomic Visual Action recognition (PSI-AVA) Dataset. PSI-AVA includes annotations for both long-term (Phase and Step recognition) and short-term reasoning (Instrument detection and novel Atomic Action recognition) in robot-assisted radical prostatectomy videos. Second, we present Transformers for Action, Phase, Instrument, and steps Recognition (TAPIR) as a strong baseline for surgical scene understanding. TAPIR leverages our dataset's multi-level annotations as it benefits from the learned representation on the instrument detection task to improve its classification capacity. Our experimental results in both PSI-AVA and other publicly available databases demonstrate the adequacy of our framework to spur future research on holistic surgical scene understanding.


