Deep Homography Estimation in Dynamic Surgical Scenes for Laparoscopic Camera Motion Extraction

09/30/2021
by   Martin Huber, et al.
10

Current laparoscopic camera motion automation relies on rule-based approaches or only focuses on surgical tools. Imitation Learning (IL) methods could alleviate these shortcomings, but have so far been applied to oversimplified setups. Instead of extracting actions from oversimplified setups, in this work we introduce a method that allows to extract a laparoscope holder's actions from videos of laparoscopic interventions. We synthetically add camera motion to a newly acquired dataset of camera motion free da Vinci surgery image sequences through the introduction of a novel homography generation algorithm. The synthetic camera motion serves as a supervisory signal for camera motion estimation that is invariant to object and tool motion. We perform an extensive evaluation of state-of-the-art (SOTA) Deep Neural Networks (DNNs) across multiple compute regimes, finding our method transfers from our camera motion free da Vinci surgery dataset to videos of laparoscopic interventions, outperforming classical homography estimation approaches in both, precision by 41

READ FULL TEXT

page 4

page 5

page 6

page 10

research
02/06/2021

A surgical dataset from the da Vinci Research Kit for task automation and recognition

The use of datasets is getting more relevance in surgical robotics since...
research
08/22/2023

WS-SfMLearner: Self-supervised Monocular Depth and Ego-motion Estimation on Surgical Videos with Unknown Camera Parameters

Depth estimation in surgical video plays a crucial role in many image-gu...
research
11/25/2016

Deep Video Deblurring

Motion blur from camera shake is a major problem in videos captured by h...
research
03/21/2019

Quotienting Impertinent Camera Kinematics for 3D Video Stabilization

With the recent advent of methods that allow for real-time computation, ...
research
03/19/2020

3D Ego-Pose Estimation via Imitation Learning

Ego-pose estimation, i.e., estimating a person's 3D pose with a single w...
research
04/21/2020

A Deep Learning Approach for Motion Forecasting Using 4D OCT Data

Forecasting motion of a specific target object is a common problem for s...
research
04/03/2022

Neural Global Shutter: Learn to Restore Video from a Rolling Shutter Camera with Global Reset Feature

Most computer vision systems assume distortion-free images as inputs. Th...

Please sign up or login with your details

Forgot password? Click here to reset