Learning-Based Depth and Pose Estimation for Monocular Endoscope with Loss Generalization

07/28/2021
by   Aji Resindra Widya, et al.
12

Gastroendoscopy has been a clinical standard for diagnosing and treating conditions that affect a part of a patient's digestive system, such as the stomach. Despite the fact that gastroendoscopy has a lot of advantages for patients, there exist some challenges for practitioners, such as the lack of 3D perception, including the depth and the endoscope pose information. Such challenges make navigating the endoscope and localizing any found lesion in a digestive tract difficult. To tackle these problems, deep learning-based approaches have been proposed to provide monocular gastroendoscopy with additional yet important depth and pose information. In this paper, we propose a novel supervised approach to train depth and pose estimation networks using consecutive endoscopy images to assist the endoscope navigation in the stomach. We firstly generate real depth and pose training data using our previously proposed whole stomach 3D reconstruction pipeline to avoid poor generalization ability between computer-generated (CG) models and real data for the stomach. In addition, we propose a novel generalized photometric loss function to avoid the complicated process of finding proper weights for balancing the depth and the pose loss terms, which is required for existing direct depth and pose supervision approaches. We then experimentally show that our proposed generalized loss performs better than existing direct supervision losses.

READ FULL TEXT

page 1

page 2

page 4

page 5

research
11/22/2022

Weakly-supervised Pre-training for 3D Human Pose Estimation via Perspective Knowledge

Modern deep learning-based 3D pose estimation approaches require plenty ...
research
09/21/2018

Adversarial 3D Human Pose Estimation via Multimodal Depth Supervision

In this paper, a novel deep-learning based framework is proposed to infe...
research
11/20/2019

Unsupervised Monocular Depth Prediction for Indoor Continuous Video Streams

This paper studies unsupervised monocular depth prediction problem. Most...
research
09/18/2017

Direct Pose Estimation with a Monocular Camera

We present a direct method to calculate a 6DoF pose change of a monocula...
research
08/18/2019

Distill Knowledge from NRSfM for Weakly Supervised 3D Pose Learning

We propose to learn a 3D pose estimator by distilling knowledge from Non...
research
07/06/2020

Generative Model-Based Loss to the Rescue: A Method to Overcome Annotation Errors for Depth-Based Hand Pose Estimation

We propose to use a model-based generative loss for training hand pose e...
research
12/20/2018

SfMLearner++: Learning Monocular Depth & Ego-Motion using Meaningful Geometric Constraints

Most geometric approaches to monocular Visual Odometry (VO) provide robu...

Please sign up or login with your details

Forgot password? Click here to reset