Colonoscopy Landmark Detection using Vision Transformers

09/22/2022
by   Aniruddha Tamhane, et al.
0

Colonoscopy is a routine outpatient procedure used to examine the colon and rectum for any abnormalities including polyps, diverticula and narrowing of colon structures. A significant amount of the clinician's time is spent in post-processing snapshots taken during the colonoscopy procedure, for maintaining medical records or further investigation. Automating this step can save time and improve the efficiency of the process. In our work, we have collected a dataset of 120 colonoscopy videos and 2416 snapshots taken during the procedure, that have been annotated by experts. Further, we have developed a novel, vision-transformer based landmark detection algorithm that identifies key anatomical landmarks (the appendiceal orifice, ileocecal valve/cecum landmark and rectum retroflexion) from snapshots taken during colonoscopy. Our algorithm uses an adaptive gamma correction during preprocessing to maintain a consistent brightness for all images. We then use a vision transformer as the feature extraction backbone and a fully connected network based classifier head to categorize a given frame into four classes: the three landmarks or a non-landmark frame. We compare the vision transformer (ViT-B/16) backbone with ResNet-101 and ConvNext-B backbones that have been trained similarly. We report an accuracy of 82 snapshots.

READ FULL TEXT

page 5

page 6

research
03/12/2022

DATR: Domain-adaptive transformer for multi-domain landmark detection

Accurate anatomical landmark detection plays an increasingly vital role ...
research
04/26/2022

U-Net with ResNet Backbone for Garment Landmarking Purpose

We build a heatmap-based landmark detection model to locate important la...
research
09/21/2021

LOTR: Face Landmark Localization Using Localization Transformer

This paper presents a novel Transformer-based facial landmark localizati...
research
02/15/2023

'Aariz: A Benchmark Dataset for Automatic Cephalometric Landmark Detection and CVM Stage Classification

The accurate identification and precise localization of cephalometric la...
research
10/22/2021

Feasibility of Remote Landmark Identification for Cricothyrotomy Using Robotic Palpation

Cricothyrotomy is a life-saving emergency intervention that secures an a...
research
09/19/2023

LineMarkNet: Line Landmark Detection for Valet Parking

We aim for accurate and efficient line landmark detection for valet park...
research
03/18/2023

Uncertainty-aware U-Net for Medical Landmark Detection

Heatmap-based methods play an important role in anatomical landmark dete...

Please sign up or login with your details

Forgot password? Click here to reset