Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos

12/31/2020
by   Zhi-Qin Zhan, et al.
11

Automatic colorectal polyp detection in colonoscopy video is a fundamental task, which has received a lot of attention. Manually annotating polyp region in a large scale video dataset is time-consuming and expensive, which limits the development of deep learning techniques. A compromise is to train the target model by using labeled images and infer on colonoscopy videos. However, there are several issues between the image-based training and video-based inference, including domain differences, lack of positive samples, and temporal smoothness. To address these issues, we propose an Image-video-joint polyp detection network (Ivy-Net) to address the domain gap between colonoscopy images from historical medical reports and real-time videos. In our Ivy-Net, a modified mixup is utilized to generate training data by combining the positive images and negative video frames at the pixel level, which could learn the domain adaptive representations and augment the positive samples. Simultaneously, a temporal coherence regularization (TCR) is proposed to introduce the smooth constraint on feature-level in adjacent frames and improve polyp detection by unlabeled colonoscopy videos. For evaluation, a new large colonoscopy polyp dataset is collected, which contains 3056 images from historical medical reports of 889 positive patients and 7.5-hour videos of 69 patients (28 positive). The experiments on the collected dataset demonstrate that our Ivy-Net achieves the state-of-the-art result on colonoscopy video.

READ FULL TEXT
research
03/30/2022

CycDA: Unsupervised Cycle Domain Adaptation from Image to Video

Although action recognition has achieved impressive results over recent ...
research
01/30/2021

DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation

Together with the recent advances in semantic segmentation, many domain ...
research
08/07/2017

Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos

Despite rapid advances in face recognition, there remains a clear gap be...
research
11/07/2022

Facial Tic Detection in Untrimmed Videos of Tourette Syndrome Patients

Tourette Syndrome (TS) is a behavior disorder that onsets in childhood a...
research
09/22/2022

Deep Domain Adaptation for Detecting Bomb Craters in Aerial Images

The aftermath of air raids can still be seen for decades after the devas...
research
10/07/2015

Diverse Large-Scale ITS Dataset Created from Continuous Learning for Real-Time Vehicle Detection

In traffic engineering, vehicle detectors are trained on limited dataset...
research
03/06/2020

Meta-SVDD: Probabilistic Meta-Learning for One-Class Classification in Cancer Histology Images

To train a robust deep learning model, one usually needs a balanced set ...

Please sign up or login with your details

Forgot password? Click here to reset