Lesion Harvester: Iteratively Mining Unlabeled Lesions and Hard-Negative Examples at Scale

01/21/2020
by   Jinzheng Cai, et al.
3

Acquiring large-scale medical image data, necessary for training machine learning algorithms, is frequently intractable, due to prohibitive expert-driven annotation costs. Recent datasets extracted from hospital archives, e.g., DeepLesion, have begun to address this problem. However, these are often incompletely or noisily labeled, e.g., DeepLesion leaves over 50 its lesions unlabeled. Thus, effective methods to harvest missing annotations are critical for continued progress in medical image analysis. This is the goal of our work, where we develop a powerful system to harvest missing lesions from the DeepLesion dataset at high precision. Accepting the need for some degree of expert labor to achieve high fidelity, we exploit a small fully-labeled subset of medical image volumes and use it to intelligently mine annotations from the remainder. To do this, we chain together a highly sensitive lesion proposal generator and a very selective lesion proposal classifier. While our framework is generic, we optimize our performance by proposing a 3D contextual lesion proposal generator and by using a multi-view multi-scale lesion proposal classifier. These produce harvested and hard-negative proposals, which we then re-use to finetune our proposal generator by using a novel hard negative suppression loss, continuing this process until no extra lesions are found. Extensive experimental analysis demonstrates that our method can harvest an additional 9,805 lesions while keeping precision above 90 benefits of our approach, we show that lesion detectors trained on our harvested lesions can significantly outperform the same variants only trained on the original annotations, with boost of average precision of 7 open source our code and annotations at https://github.com/JimmyCai91/DeepLesionAnnotation.

READ FULL TEXT

page 1

page 9

research
03/27/2023

An End-to-End Framework For Universal Lesion Detection With Missing Annotations

Fully annotated large-scale medical image datasets are highly valuable. ...
research
09/05/2020

Learning from Multiple Datasets with Heterogeneous and Partial Labels for Universal Lesion Detection in CT

Large-scale datasets with high-quality labels are desired for training a...
research
01/26/2020

Brain Metastasis Segmentation Network Trained with Robustness to Annotations with Multiple False Negatives

Deep learning has proven to be an essential tool for medical image analy...
research
05/28/2020

Universal Lesion Detection by Learning from Multiple Heterogeneously Labeled Datasets

Lesion detection is an important problem within medical imaging analysis...
research
03/04/2019

Fine-grained lesion annotation in CT images with knowledge mined from radiology reports

In radiologists' routine work, one major task is to read a medical image...
research
10/22/2016

Deep image mining for diabetic retinopathy screening

Deep learning is quickly becoming the leading methodology for medical im...
research
06/21/2018

Crowd disagreement of medical images is informative

Classifiers for medical image analysis are often trained with a single c...

Please sign up or login with your details

Forgot password? Click here to reset