Differences between human and machine perception in medical diagnosis

11/28/2020
by   Taro Makino, et al.
38

Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since their performance can be severely degraded by dataset shifts to which human perception remains invariant. If we can better understand the differences between human and machine perception, we can potentially characterize and mitigate this effect. We therefore propose a framework for comparing human and machine perception in medical diagnosis. The two are compared with respect to their sensitivity to the removal of clinically meaningful information, and to the regions of an image deemed most suspicious. Drawing inspiration from the natural image domain, we frame both comparisons in terms of perturbation robustness. The novelty of our framework is that separate analyses are performed for subgroups with clinically meaningful differences. We argue that this is necessary in order to avert Simpson's paradox and draw correct conclusions. We demonstrate our framework with a case study in breast cancer screening, and reveal significant differences between radiologists and DNNs. We compare the two with respect to their robustness to Gaussian low-pass filtering, performing a subgroup analysis on microcalcifications and soft tissue lesions. For microcalcifications, DNNs use a separate set of high frequency components than radiologists, some of which lie outside the image regions considered most suspicious by radiologists. These features run the risk of being spurious, but if not, could represent potential new biomarkers. For soft tissue lesions, the divergence between radiologists and DNNs is even starker, with DNNs relying heavily on spurious high frequency components ignored by radiologists. Importantly, this deviation in soft tissue lesions was only observable through subgroup analysis, which highlights the importance of incorporating medical domain knowledge into our comparison framework.

READ FULL TEXT

page 3

page 4

page 18

research
03/23/2020

Understanding the robustness of deep neural network classifiers for breast cancer screening

Deep neural networks (DNNs) show promise in breast cancer screening, but...
research
09/19/2020

Reducing false-positive biopsies with deep neural networks that utilize local and global information in screening mammograms

Breast cancer is the most common cancer in women, and hundreds of thousa...
research
12/26/2018

A Whole Slide Image Grading Benchmark and Tissue Classification for Cervical Cancer Precursor Lesions with Inter-Observer Variability

The cervical cancer developing from the precancerous lesions caused by t...
research
10/05/2018

Medical Images Analysis in Cancer Diagnostic

This paper shows results of computer analysis of images in the purpose o...
research
09/25/2018

MPRAD: A Multiparametric Radiomics Framework

Multiparametric radiological imaging is vital for detection, characteriz...
research
12/07/2020

Sparse Fooling Images: Fooling Machine Perception through Unrecognizable Images

In recent years, deep neural networks (DNNs) have achieved equivalent or...
research
03/31/2017

Intraoperative margin assessment of human breast tissue in optical coherence tomography images using deep neural networks

Objective: In this work, we perform margin assessment of human breast ti...

Please sign up or login with your details

Forgot password? Click here to reset