Performance Deterioration of Deep Learning Models after Clinical Deployment: A Case Study with Auto-segmentation for Definitive Prostate Cancer Radiotherapy

10/11/2022
by Biling Wang, et al.

In the past decade, deep learning (DL)-based artificial intelligence (AI) has witnessed unprecedented success and has led to much excitement in medicine. However, many successful models have not been implemented in the clinic, predominantly due to concerns regarding the lack of interpretability and generalizability in both spatial and temporal domains. In this work, we used a DL-based auto-segmentation model for patients with intact prostates to observe any temporal performance changes and then correlate them with possible explanatory variables. We retrospectively simulated the clinical implementation of our DL model to investigate temporal performance trends. Our cohort included 912 patients with prostate cancer treated with definitive radiotherapy from January 2006 to August 2021 at the University of Texas Southwestern Medical Center (UTSW). We trained a U-Net-based DL auto-segmentation model on the data collected before 2012 and tested it on data collected from 2012 to 2021 to simulate the clinical deployment of the trained model starting in 2012. We visualized the trends using a simple moving average curve and used ANOVA and t-tests to investigate the impact of various clinical factors. The prostate and rectum contour quality decreased rapidly after 2016-2017. Stereotactic body radiotherapy (SBRT) and hydrogel spacer use were significantly associated with prostate contour quality (p=5.6e-12 and 0.002, respectively). SBRT and physicians' styles were significantly associated with the rectum contour quality (p=0.0005 and 0.02, respectively). Only the presence of contrast within the bladder significantly affected the bladder contour quality (p=1.6e-7). We showed that DL model performance decreased over time in concordance with changes in clinical practice patterns and changes in clinical personnel.
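The evaluation design described above (train on pre-2012 data, test on later data, smooth the test scores with a simple moving average, then compare groups) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' code: the records, the use of a Dice-like score as the contour quality metric, the window size, the SBRT flag layout, and the helper names `simple_moving_average` and `welch_t_statistic` are all assumptions made for the example. It computes a Welch t statistic only; the abstract does not say which t-test variant was used, and no p-value is computed here.

```python
from math import sqrt
from statistics import mean, variance


def simple_moving_average(values, window):
    """Trailing simple moving average; one output per full window."""
    if window < 1 or window > len(values):
        raise ValueError("window must be between 1 and len(values)")
    return [mean(values[i - window:i]) for i in range(window, len(values) + 1)]


def welch_t_statistic(a, b):
    """Welch's t statistic for two independent samples (unequal variances)."""
    standard_error = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


# Hypothetical (year, contour quality score, treated with SBRT) records,
# invented purely for illustration; they are NOT data from the study.
records = [
    (2010, 0.90, False), (2011, 0.88, False),
    (2013, 0.89, False), (2014, 0.87, False), (2016, 0.86, False),
    (2017, 0.80, True), (2019, 0.78, True), (2020, 0.76, True),
]

DEPLOY_YEAR = 2012  # simulated deployment year, as stated in the abstract

# Temporal split: the model only ever sees pre-deployment data in training.
train = [r for r in records if r[0] < DEPLOY_YEAR]
test = sorted(r for r in records if r[0] >= DEPLOY_YEAR)

# Smooth the chronologically ordered test scores to expose the trend.
test_scores = [score for _, score, _ in test]
trend = simple_moving_average(test_scores, window=3)

# Compare score distributions between two groups of a clinical factor.
non_sbrt = [score for _, score, sbrt in test if not sbrt]
sbrt = [score for _, score, sbrt in test if sbrt]
t_stat = welch_t_statistic(non_sbrt, sbrt)

print("moving average:", [round(v, 4) for v in trend])
print("Welch t statistic (non-SBRT vs SBRT):", round(t_stat, 2))
```

With real data, a falling moving average after deployment would suggest drift, and a large t statistic for a given factor would flag that factor as a candidate explanation worth a formal test.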

research
07/28/2021

A Proof-of-Concept Study of Artificial Intelligence Assisted Contour Revision

Automatic segmentation of anatomical structures is critical for many med...
research
02/15/2021

PSA-Net: Deep Learning based Physician Style-Aware Segmentation Network for Post-Operative Prostate Cancer Clinical Target Volume

Automatic segmentation of medical images with DL algorithms has proven t...
research
01/06/2023

A CAD System for Colorectal Cancer from WSI: A Clinically Validated Interpretable ML-based Prototype

The integration of Artificial Intelligence (AI) and Digital Pathology ha...
research
11/01/2019

The reliability of a deep learning model in clinical out-of-distribution MRI data: a multicohort study

Deep learning (DL) methods have in recent years yielded impressive resul...
research
03/03/2023

Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT

Artificial intelligence-based methods have generated substantial interes...
research
02/28/2022

Quality Monitoring and Assessment of Deployed Deep Learning Models for Network AIOps

Artificial Intelligence (AI) has recently attracted a lot of attention, ...
