Who wants accurate models? Arguing for a different metrics to take classification models seriously

10/21/2019
by   Federico Cabitza, et al.
29

With the increasing availability of AI-based decision support, there is an increasing need for their certification by both AI manufacturers and notified bodies, as well as the pragmatic (real-world) validation of these systems. Therefore, there is the need for meaningful and informative ways to assess the performance of AI systems in clinical practice. Common metrics (like accuracy scores and areas under the ROC curve) have known problems and they do not take into account important information about the preferences of clinicians and the needs of their specialist practice, like the likelihood and impact of errors and the complexity of cases. In this paper, we present a new accuracy measure, the H-accuracy (Ha), which we claim is more informative in the medical domain (and others of similar needs) for the elements it encompasses. We also provide proof that the H-accuracy is a generalization of the balanced accuracy and establish a relation between the H-accuracy and the Net Benefit. Finally, we illustrate an experimentation in two user studies to show the descriptive power of the Ha score and how complementary and differently informative measures can be derived from its formulation (a Python script to compute Ha is also made available).

READ FULL TEXT

page 9

page 16

page 21

page 24

research
09/09/2021

OpenClinicalAI: enabling AI to diagnose diseases in real-world clinical settings

This paper quantitatively reveals the state-of-the-art and state-of-the-...
research
02/01/2021

Designing AI for Trust and Collaboration in Time-Constrained Medical Decisions: A Sociotechnical Lens

Major depressive disorder is a debilitating disease affecting 264 millio...
research
07/06/2021

Leveraging Clinical Context for User-Centered Explainability: A Diabetes Use Case

Academic advances of AI models in high-precision domains, like healthcar...
research
12/11/2019

Founding The Domain of AI Forensics

With the widespread integration of AI in everyday and critical technolog...
research
03/07/2023

"If I Had All the Time in the World": Ophthalmologists' Perceptions of Anchoring Bias Mitigation in Clinical AI Support

Clinical needs and technological advances have resulted in increased use...
research
08/10/2019

Deep ensemble network with explicit complementary model for accuracy-balanced classification

The average accuracy is one of major evaluation metrics for classificati...
research
05/25/2022

Towards Green AI with tensor networks – Sustainability and innovation enabled by efficient algorithms

The current standard to compare the performance of AI algorithms is main...

Please sign up or login with your details

Forgot password? Click here to reset