Density-Aware Personalized Training for Risk Prediction in Imbalanced Medical Data

07/23/2022
by   Zepeng Huo, et al.
0

Medical events of interest, such as mortality, often happen at a low rate in electronic medical records, as most admitted patients survive. Training models with this imbalance rate (class density discrepancy) may lead to suboptimal prediction. Traditionally this problem is addressed through ad-hoc methods such as resampling or reweighting but performance in many cases is still limited. We propose a framework for training models for this imbalance issue: 1) we first decouple the feature extraction and classification process, adjusting training batches separately for each component to mitigate bias caused by class density discrepancy; 2) we train the network with both a density-aware loss and a learnable cost matrix for misclassifications. We demonstrate our model's improved performance in real-world medical datasets (TOPCAT and MIMIC-III) to show improved AUC-ROC, AUC-PRC, Brier Skill Score compared with the baselines in the domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2021

AutoScore-Imbalance: An interpretable machine learning tool for development of clinical scores with rare events data

Background: Medical decision-making impacts both individual and public h...
research
06/20/2022

A Comparative Study on Application of Class-Imbalance Learning for Severity Prediction of Adverse Events Following Immunization

In collaboration with the Liaoning CDC, China, we propose a prediction s...
research
07/28/2021

Learning with Multiclass AUC: Theory and Algorithms

The Area under the ROC curve (AUC) is a well-known ranking metric for pr...
research
07/22/2023

DHC: Dual-debiased Heterogeneous Co-training Framework for Class-imbalanced Semi-supervised Medical Image Segmentation

The volume-wise labeling of 3D medical images is expertise-demanded and ...
research
11/30/2017

Highrisk Prediction from Electronic Medical Records via Deep Attention Networks

Predicting highrisk vascular diseases is a significant issue in the medi...
research
09/17/2022

AdaCC: Cumulative Cost-Sensitive Boosting for Imbalanced Classification

Class imbalance poses a major challenge for machine learning as most sup...
research
06/24/2020

Bayesian Sampling Bias Correction: Training with the Right Loss Function

We derive a family of loss functions to train models in the presence of ...

Please sign up or login with your details

Forgot password? Click here to reset