Enhancing Clinical Predictive Modeling through Model Complexity-Driven Class Proportion Tuning for Class Imbalanced Data: An Empirical Study on Opioid Overdose Prediction

05/09/2023
by Yinan Liu, et al.

Class imbalance is widespread in medical data and severely degrades the performance of clinical predictive models. Most techniques that alleviate the problem rebalance the class proportions, and they predominantly assume that the rebalanced proportions should be a function of the original data alone, oblivious to the model being used. This work challenges that prevailing assumption and proposes a theoretical framework that links the optimal class proportions to model complexity, so that the class proportions are tuned per model. Our experiments on the opioid overdose prediction problem highlight the performance gain from tuning class proportions. Rigorous regression analysis also confirms the advantages of the proposed theoretical framework and shows a statistically significant correlation between the hyperparameters controlling model complexity and the optimal class proportions.
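To make the idea of tuning class proportions per model concrete, the following is a minimal sketch, not the authors' implementation. It assumes binary labels (1 for the minority class), uses random undersampling of the majority class as a stand-in rebalancing technique (the abstract does not say which one the paper uses), and treats the minority proportion as one more hyperparameter selected on a validation set. The names `rebalance`, `tune_proportion`, `fit`, and `score` are hypothetical and introduced only for illustration.

```python
import random


def rebalance(X, y, minority_proportion, seed=0):
    """Undersample the majority class (label 0) so that the minority class
    (label 1) makes up roughly `minority_proportion` of the returned sample.
    Assumption: all minority examples are kept; only negatives are dropped."""
    if not 0.0 < minority_proportion < 1.0:
        raise ValueError("minority_proportion must lie strictly between 0 and 1")
    rng = random.Random(seed)
    pos = [i for i, label in enumerate(y) if label == 1]
    neg = [i for i, label in enumerate(y) if label == 0]
    # Solve len(pos) / (len(pos) + n_neg) = minority_proportion for n_neg.
    n_neg = round(len(pos) * (1.0 - minority_proportion) / minority_proportion)
    n_neg = min(len(neg), n_neg)  # cannot keep more negatives than exist
    keep = pos + rng.sample(neg, n_neg)
    rng.shuffle(keep)
    return [X[i] for i in keep], [y[i] for i in keep]


def tune_proportion(X_train, y_train, X_val, y_val, fit, score, proportions):
    """Return (best_proportion, best_score) for one fixed model configuration.
    `fit(X, y)` returns a trained model; `score(model, X, y)` returns a number
    where higher is better. Both are supplied by the caller (hypothetical)."""
    best = None
    for p in proportions:
        X_res, y_res = rebalance(X_train, y_train, p)
        model = fit(X_res, y_res)
        s = score(model, X_val, y_val)  # validation data keeps its original proportions
        if best is None or s > best[1]:
            best = (p, s)
    return best
```

Calling `tune_proportion` once per model configuration (for example, once per tree depth) lets the selected proportion differ between simple and complex models, which is the behavior the abstract argues for. The validation set is deliberately left unresampled so that scores reflect the original class distribution.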

