A Huber loss-based super learner with applications to healthcare expenditures

05/13/2022
by   Ziyue Wu, et al.
0

Complex distributions of the healthcare expenditure pose challenges to statistical modeling via a single model. Super learning, an ensemble method that combines a range of candidate models, is a promising alternative for cost estimation and has shown benefits over a single model. However, standard approaches to super learning may have poor performance in settings where extreme values are present, such as healthcare expenditure data. We propose a super learner based on the Huber loss, a "robust" loss function that combines squared error loss with absolute loss to down-weight the influence of outliers. We derive oracle inequalities that establish bounds on the finite-sample and asymptotic performance of the method. We show that the proposed method can be used both directly to optimize Huber risk, as well as in finite-sample settings where optimizing mean squared error is the ultimate goal. For this latter scenario, we provide two methods for performing a grid search for values of the robustification parameter indexing the Huber loss. Simulations and real data analysis demonstrate appreciable finite-sample gains in cost prediction and causal effect estimation using our proposed method.

READ FULL TEXT
research
10/27/2019

An outlier-robust model averaging approach by Mallows-type criterion

Model averaging is an alternative to model selection for dealing with mo...
research
03/07/2017

Propensity score prediction for electronic healthcare databases using Super Learner and High-dimensional Propensity Score Methods

The optimal learner for prediction modeling varies depending on the unde...
research
12/30/2020

Adversarial Estimation of Riesz Representers

We provide an adversarial approach to estimating Riesz representers of l...
research
12/29/2022

Gaussian Heteroskedastic Empirical Bayes without Independence

In this note, we propose empirical Bayes methods under heteroskedastic G...
research
06/30/2021

Do we need to estimate the variance in robust mean estimation?

This paper studies robust mean estimators for distributions with only fi...
research
06/22/2022

Diagnostic Tool for Out-of-Sample Model Evaluation

Assessment of model fitness is an important step in many problems. Model...
research
08/21/2023

Simulation Experiments as a Causal Problem

Simulation methods are among the most ubiquitous methodological tools in...

Please sign up or login with your details

Forgot password? Click here to reset