Using massive health insurance claims data to predict very high-cost claimants: a machine learning approach

12/30/2019
by   José M. Maisog, et al.
0

Due to escalating healthcare costs, accurately predicting which patients will incur high costs is an important task for payers and providers of healthcare. High-cost claimants (HiCCs) are patients who have annual costs above $250,000 and who represent just 0.16 currently account for 9 develop a high-performance algorithm to predict HiCCs to inform a novel care management system. Using health insurance claims from 48 million people and augmented with census data, we applied machine learning to train binary classification models to calculate the personal risk of HiCC. To train the models, we developed a platform starting with 6,006 variables across all clinical and demographic dimensions and constructed over one hundred candidate models. The best model achieved an area under the receiver operating characteristic curve of 91.2 performance (84 high-cost status (89 lack pharmacy claims data (88 curve of 23.1 program enrolling 500 people with the highest HiCC risk is expected to treat 199 true HiCCs and generate a net savings of $7.3 million per year. Our results demonstrate that high-performing predictive models can be constructed using claims data and publicly available data alone, even for rare high-cost claimants exceeding $250,000. Our model demonstrates the transformational power of machine learning and artificial intelligence in care management, which would allow healthcare payers and providers to introduce the next generation of care management programs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/12/2016

Identifying Diabetic Patients with High Risk of Readmission

Hospital readmissions are expensive and reflect the inadequacies in heal...
research
10/06/2018

Artificial Intelligence for Diabetes Case Management: The Intersection of Physical and Mental Health

Diabetes is a major public health problem in the United States, affectin...
research
11/05/2022

Unsupervised Machine Learning for Explainable Medicare Fraud Detection

The US federal government spends more than a trillion dollars per year o...
research
01/28/2023

Predicting Visit Cost of Obstructive Sleep Apnea using Electronic Healthcare Records with Transformer

Background: Obstructive sleep apnea (OSA) is growing increasingly preval...
research
10/18/2017

Can Machine Learning Create an Advocate for Foster Youth?

Statistics are bleak for youth aging out of the United States foster car...
research
08/30/2018

A Self-Attention Network for Hierarchical Data Structures with an Application to Claims Management

Insurance companies must manage millions of claims per year. While most ...
research
02/03/2022

Who will Leave a Pediatric Weight Management Program and When? – A machine learning approach for predicting attrition patterns

Childhood obesity is a major public health concern. Multidisciplinary pe...

Please sign up or login with your details

Forgot password? Click here to reset