PIKS: A Technique to Identify Actionable Trends for Policy-Makers Through Open Healthcare Data

04/05/2023
by   A. Ravishankar Rao, et al.
0

With calls for increasing transparency, governments are releasing greater amounts of data in multiple domains including finance, education and healthcare. The efficient exploratory analysis of healthcare data constitutes a significant challenge. Key concerns in public health include the quick identification and analysis of trends, and the detection of outliers. This allows policies to be rapidly adapted to changing circumstances. We present an efficient outlier detection technique, termed PIKS (Pruned iterative-k means searchlight), which combines an iterative k-means algorithm with a pruned searchlight based scan. We apply this technique to identify outliers in two publicly available healthcare datasets from the New York Statewide Planning and Research Cooperative System, and California's Office of Statewide Health Planning and Development. We provide a comparison of our technique with three other existing outlier detection techniques, consisting of auto-encoders, isolation forests and feature bagging. We identified outliers in conditions including suicide rates, immunity disorders, social admissions, cardiomyopathies, and pregnancy in the third trimester. We demonstrate that the PIKS technique produces results consistent with other techniques such as the auto-encoder. However, the auto-encoder needs to be trained, which requires several parameters to be tuned. In comparison, the PIKS technique has far fewer parameters to tune. This makes it advantageous for fast, "out-of-the-box" data exploration. The PIKS technique is scalable and can readily ingest new datasets. Hence, it can provide valuable, up-to-date insights to citizens, patients and policy-makers. We have made our code open source, and with the availability of open data, other researchers can easily reproduce and extend our work. This will help promote a deeper understanding of healthcare policies and public health issues.

READ FULL TEXT

page 2

page 5

page 9

page 17

research
04/05/2023

A system for exploring big data: an iterative k-means searchlight for outlier detection on open health data

The interactive exploration of large and evolving datasets is challengin...
research
10/30/2017

Hiding in plain sight: insights about health-care trends gained through open health data

The open data movement constitutes an approach to achieving accountabili...
research
04/05/2023

Building predictive models of healthcare costs with open healthcare data

Due to rapidly rising healthcare costs worldwide, there is significant i...
research
12/12/2017

Outlier Detection by Consistent Data Selection Method

Often the challenge associated with tasks like fraud and spam detection[...
research
03/18/2017

An Automated Auto-encoder Correlation-based Health-Monitoring and Prognostic Method for Machine Bearings

This paper studies an intelligent ultimate technique for health-monitori...
research
02/28/2021

An iterative technique to identify browser fingerprinting scripts

Browser fingerprinting is a stateless identification technique based on ...
research
05/19/2022

Identifying outliers in astronomical images with unsupervised machine learning

Astronomical outliers, such as unusual, rare or unknown types of astrono...

Please sign up or login with your details

Forgot password? Click here to reset