A Bayesian Multi-Layered Record Linkage Procedure to Analyze Functional Status of Medicare Patients with Traumatic Brain Injury

05/18/2020
by   Mingyang Shan, et al.
0

Understanding the association between injury severity and patients' potential for recovery is crucial to providing better care for patients with traumatic brain injury (TBI). Estimation of this relationship requires clinical information on injury severity, patient demographics, and healthcare utilization, which are often obtained from separate data sources. Because of privacy and confidentiality regulations, these data sources do not include unique identifiers to link records across data sources. Record linkage is a process to identify records that represent the same entity across data sources in the absence of unique identifiers. These processes commonly rely on agreement between variables that appear in both data sources to link records. However, when the number of records in each file is large, this task is computationally intensive and may result in false links. Blocking is a data partitioning technique that reduces the number of possible links that should be considered. Healthcare providers can be used as blocks in applications of record linkage with healthcare datasets. However, providers may not be uniquely identified across files. We propose a Bayesian record linkage procedure that simultaneously performs block-level and record-level linkage. This iterative approach incorporates the record-level linkage within block pairs to improve the accuracy of the block-level linkage. Subsequently, the algorithm improves record-level linkage using the accurate partitioning of the linkage space through blocking. We demonstrate that our proposed method provides improved performance compared to existing Bayesian record linkage methods that do not incorporate blocking. The proposed procedure is then used to merge registry data from the National Trauma Data Bank with Medicare claims data to estimate the relationship between injury severity and TBI patients' recovery.

READ FULL TEXT
research
08/10/2023

Bayesian Record Linkage with Variables in One File

In many healthcare and social science applications, information about un...
research
09/20/2016

An Ensemble Blocking Scheme for Entity Resolution of Large and Sparse Datasets

Entity Resolution, also called record linkage or deduplication, refers t...
research
03/12/2020

Assessing the accuracy of individual link with varying block sizes and cut-off values using MaCSim approach

Record linkage is the process of matching together the records from diff...
research
12/20/2012

An Experiment with Hierarchical Bayesian Record Linkage

In record linkage (RL), or exact file matching, the goal is to identify ...
research
09/30/2020

Maximum Entropy classification for record linkage

By record linkage one joins records residing in separate files which are...
research
09/27/2017

Scaling Author Name Disambiguation with CNF Blocking

An author name disambiguation (AND) algorithm identifies a unique author...
research
03/12/2020

MaCSim approach to assess the accuracy of individual matched records with varying block sizes and cut-off values

Record linkage is the process of matching together the records from diff...

Please sign up or login with your details

Forgot password? Click here to reset