Data Harmonization Via Regularized Nonparametric Mixing Distribution Estimation

10/12/2021
by Steven Wilkins-Reeves, et al.

Data harmonization is the process by which an equivalence is developed between two variables measuring a common trait. Our problem is motivated by dementia research, in which multiple tests are used in practice to measure the same underlying cognitive ability, such as language or memory. We connect this statistical problem to mixing distribution estimation. We introduce and study a nonparametric latent trait model, develop a method that enforces uniqueness of the regularized maximum likelihood estimator, show that a nonparametric EM algorithm converges weakly to its maximizer, and propose a faster algorithm for learning a discretized approximation of the latent distribution. Furthermore, we develop methods to assess goodness of fit for the mixing likelihood, an area neglected in most mixing distribution estimation problems. We apply our method to the National Alzheimer's Coordinating Center Uniform Data Set and show that it can convert between score measurements while accounting for measurement error. We show that this method outperforms standard techniques commonly used in dementia research. Full code is available at https://github.com/SteveJWR/Data-Harmonization-Nonparametric.
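To make the idea of learning a discretized approximation of the latent distribution concrete, the following is a minimal sketch of a plain (unregularized) EM update for mixing weights on a fixed grid. It is not the authors' implementation: the binomial measurement model, the grid, the simulated scores, and the names `binomial_kernel` and `em_mixing_weights` are all assumptions made for illustration, and the regularization described in the abstract is omitted.

```python
import numpy as np
from math import comb


def binomial_kernel(n_items, grid):
    """Return A with A[y, k] = P(Y = y | theta = grid[k]).

    Assumption: scores follow Binomial(n_items, theta); this measurement
    model is chosen only for illustration.
    """
    ys = np.arange(n_items + 1)
    coeffs = np.array([comb(n_items, int(y)) for y in ys], dtype=float)
    success = grid[None, :] ** ys[:, None]
    failure = (1.0 - grid[None, :]) ** (n_items - ys[:, None])
    return coeffs[:, None] * success * failure


def em_mixing_weights(scores, n_items, grid, n_iter=200):
    """Unregularized EM for the weights of a latent distribution on `grid`."""
    A = binomial_kernel(n_items, grid)
    # Sufficient statistic: how often each score 0..n_items was observed.
    counts = np.bincount(scores, minlength=n_items + 1).astype(float)
    w = np.full(grid.size, 1.0 / grid.size)  # start from uniform weights
    loglik = []
    for _ in range(n_iter):
        marginal = A @ w  # P(Y = y) under the current weights
        loglik.append(float(counts @ np.log(marginal)))
        # E-step (posterior responsibilities) and M-step (their average),
        # combined into a single multiplicative update.
        w = w * (A.T @ (counts / marginal)) / counts.sum()
    return w, loglik


# Hypothetical usage on simulated data (not the NACC data set).
rng = np.random.default_rng(0)
theta = rng.beta(2.0, 5.0, size=500)        # simulated latent traits
scores = rng.binomial(30, theta)            # simulated test scores out of 30
grid = np.linspace(0.01, 0.99, 50)          # grid kept strictly inside (0, 1)
weights, loglik = em_mixing_weights(scores, 30, grid)
```

The grid is kept strictly inside (0, 1) so that every score has positive probability at every grid point, which keeps the logarithm finite. Each update preserves the total weight, and the recorded log-likelihood should not decrease across iterations, which gives a quick sanity check on the implementation.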


