Glucose values prediction five years ahead with a new framework of missing responses in reproducing kernel Hilbert spaces, and the use of continuous glucose monitoring technolo

by Marcos Matabuena, et al.

The AEGIS study contains unique information on longitudinal changes in circulating glucose, collected through continuous glucose monitoring (CGM) technology. However, as is usual in longitudinal medical studies, a significant amount of data is missing in the outcome variables. For example, 40 percent of the glycosylated hemoglobin (A1C) biomarker measurements five years ahead are missing. To reduce the impact of this issue, this article proposes a new data analysis framework based on learning in reproducing kernel Hilbert spaces (RKHS) with missing responses, which can capture non-linear relations between the variables studied in different supervised modeling tasks. First, we extend the Hilbert-Schmidt dependence measure to test statistical independence in this context, introducing a new bootstrap procedure for which we prove consistency. Next, we adapt or use existing models of variable selection, regression, and conformal inference to obtain new clinical findings about glucose changes five years ahead with the AEGIS data. The most relevant findings are summarized below: i) We identify new factors associated with long-term glucose evolution; ii) We show the clinical sensitivity of CGM data for detecting changes in glucose metabolism; iii) We show that clinical interventions can be improved by using the glucose changes that our algorithms predict from patients' baseline characteristics.
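To make the independence-testing step more concrete, the following is a minimal sketch of a kernel dependence test based on the standard biased empirical Hilbert-Schmidt independence criterion. It is not the procedure proposed in the article: it simply discards subjects with a missing response (a complete-case analysis, which is only justified if responses are missing completely at random) and calibrates the statistic by permutation rather than by the article's bootstrap. The function names, the Gaussian kernel, and the fixed bandwidth of 1.0 are all assumptions made for illustration.

```python
import numpy as np


def gaussian_gram(values, bandwidth=1.0):
    """Gaussian-kernel Gram matrix for a 1-D or 2-D array of observations."""
    v = np.asarray(values, dtype=float)
    v = v.reshape(v.shape[0], -1)            # one row per observation
    sq_norms = np.sum(v ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (v @ v.T)
    sq_dists = np.maximum(sq_dists, 0.0)     # guard against rounding below zero
    return np.exp(-sq_dists / (2.0 * bandwidth ** 2))


def hsic_statistic(K, L):
    """Biased empirical HSIC: trace(K H L H) / n^2, with H the centering matrix."""
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    return float(np.trace(K @ H @ L @ H)) / n ** 2


def complete_case_hsic_test(x, y, n_perm=200, seed=0):
    """Permutation test of independence between x and y on complete cases.

    Hypothetical helper: missing responses are encoded as NaN in y and are
    dropped, which assumes they are missing completely at random.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    observed = ~np.isnan(y)
    x_obs, y_obs = x[observed], y[observed]

    K = gaussian_gram(x_obs)
    L = gaussian_gram(y_obs)
    stat = hsic_statistic(K, L)

    rng = np.random.default_rng(seed)
    exceed = 0
    for _ in range(n_perm):
        perm = rng.permutation(len(y_obs))
        # Permuting the responses breaks any dependence on x.
        if hsic_statistic(K, L[np.ix_(perm, perm)]) >= stat:
            exceed += 1
    p_value = (1 + exceed) / (1 + n_perm)
    return stat, p_value
```

A small p-value suggests dependence between the covariate and the observed responses. When missingness depends on the covariates, a complete-case test of this kind can be biased, which is the kind of situation a dedicated missing-response framework is meant to handle.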




