Effective Learning of Probabilistic Models for Clinical Predictions from Longitudinal Data

11/02/2018
by   Shuo Yang, et al.
0

With the expeditious advancement of information technologies, health-related data presented unprecedented potentials for medical and health discoveries but at the same time significant challenges for machine learning techniques both in terms of size and complexity. Those challenges include: the structured data with various storage formats and value types caused by heterogeneous data sources; the uncertainty widely existing in every aspect of medical diagnosis and treatments; the high dimensionality of the feature space; the longitudinal medical records data with irregular intervals between adjacent observations; the richness of relations existing among objects with similar genetic factors, location or socio-demographic background. This thesis aims to develop advanced Statistical Relational Learning approaches in order to effectively exploit such health-related data and facilitate the discoveries in medical research. It presents the work on cost-sensitive statistical relational learning for mining structured imbalanced data, the first continuous-time probabilistic logic model for predicting sequential events from longitudinal structured data as well as hybrid probabilistic relational models for learning from heterogeneous structured data. It also demonstrates the outstanding performance of these proposed models as well as other state of the art machine learning models when applied to medical research problems and other real-world large-scale systems, reveals the great potential of statistical relational learning for exploring the structured health-related data to facilitate medical research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2017

Relational Learning and Feature Extraction by Querying over Heterogeneous Information Networks

Many real world systems need to operate on heterogeneous information net...
research
08/18/2017

Statistical Latent Space Approach for Mixed Data Modelling and Applications

The analysis of mixed data has been raising challenges in statistics and...
research
09/20/2018

Recurrent Neural Networks based Obesity Status Prediction Using Activity Data

Obesity is a serious public health concern world-wide, which increases t...
research
04/30/2023

Sensitive Data Detection with High-Throughput Machine Learning Models in Electrical Health Records

In the era of big data, there is an increasing need for healthcare provi...
research
11/13/2019

Federated Learning for Healthcare Informatics

Recent rapid development of medical informatization and the correspondin...
research
04/22/2018

HeteroMed: Heterogeneous Information Network for Medical Diagnosis

With the recent availability of Electronic Health Records (EHR) and grea...
research
06/04/2022

Interpretable Models Capable of Handling Systematic Missingness in Imbalanced Classes and Heterogeneous Datasets

Application of interpretable machine learning techniques on medical data...

Please sign up or login with your details

Forgot password? Click here to reset