Phenotyping with Positive Unlabelled Learning for Genome-Wide Association Studies

02/15/2022
by   Andre Vauvelle, et al.
0

Identifying phenotypes plays an important role in furthering our understanding of disease biology through practical applications within healthcare and the life sciences. The challenge of dealing with the complexities and noise within electronic health records (EHRs) has motivated applications of machine learning in phenotypic discovery. While recent research has focused on finding predictive subtypes for clinical decision support, here we instead focus on the noise that results in phenotypic misclassification, which can reduce a phenotypes ability to detect associations in genome-wide association studies (GWAS). We show that by combining anchor learning and transformer architectures into our proposed model, AnchorBERT, we are able to detect genomic associations only previously found in large consortium studies with 5× more cases. When reducing the number of controls available by 50%, we find our model is able to maintain 40% more significant genomic associations from the GWAS catalog compared to standard phenotype definitions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/08/2021

Searching for consistent associations with a multi-environment knockoff filter

This paper develops a method based on model-X knockoffs to find conditio...
research
07/25/2022

A unified quantile framework reveals nonlinear heterogeneous transcriptome-wide associations

Transcriptome-wide association studies (TWAS) are powerful tools for ide...
research
06/21/2018

Bayesian hierarchical models for SNP discovery from genome-wide association studies, a semi-supervised machine learning approach

Genome-wide association studies (GWASs) aim to detect genetic risk facto...
research
10/11/2021

Genetic Regulation of Cytokine Response in Patients with Acute Community-acquired Pneumonia

Background: Community-acquired pneumonia (CAP) is an acute disease condi...
research
11/25/2020

Large-scale machine learning-based phenotyping significantly improves genomic discovery for optic nerve head morphology

Genome-wide association studies (GWAS) require accurate cohort phenotypi...
research
01/07/2023

Unsupervised ensemble-based phenotyping helps enhance the discoverability of genes related to heart morphology

Recent genome-wide association studies (GWAS) have been successful in id...
research
11/10/2021

Can you always reap what you sow? Network and functional data analysis of VC investments in health-tech companies

"Success" of firms in venture capital markets is hard to define, and its...

Please sign up or login with your details

Forgot password? Click here to reset