Prior Knowledge based mutation prioritization towards causal variant finding in rare disease

10/10/2017
by Vasundhara Dehiya, et al.

How do we determine mutational effects in exome sequencing data when there is little or no statistical evidence? Can protein structural information fill the gap left by insufficient statistical evidence? In this work, we address these two questions with the goal of determining the pathogenic effects of rare variants in rare disease. Our approach assesses the importance of point mutation loci using protein structure features. The proposed structure-based features capture geometric, physicochemical, and functional information about the mutation loci and about the structural neighbors of those loci. A multi-layer perceptron using the structure-based features, trained on 80% of HumDiv and tested on the remaining 20% of HumDiv and on ClinVar, discriminated well between pathogenic and benign mutations: F scores of 0.71 and 0.68, respectively. Combining structure- and sequence-based features further improves performance: F scores of 0.86 (HumDiv) and 0.75 (ClinVar). In addition, careful examination of rare variants in rare disease cases showed that structure-based features are important for discerning the importance of variant loci.
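As a rough illustration of the evaluation setup described above, the sketch below shows an 80%/20% split and an F score computed from predicted labels. It assumes that "F score" means the F1 score (the harmonic mean of precision and recall), which the abstract does not state explicitly. The helper names and the toy labels are hypothetical and are not drawn from HumDiv, ClinVar, or the authors' code.

```python
import random


def train_test_split_80_20(items, seed=0):
    """Shuffle a copy of `items` and split it into 80% train / 20% test."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


def f_score(y_true, y_pred, positive=1):
    """F1 score for the positive class (assumed here to be 'pathogenic')."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels only (1 = pathogenic, 0 = benign); purely illustrative.
y_true = [1, 1, 1, 0, 0, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 1, 0, 0, 1, 0]

train, test = train_test_split_80_20(range(100))
score = f_score(y_true, y_pred)
print(len(train), len(test), round(score, 2))
```

In practice the predictions would come from a classifier such as a multi-layer perceptron fitted on the training portion; that step is omitted here because the feature definitions are given only in the full text.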

