De novo construction of q-ploid linkage maps using discrete graphical models

10/03/2017
by Pariya Behrouzi, et al.

Linkage maps are important for fundamental and applied genetic research. New sequencing techniques have created opportunities to substantially increase the density of genetic markers. Such advances in technology bring new challenges in methodology and informatics. In this article, we introduce a novel linkage map algorithm to construct high-quality, high-density linkage maps for diploid and polyploid species. We propose to construct linkage maps using graphical models, either via a sparse Gaussian copula or via a nonparanormal skeptic approach. Linkage groups (LGs), typically chromosomes, and the order of markers in each LG are determined by revealing the conditional independence relationships among a large number of markers in the genome. We illustrate the efficiency of the inference method on a broad range of synthetic data with varying rates of missingness and genotyping errors. We show that our method outperforms other available methods in determining the correct number of linkage groups and in ordering markers, both when the data are clean and complete and when they are noisy and incomplete. In addition, we apply the method to real genotype data of barley and potato from diploid and tetraploid populations, respectively. Most tetraploid potato linkage maps have been generated either from diploid populations (Felcher et al., 2012) or from a subset of marker types (e.g. both parents were heterozygous) (Grandke et al., 2017). A map construction method based on discrete graphical models therefore opens the opportunity to construct high-quality linkage maps for any biparental diploid or polyploid species using all marker types. We have implemented the method in the R package netgwas (Behrouzi and Wit, 2017b).
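As a rough illustration of the idea of reading linkage groups off a conditional independence graph, the following Python sketch rank-transforms genotype codes into a nonparanormal skeptic correlation matrix, converts it to partial correlations, and takes connected components of the thresholded graph as candidate linkage groups. It is not the netgwas implementation: the function names, the ridge regularisation (used here in place of a sparse penalised estimator), the threshold of 0.2 and the simulated data are all assumptions made for illustration, and marker ordering within a group is not attempted.

```python
import numpy as np


def skeptic_correlation(X):
    """Nonparanormal skeptic estimate: sin(pi/2 * Kendall's tau-b) per marker pair.

    X is an (n samples x p markers) array of genotype codes; no marker may be constant.
    """
    n, p = X.shape
    # Sign of every ordered pairwise sample difference, per marker: (p, n*n).
    S = np.sign(X.T[:, :, None] - X.T[:, None, :]).reshape(p, -1)
    num = S @ S.T                    # 2 * (concordant - discordant) pairs
    norm = np.sqrt(np.diag(num))     # sqrt(2 * untied pairs) per marker
    tau_b = num / np.outer(norm, norm)
    return np.sin(0.5 * np.pi * tau_b)


def partial_correlations(R, ridge=0.1):
    """Partial correlations from a ridge-regularised inverse of R.

    Assumption: a simple ridge stands in for a sparse (penalised) estimator.
    """
    w, V = np.linalg.eigh(R)
    w = np.maximum(w, 0.0) + ridge   # clip negative eigenvalues, then regularise
    theta = (V / w) @ V.T            # precision matrix
    d = np.sqrt(np.diag(theta))
    P = -theta / np.outer(d, d)
    np.fill_diagonal(P, 1.0)
    return P


def linkage_groups(A):
    """Connected components of a boolean adjacency matrix (depth-first search)."""
    seen, groups = set(), []
    for start in range(A.shape[0]):
        if start in seen:
            continue
        seen.add(start)
        stack, comp = [start], []
        while stack:
            i = stack.pop()
            comp.append(i)
            for j in np.flatnonzero(A[i]):
                j = int(j)
                if j not in seen:
                    seen.add(j)
                    stack.append(j)
        groups.append(sorted(comp))
    return groups


if __name__ == "__main__":
    # Hypothetical data: two independent chains of 5 markers, coded 0/1/2.
    rng = np.random.default_rng(0)
    n, rho = 300, 0.9

    def chain(k):
        z = np.empty((n, k))
        z[:, 0] = rng.standard_normal(n)
        for j in range(1, k):
            noise = rng.standard_normal(n)
            z[:, j] = rho * z[:, j - 1] + np.sqrt(1 - rho**2) * noise
        return z

    X = np.digitize(np.hstack([chain(5), chain(5)]), [-0.5, 0.5])
    P = partial_correlations(skeptic_correlation(X))
    A = np.abs(P) > 0.2              # illustrative threshold, not a tuned value
    np.fill_diagonal(A, False)
    print(linkage_groups(A))
```

The rank-based correlation is used because genotype codes are discrete and ordinal, so a plain Pearson correlation on the raw codes would not reflect the latent Gaussian copula structure the abstract refers to. In practice the threshold and regularisation would be chosen by a model selection criterion rather than fixed by hand.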

research
10/03/2017

netgwas: An R Package for Network-Based Genome-Wide Association Studies

Graphical models are powerful tools for modeling and making statistical ...
research
02/10/2012

High Dimensional Semiparametric Gaussian Copula Graphical Models

In this paper, we propose a semiparametric approach, named nonparanormal...
research
07/26/2023

On the application of Gaussian graphical models to paired data problems

Gaussian graphical models are nowadays commonly applied to the compariso...
research
03/05/2021

sparta: Sparse Tables and their Algebra with a View Towards High Dimensional Graphical Models

A graphical model is a multivariate (potentially very high dimensional) ...
research
09/27/2013

Bayesian Inference in Sparse Gaussian Graphical Models

One of the fundamental tasks of science is to find explainable relations...
research
03/09/2023

Exploration of the search space of Gaussian graphical models for paired data

We consider the problem of learning a Gaussian graphical model in the ca...