Partial Orthology, Paralogy and Xenology Relations - Satisfiability in terms of Di-Cographs

11/01/2017
by   Nikolai Nøjgaard, et al.
0

A variety of methods based on sequence similarity, reconciliation, synteny or functional characteristics, can be used to infer homology relations, that is, orthology, paralogy and xenology relations between genes of a given gene family G. The (inferred) homology relations might not cover each pair of genes and thus, provide only partial knowledge on the full set of homology relations. Moreover, for particular pairs of genes it might be known with a high degree of certainty that they are not orthologs (resp. paralogs, xenologs) which yields forbidden pairs of genes. The question arises as whether such sets of (partial) homology relations with or without forbidden gene pairs are satisfiable, i.e., can they simultaneously co-exist in an evolutionary history for G. In this contribution, we characterize satisfiable homology relations. To this end, we employ the graph structure provided by these relations. In particular, the latter allows us to characterize full satisfiable homology relations as so-called di-cographs. Let m denote the total number of known and forbidden gene pairs. We provide a simple O(|G|^2 + m|G|)-time algorithm to determine whether such homology relations are satisfiable and, in the positive case, to construct event-labeled gene trees containing speciation, duplication and horizontal gene transfer (HGT) events that can explain the relations. The provided algorithm is evaluated on large-scaled simulated data sets. As we shall see, a comparably small amount of information about the original homologous relationships between the gene pairs is necessary to reconstruct most of the original relations. The algorithm and datasets are freely-available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2019

Reconstruction of time-consistent species trees

The history of gene families – which are equivalent to event-labeled gen...
research
12/01/2021

Orientation of Fitch Graphs and Detection of Horizontal Gene Transfer in Gene Trees

Horizontal gene transfer events partition a gene tree T and thus, its le...
research
04/24/2023

The Theory of Gene Family Histories

Most genes are part of larger families of evolutionary related genes. Th...
research
11/18/2019

Generalized Fitch Relations II: Sets of Binary Relations that are explained by Edge-labeled Trees

Fitch graphs G=(X,E) are digraphs that are explained by {∅, 1}-edge-labe...
research
12/18/2017

Phylogenomics with Paralogs

Phylogenomics heavily relies on well-curated sequence data sets that con...
research
05/07/2018

GeneVis - An interactive visualization tool for combining cross-discipline datasets within genetics

GeneVis is a web-based tool to visualize complementary data sets of diff...
research
11/06/2017

Reconstructing Gene Trees From Fitch's Xenology Relation

Two genes are xenologs in the sense of Fitch if they are separated by at...

Please sign up or login with your details

Forgot password? Click here to reset