MPGM: Scalable and Accurate Multiple Network Alignment

04/26/2018
by   Ehsan Kazemi, et al.
0

Protein-protein interaction (PPI) network alignment is a canonical operation to transfer biological knowledge among species. The alignment of PPI-networks has many applications, such as the prediction of protein function, detection of conserved network motifs, and the reconstruction of species' phylogenetic relationships. A good multiple-network alignment (MNA), by considering the data related to several species, provides a deep understanding of biological networks and system-level cellular processes. With the massive amounts of available PPI data and the increasing number of known PPI networks, the problem of MNA is gaining more attention in the systems-biology studies. In this paper, we introduce a new scalable and accurate algorithm, called MPGM, for aligning multiple networks. The MPGM algorithm has two main steps: (i) SEEDGENERATION and (ii) MULTIPLEPERCOLATION. In the first step, to generate an initial set of seed tuples, the SEEDGENERATION algorithm uses only protein sequence similarities. In the second step, to align remaining unmatched nodes, the MULTIPLEPERCOLATION algorithm uses network structures and the seed tuples generated from the first step. We show that, with respect to different evaluation criteria, MPGM outperforms the other state-of-the-art algorithms. In addition, we guarantee the performance of MPGM under certain classes of network models. We introduce a sampling-based stochastic model for generating k correlated networks. We prove that for this model, if a sufficient number of seed tuples are available, the MULTIPLEPERCOLATION algorithm correctly aligns almost all the nodes. Our theoretical results are supported by experimental evaluations over synthetic networks.

READ FULL TEXT

page 12

page 13

page 14

page 15

research
11/02/2018

SPECTRE: Seedless Network Alignment via Spectral Centralities

Network alignment consists of finding a correspondence between the nodes...
research
10/09/2020

Exact p-values for global network alignments via combinatorial analysis of shared GO terms

Network alignment aims to uncover topologically similar regions in the p...
research
08/20/2023

SBSM-Pro: Support Bio-sequence Machine for Proteins

Proteins play a pivotal role in biological systems. The use of machine l...
research
03/07/2021

Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases

The widespread of Coronavirus has led to a worldwide pandemic with a hig...
research
09/21/2018

Low rank methods for multiple network alignment

Multiple network alignment is the problem of identifying similar and rel...
research
11/06/2019

Using Residual Dipolar Couplings from Two Alignment Media to Detect Structural Homology

The method of Probability Density Profile Analysis has been introduced p...
research
05/11/2020

Exact Parallelization of the Stochastic Simulation Algorithm for Scalable Simulation of Large Biochemical Networks

Comprehensive simulations of the entire biochemistry of cells have great...

Please sign up or login with your details

Forgot password? Click here to reset