HetNetAligner: Design and Implementation of an algorithm for heterogeneous network alignment on Apache Spark

06/11/2018
by   Pietro H Guzzi, et al.
0

The importance of the use of networks to model and analyse biological data and the interplay of bio-molecules is widely recognised. Consequently, many algorithms for the analysis and the comparison of networks (such as alignment algorithms) have been developed in the past. Recently, many different approaches tried to integrate into a single model the interplay of different molecules, such as genes, transcription factors and microRNAs. A possible formalism to model such scenario comes from node coloured networks (or heterogeneous networks) implemented as node/ edge-coloured graphs. Consequently, the need for the introduction of alignment algorithms able to analyse heterogeneous networks arises. To the best of our knowledge, all the existing algorithms are not able to mine heterogeneous networks. We propose a two-step alignment strategy that receives as input two heterogeneous networks (node-coloured graphs) and a similarity function among nodes of two networks extending the previous formulations. We first build a single alignment graph. Then we mine this graph extracting relevant subgraphs. Despite this simple approach, the analysis of such networks relies on graph and subgraph isomorphism and the size of the data is still growing. Therefore the use of high-performance data analytics framework is needed. We here present HetNetAligner a framework built on top of Apache Spark. We also implemented our algorithm, and we tested it on some selected heterogeneous biological networks. Preliminary results confirm that our method may extract relevant knowledge from biological data reducing the computational time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2020

Extracting Dense and Connected Subgraphs in Dual Networks by Network Alignment

The use of network based approaches to model and analyse large datasets ...
research
04/20/2013

A Markov Model for Ontology Alignment

The explosion of available data along with the need to integrate and uti...
research
07/12/2018

A new graph modelisation for molecule similarity

In order to define the process of restrosynthesis of a new organic molec...
research
01/28/2019

Heterogeneous Network Motifs

Many real-world applications give rise to large heterogeneous networks w...
research
08/23/2022

CAPER: Coarsen, Align, Project, Refine - A General Multilevel Framework for Network Alignment

Network alignment, or the task of finding corresponding nodes in differe...
research
12/24/2021

Dual Hierarchical Attention Networks for Bi-typed Heterogeneous Graph Learning

Bi-typed heterogeneous graphs are applied in many real-world scenarios. ...
research
11/22/2018

Scalable Label Propagation Algorithms for Heterogeneous Networks

Heterogeneous networks are large graphs consisting of different types of...

Please sign up or login with your details

Forgot password? Click here to reset