VeRNAl: A Tool for Mining Fuzzy Network Motifs in RNA

09/01/2020
by Carlos Oliver, et al.

Motivation: RNAs are ubiquitous molecules involved in many regulatory and catalytic processes. Their ability to form complex structures is often key to supporting these functions. Remarkably, RNA 3D structures are articulated around smaller 3D sub-units, referred to as RNA 3D motifs, that can be found in unrelated molecules. Classifying these 3D motifs is thus essential for characterizing RNA structures, but current methods can only retrieve motifs with identical base interaction patterns.

Results: Here, we relax this constraint by posing the motif finding problem as a graph representation learning and clustering task. This framing takes advantage of the continuous nature of graph representations to model the flexibility of RNA motifs while retaining the convenient encoding of RNAs as graphs. We propose a set of node similarity functions, clustering methods, and motif construction algorithms to recover flexible RNA motifs. We show that our methods are able to retrieve and expand known classes of motifs, and also to identify new motifs. Our tool, VeRNAl, can be easily customized by users to desired levels of motif flexibility, abundance, and size.

Availability and Implementation: The source code, data, and a webserver are available at vernal.cs.mcgill.ca
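To make the framing described in the Results above more concrete, here is a minimal toy sketch of how a continuous node similarity lets near-identical interaction patterns be grouped, where exact matching would keep them apart. It is not VeRNAl's implementation: the graph, the interaction labels ("B", "CWW", "TSH"), the count-vector features (a stand-in for learned embeddings), the cosine similarity, and the greedy `cluster` function are all assumptions chosen for illustration.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy graph: nodes are nucleotides, edges carry an interaction label.
# "B" = backbone, "CWW" = canonical pair, "TSH" = a non-canonical pair (illustrative only).
EDGES = [
    ("x", "p1", "B"), ("x", "p2", "B"), ("x", "p3", "CWW"), ("x", "p4", "TSH"),
    ("y", "q1", "B"), ("y", "q2", "B"), ("y", "q3", "CWW"),
    ("z", "r1", "TSH"),
]
LABELS = ["B", "CWW", "TSH"]


def neighborhood_vector(node, edges):
    """Count the interaction labels on edges touching `node` (a crude stand-in for an embedding)."""
    counts = Counter(label for u, v, label in edges if node in (u, v))
    return [counts[label] for label in LABELS]


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    dot = sum(p * q for p, q in zip(a, b))
    norm = sqrt(sum(p * p for p in a)) * sqrt(sum(q * q for q in b))
    return dot / norm if norm else 0.0


def cluster(nodes, vectors, threshold):
    """Greedy grouping: a node joins the first cluster whose first member is similar enough."""
    clusters = []
    for node in nodes:
        for members in clusters:
            if cosine(vectors[node], vectors[members[0]]) >= threshold:
                members.append(node)
                break
        else:
            clusters.append([node])
    return clusters


nodes = ["x", "y", "z"]
vectors = {n: neighborhood_vector(n, EDGES) for n in nodes}

strict = cluster(nodes, vectors, threshold=0.999)  # near-exact matching
fuzzy = cluster(nodes, vectors, threshold=0.9)     # relaxed matching
print(strict)
print(fuzzy)
```

In this toy, `x` and `y` differ by a single non-canonical interaction. The strict threshold keeps them in separate groups, while the relaxed threshold places them together and still leaves `z` apart. The threshold plays the role of a user-chosen flexibility level.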

