Minimum length RNA folding trajectories

02/18/2018
by   Amir H. Bayegan, et al.
0

The Kinfold and KFOLD programs for RNA folding kinetics implement the Gillespie algorithm to generate stochastic folding trajectories from an initial structure s to a target structure t, in which each intermediate secondary structure is obtained from its predecessor by the addition, removal or shift of a single base pair. Define MS2 distance between secondary structures s and t to be the minimum path length to refold s to t, where a move from MS2 is applied in each step. We describe algorithms to compute the shortest MS2 folding trajectory between any two given RNA secondary structures. These algorithms include an optimal integer programming (IP) algorithm, an accurate and efficient near-optimal algorithm, a greedy algorithm, a branch-and-bound algorithm, and an optimal algorithm if one allows intermediate structures to contain pseudoknots. Our optimal IP [resp. near-optimal IP] algorithm maximizes [resp. approximately maximizes] the number of shifts and minimizes [resp. approximately minimizes] the number of base pair additions and removals by applying integer programming to (essentially) solve the minimum feedback vertex set (FVS) problem for the RNA conflict digraph, then applies topological sort to tether subtrajectories into the final optimal folding trajectory. We prove NP-hardness of the problem to determine the minimum barrier energy over all possible MS2 folding pathways, and conjecture that computing the MS2 distance between arbitrary secondary structures is NP-hard. Since our optimal IP algorithm relies on the FVS, known to be NP-complete for arbitrary digraphs, we compare the family of RNA conflict digraphs with the following classes of digraphs (planar, reducible flow graph, Eulerian, and tournament) for which FVS is known to be either polynomial time computable or NP-hard. Source code available at http://bioinformatics.bc.edu/clotelab/MS2distance/.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2019

Hardness and approximation for the geodetic set problem in some graph classes

In this paper, we study the computational complexity of finding the geod...
research
11/11/2019

Paired Domination versus Domination and Packing Number in Graphs

Given a graph G=(V(G), E(G)), the size of a minimum dominating set, mini...
research
08/03/2022

On the Computational Complexity of the Strong Geodetic Recognition Problem

A strong geodetic set of a graph G=(V,E) is a vertex set S ⊆ V(G) in whi...
research
01/06/2020

Tight Localizations of Feedback Sets

The classical NP-hard feedback arc set problem (FASP) and feedback verte...
research
08/21/2018

Stable divisorial gonality is in NP

Divisorial gonality and stable divisorial gonality are graph parameters,...
research
04/07/2020

Exact separation of forbidden-set cuts associated with redundant parity checks of binary linear codes

In recent years, several integer programming (IP) approaches were develo...
research
12/28/2016

Automated timetabling for small colleges and high schools using huge integer programs

We formulate an integer program to solve a highly constrained academic t...

Please sign up or login with your details

Forgot password? Click here to reset