On an enhancement of RNA probing data using Information Theory

09/12/2019
by   Thomas J. X. Li, et al.
0

Identifying the secondary structure of an RNA is crucial for understanding its diverse regulatory functions. This paper focuses on how to enhance target identification in a Boltzmann ensemble of structures via chemical probing data. We employ an information-theoretic approach to solve the problem, via considering a variant of the Rényi-Ulam game. Our framework is centered around the ensemble tree, a hierarchical bi-partition of the input ensemble, that is constructed by recursively querying about whether or not a base pair of maximum information entropy is contained in the target. These queries are answered via relating local with global probing data, employing the modularity in RNA secondary structures. We present that leaves of the tree are comprised of sub-samples exhibiting a distinguished structure with high probability. In particular, for a Boltzmann ensemble incorporating probing data, which is well established in the literature, the probability of our framework correctly identifying the target in the leaf is greater than 90%.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2022

Envelope imbalanced ensemble model with deep sample learning and local-global structure consistency

The class imbalance problem is important and challenging. Ensemble appro...
research
01/27/2023

Algorithms for ranking and unranking the combinatorial set of RNA secondary structures

In this paper, we study the combinatorial set of RNA secondary structure...
research
04/08/2022

RubCSG at SemEval-2022 Task 5: Ensemble learning for identifying misogynous MEMEs

This work presents an ensemble system based on various uni-modal and bi-...
research
01/05/2023

Optimal lower bounds for Quantum Learning via Information Theory

Although a concept class may be learnt more efficiently using quantum sa...
research
03/27/2023

Interpretable machine learning of amino acid patterns in proteins: a statistical ensemble approach

Explainable and interpretable unsupervised machine learning helps unders...
research
08/22/2014

A Bayesian Ensemble Regression Framework on the Angry Birds Game

An ensemble inference mechanism is proposed on the Angry Birds domain. I...
research
09/15/2022

Structure preservation via the Wasserstein distance

We show that under minimal assumptions on a random vector X∈ℝ^d, and wit...

Please sign up or login with your details

Forgot password? Click here to reset