[Strategy Market] Difference in energy between target and 2nd best

Timeroot · June 27, 2012, 12:15am

So, I’m thinking that the most common failure of an RNA to fold will be when it folds into the second-lowest energy state. The score, then, would simply be the difference in free energy between the target shape and the energy of the second lowest, with the larger difference being better.

EDIT: After looking at the ViennaRNA package again I saw there was a built-in program to computer suboptimal shapes, RNAsubopt. The below can be ignored then, I guess just use RNAsubopt with increasing energy margins until a second structure is returned. Ignore the below if you so choose, it’s just me discussing how the algorithm could work if it was needed~

I realize that computing the second-lowest-energy state is something the software is probably not equipped to do, but after reading the through the algorithm used by ViennaRNA, I’m confident that it would be perfectly straightforward. I’ll explain how it could be efficiently implemented below.

The existing algorithm takes subsequences of nts i through j of the RNA and looks for the lowest energy of each. The lowest energy of a larger one can then be found by surveying the j-i ways to split it up, and summing their energies. To find the second lowest-energy all that would be needed would be to store the TWO lowest energies from each subsequence, as the two lowest energies of a larger subsequence will be a combination of the lowest energy from all parts but one.

That is, using the notation in http://www.ncbi.nlm.nih.gov/pmc/artic…, and using V2(i,j) and W2(i,j) to mean the second lowest states, as opposed to V(i,j) and W(i,j) which are the smallest…

V2(i,j) will be the second smallest of:
-E(FH(i,j)) - as there is only one way to create a hairpin, so there is no second-lowest way to arrange it
-either of the 2 lowest values obtained from the “E2” value, instead of just the minimum
-lowest value obtained from a modified “E2” which uses V2 in lieu of V
-either of the 2 lowest values of “E3”
-lowest value of “E3” with one instance of W replaced by W2

W2(i,j) will be the second smallest of:
-W(i,j-1)
-W2(i,j-1)
-W(i+1,j)
-W2(i+1,j)
-V(i,j)
-V2(i,j)
-either of the 2 lowest values of “E4”
-lowest value of a modified “E4” with one instance of W replaced by W2

You can see that in many ways it’s like the original algorithm, and requires simply adding two new arrays of memoized results (V2 and W2), and a new computation for these two.

I just hope that this request isn’t too tall - but I think it would make an effective prediction algorithm!

rhiju · June 27, 2012, 3:44pm

I think you’re exactly right, and we’ve talked about this feature for a while… an ability to see not just the lowest energy structure but perhaps even a ‘spectrum’ of low energy structures. You’re right that there are already algorithms for generating these structures within ViennaRNA, which is the engine behind EteRNA.

I think this ‘spectrum’ will be particularly helpful as we focus more and more on designing switches that bind and respond to other RNAs or molecules. Often the states that we want to switch between are the ensemble of lowest energy conformations even in the absence of the binding partner.

I’ll bring this back up with the devs! We’re a bit swamped now with preparations for EteRNA 2, but I think there will be a big push soon to update the game itself.

jandersonlee · June 27, 2012, 7:14pm

The barriers approach of showing multiple related low-energy configurations is appealing in this regard. With barriers you can also see how much energy is required or released by adding or subtracting one-or-two bonds at a time.

I wrote about adding a manual barriers mode to the lab/puzzle tool in another thread. The idea would be to allow users a chance to see what the energy difference would be if some bond(s) were made or broken. Think of it as an “un-natural” mode. Or sort of like a FoldIt-yourself mode, with free-energy feedback. I won’t go into too much detail here as I covered it once before.