Folding Algorithm

jeanmichel.sellier · February 26, 2015, 7:36am

Hello Everyone,

I am curious to know how the folding algorithm of this game works. As far as I know it should be a simplification of some deeper quantum model but I am not sure. Could someone give me more details?

Thanks!

JM

nando · February 26, 2015, 7:51am

Hi Jean-Michel,

Secondary structure prediction is far from being a simplification of some quantum model, as far as I can tell. The most accurate software currently available (including the one used in the game) is based on the nearest neighbor model. As detailed at http://rna.urmc.rochester.edu/NNDB/he… the thermodynamic stability of a motif is assumed to be determined by the ‘elements’ enclosing it, essentially (canonical and wobble) base pairs, possibly enclosing loops. A common simplification consists in ignoring pseudoknots, which allows the problem to be solved in polynomial time using dynamic programming.
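To make the dynamic programming point concrete, here is a minimal sketch. Note that it is not the nearest neighbor model: it swaps the thermodynamic energy tables for a much cruder score (the number of nested base pairs, in the style of the classic Nussinov recursion), purely to show why excluding pseudoknots leads to a polynomial-time recursion. The function name `max_pairs_fold` and the `MIN_LOOP` value are assumptions made for this illustration, not anything taken from the game's folding engine.

```python
# Simplified stand-in: maximize the number of nested base pairs.
# Real folding engines minimize a nearest neighbor free energy instead.

# Canonical (A-U, G-C) and wobble (G-U) pairs.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # assumed minimum number of unpaired bases inside a hairpin


def max_pairs_fold(seq):
    """Return (pair_count, dot_bracket) for one maximally paired nested fold."""
    n = len(seq)
    if n == 0:
        return 0, ""
    # best[i][j] = maximum number of nested pairs within seq[i..j] (inclusive).
    best = [[0] * n for _ in range(n)]
    for span in range(MIN_LOOP + 1, n):
        for i in range(n - span):
            j = i + span
            score = best[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - MIN_LOOP):  # case 2: base j pairs with base k
                if (seq[k], seq[j]) in PAIRS:
                    left = best[i][k - 1] if k > i else 0
                    score = max(score, left + 1 + best[k + 1][j - 1])
            best[i][j] = score

    # Trace back through the table to recover one optimal structure.
    structure = ["."] * n
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= MIN_LOOP:
            continue  # too short to contain any pair
        if best[i][j] == best[i][j - 1]:
            stack.append((i, j - 1))
            continue
        for k in range(i, j - MIN_LOOP):
            if (seq[k], seq[j]) in PAIRS:
                left = best[i][k - 1] if k > i else 0
                if left + 1 + best[k + 1][j - 1] == best[i][j]:
                    structure[k], structure[j] = "(", ")"
                    if k > i:
                        stack.append((i, k - 1))
                    stack.append((k + 1, j - 1))
                    break
    return best[0][n - 1], "".join(structure)
```

For example, `max_pairs_fold("GGGAAACCC")` returns `(3, "(((...)))")`. Because a pair (k, j) splits the sequence into two independent pieces only when no pair crosses it, the table can be filled in O(n³) time; allowing pseudoknots breaks that independence.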

Hope this helps

jeanmichel.sellier · February 26, 2015, 7:57am

Hi Nando,

This is very helpful! Many thanks for the explanation and for the link!

I appreciate it,

JM

nando · February 26, 2015, 8:02am

Oh, and I forgot to mention: in terms of QM, the only related calculation I ever found was on http://www.begdb.com/ and was a prediction of the stacking interaction between cytosines and adenines, some of them possibly protonated. Unfortunately, I can’t seem to find the data again… In any case, this was still very far from a prediction of any kind of secondary structure.

jeanmichel.sellier · February 26, 2015, 8:05am

I am frankly amazed by this nearest neighbor model. It seems to be very fast and does not involve first-principles simulations. That raises some other questions:

Is this model always accurate? I mean, compared to a full-fledged quantum simulation, how good are the predictions made by this model? Are they always in agreement?

nando · February 26, 2015, 8:11am

Depending on the “classes” of RNA sequences, and especially on their length, accuracy varies from a reported 90+% down to 50% and sometimes less. Also, algorithms that ignore pseudoknots will necessarily mispredict the secondary structure of any sequence that tends to form these motifs.

As you can see, there are quite a few limitations…
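To illustrate what "ignoring pseudoknots" means in practice, here is a small sketch that checks whether a set of base pairs contains a crossing, i.e. two pairs (i, j) and (k, l) with i < k < j < l. A structure with such a crossing cannot be written in plain dot-bracket notation and cannot be produced by a nested-only dynamic programming algorithm. The function names and the representation of a structure as a list of index pairs are assumptions made for this example.

```python
def find_crossing_pairs(pairs):
    """Return every couple of base pairs (i, j), (k, l) with i < k < j < l."""
    # Normalize each pair so the smaller index comes first, then sort by it.
    ordered = sorted((min(a, b), max(a, b)) for a, b in pairs)
    crossings = []
    for idx, (i, j) in enumerate(ordered):
        for k, l in ordered[idx + 1:]:
            if i < k < j < l:
                crossings.append(((i, j), (k, l)))
    return crossings


def has_pseudoknot(pairs):
    """True if at least two base pairs cross each other."""
    return bool(find_crossing_pairs(pairs))
```

A simple hairpin such as `[(0, 8), (1, 7), (2, 6)]` is fully nested, so `has_pseudoknot` returns `False`, whereas `[(0, 5), (3, 8)]` contains a crossing and returns `True`.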

jeanmichel.sellier · February 26, 2015, 8:14am

Now it all makes sense to me. Many thanks for your help with that!

So, would that mean that when I try to fold my RNA in the game, I may sometimes get an inaccurate fold because of the model? Or do the developers make sure that the only RNAs available in the game are the ones that fall within the range of applicability of the nearest neighbor model?

nando · February 26, 2015, 8:21am

The game simply uses whatever secondary structure prediction the folding engine makes for the given primary structure (the sequence). And there are no guarantees about the accuracy of that prediction.

I just found this: http://dx.doi.org/10.1007/978-1-61779…
It looks like a good starting point (especially since one of the authors, Matthew Seetin, used to be part of the EteRNA team). Unfortunately, I don’t have access, so I can’t say more about it.

jeanmichel.sellier · February 26, 2015, 8:24am

This is a VERY good link, thank you very much for that!

I love this game and I want to understand it in detail, so thanks again!

nando · February 26, 2015, 8:32am

My pleasure Jean-Michel

Eli_Fisker · February 26, 2015, 9:13am

There is a “peek inside the book” option:

jeanmichel.sellier · February 26, 2015, 9:17am

I will download the paper from my office, as I work at a university. Many thanks anyway.

Hyphema · February 27, 2015, 3:06am

What a fantastic conversation. A must-read for those interested in understanding RNA and the concepts of RNA folding and secondary structure. Good stuff, Nando and Jean-Michel! I also like the link to the nearest neighbor model description.