[Strategy market] Tetraloop similarity

Eli_Fisker · June 5, 2011, 5:22pm

I would like a strategy that adds points for tetraloops with similar energy content: four alike (a quadruplet) is best, two pairs of twin tetraloops is next best.

For designs with 4 tetraloops:
If all tetraloops are similar in energy content, add 5 points
If there are two pairs of twin tetraloops (2 x 2 identical tetraloops), add 5 points
If three tetraloops are similar in energy content, add 3 points
If two tetraloops are similar and the rest are uneven, add 2 points
If all tetraloops are different, add 0 points

For designs with only 3 tetraloops:
If all 3 tetraloops are similar, add 5 points
If only 2 tetraloops are similar, add 3 points
If none are similar, add 0 points
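
The point rules above could be sketched roughly as follows. This is only an illustration, not the EteRNA team's implementation: the function names, the grouping method, and the `tolerance` value that decides when two loops count as "similar" are all assumptions, and designs with any other number of tetraloops simply fall through to 0.

```python
from typing import Sequence, Tuple

# Points for each pattern of "similar energy" group sizes, copied from the
# rules in the post. Any pattern not listed here scores 0.
POINTS = {
    (4,): 5,          # all four tetraloops similar
    (2, 2): 5,        # two pairs of twins
    (3, 1): 3,        # three similar, one different
    (2, 1, 1): 2,     # one pair similar, the rest uneven
    (1, 1, 1, 1): 0,  # all different
    (3,): 5,          # all three similar
    (2, 1): 3,        # two of three similar
    (1, 1, 1): 0,     # none similar
}


def group_sizes(energies: Sequence[float], tolerance: float = 0.3) -> Tuple[int, ...]:
    """Group sorted energies; a loop joins the current group while it lies
    within `tolerance` of that group's lowest energy (hypothetical rule)."""
    sizes = []
    lowest = None
    for energy in sorted(energies):
        if lowest is not None and energy - lowest <= tolerance:
            sizes[-1] += 1
        else:
            sizes.append(1)
            lowest = energy
    return tuple(sorted(sizes, reverse=True))


def tetraloop_similarity_score(energies: Sequence[float], tolerance: float = 0.3) -> int:
    """Return the points for a design, given one energy per tetraloop."""
    return POINTS.get(group_sizes(energies, tolerance), 0)


if __name__ == "__main__":
    # Made-up loop energies, only to show the call.
    print(tetraloop_similarity_score([4.5, 4.5, 5.6, 5.6]))
```

The lookup table keeps the points easy to retune, which matters if the numbers are only rough starting values.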

JeehyungLee · June 8, 2011, 7:35am

Dear Eli,

Your strategy has been added to our implementation queue with task id 5. You can check the schedule of the implementation here.

Unfortunately, we are having a bit of a problem calculating local loop energies - we’ll get to your algorithm as soon as we fix the problem!

EteRNA team

desaic · June 26, 2011, 3:57am

Hi Eli,

Does this strategy only apply to puzzles with three or four tetraloops? Does it mean that any other design should get a default score of zero?

Desai

Eli_Fisker · June 26, 2011, 7:27am

Hi Desai!

I’m glad you asked, this was a good question. I just checked the Asymmetry lab. Here the pattern is similar in some cases, that is, the loops are energy twins.

But another pattern occurs, and that’s why I won’t include two loops. In the Asymmetry lab, where there are different numbers of nucleotides between the arms and the arms are different too, it seems to be enough that the energies of the tetraloops are close to each other (0.2-0.4 difference).

So make my strategy for just 3 and 4 tetraloops and give the rest of the designs a default score of zero. I will post a separate market strategy for two tetraloops in asymmetric designs, as they do not follow the same clear pattern as the symmetric designs.

But if we make symmetric designs with just two loops in the future, I might want them included.

Eli

JeehyungLee · June 29, 2011, 3:22am

Dear Eli,

We are glad to report that your strategy has been implemented and tested.

While implementing your strategy, we have made small changes to the parameters you specified to optimize the performance.

Note that we’ll always run an optimization over the parameters you specify, so you won’t have to worry about fine-tuning all the numbers you use.

Just the idea and rough numbers are enough to run your algorithm!

Length : Your strategy was implemented with 40 lines of code.

Ordering : We ran your strategy on all synthesized designs and ordered them based on predicted scores. The correlation of your strategy’s ordering with the ordering based on the actual scores was 0.165. (1.0 is the best score, -1.0 is the worst score. A completely random prediction would have 0 correlation.)
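
For readers wondering how a correlation between two orderings can be computed, here is a minimal sketch. The post does not say which statistic was used; this example assumes a Spearman-style rank correlation (Pearson correlation of the ranks, with tied scores sharing an average rank), and the function names are made up for illustration.

```python
from math import sqrt
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 upward; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 if either input has no variation."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def rank_correlation(predicted: Sequence[float], actual: Sequence[float]) -> float:
    """Correlation between the ordering by predicted and by actual scores."""
    return pearson(average_ranks(predicted), average_ranks(actual))
```

Handling ties matters here, because a strategy that only hands out 0, 2, 3 or 5 points will give many designs the same predicted score.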

Please note that the numbers specified above will change in the future, as we’ll rerun your algorithm whenever new synthesis data is available.

More detailed results have been posted on the strategy market page. Thank you for sharing your idea, and we look forward to other brilliant strategies from you!

Eli_Fisker · March 13, 2014, 11:43pm

It is fully possible to break the pattern with similar tetraloop energy. I tried it myself in several cloud lab designs on purpose.

http://eterna.cmu.edu/web/browse/view…

I want to adjust my strategy as I see a clearer picture arise. My first strategy is not fully wrong; it is just possible to get around it if the stems in the design are stable enough to hold the loop, and in particular if the design has longer stems. However, even among the long-stem designs, the lower-scoring ones tend to have more loops with uneven energy.

Where I see more of a pattern now is in designs with shorter stems on a multiloop. There is a higher prevalence of uneven energy distribution in end loops among the lower-scoring designs than among the winners. It is still fully possible to break the pattern. Still, there is a tendency, and I think it will be more pronounced in designs with stems of 4 bp and shorter. This should be reflected in the strategy, for the best use of it.

New strategy
Give a double penalty for uneven energy in same-size end loops when the stems are 4 bp or shorter. The bigger the difference, the worse. I also believe there is an optimal energy range for each loop size.

I think that if the stems are well constructed and stable in themselves, a design has a chance even though its loops are very different in energy. However, if the stems have some instabilities, loop energies that are too different will tip the house of cards. I think this goes for bigger end loops too, though a certain percentage of energy difference is allowed.
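
One possible reading of the new strategy, written as a rough sketch: the `EndLoop` fields, the pairwise comparison, and the choice to double the penalty when either stem of a pair is short are all assumptions made for illustration. The allowed percentage of energy difference and the optimal energy range per loop size are not modeled here.

```python
from dataclasses import dataclass
from itertools import combinations
from typing import Sequence


@dataclass
class EndLoop:
    size: int         # number of unpaired nucleotides in the end loop
    energy: float     # loop energy as reported by the folding engine
    stem_length: int  # base pairs in the stem that closes the loop


def uneven_loop_penalty(
    loops: Sequence[EndLoop],
    short_stem_bp: int = 4,          # "4 bp and shorter" from the post
    short_stem_factor: float = 2.0,  # "double penalty" from the post
) -> float:
    """Sum the energy differences between every pair of same-size end loops.

    A pair's difference is doubled when at least one of the two loops sits
    on a short stem (an assumed reading of the rule).
    """
    penalty = 0.0
    for a, b in combinations(loops, 2):
        if a.size != b.size:
            continue  # only loops of the same size are compared
        difference = abs(a.energy - b.energy)
        if min(a.stem_length, b.stem_length) <= short_stem_bp:
            difference *= short_stem_factor
        penalty += difference
    return penalty
```

Because the penalty grows with the size of the energy difference, it follows "the bigger the difference, the worse" without needing a fixed cutoff.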

Exceptions

http://eterna.cmu.edu/game/solution/2…

Hyphema found another of the exceptions in the Half branches lab, in designs by Merriskies and wondered.

Hyphema: been looking more at the Half of the Branches lab and have no idea why Merriskies scored so high with the exception that the tetraloops were so asymmetric energetically
Hyphema: thankfully merriskies had another similar design synthd and it did poorly and the big difference was the boosting of the tetraloop
Eli Fisker: Uneven energy in tetraloop is allowed
Hyphema: but was that the key?
Hyphema: or just some fluke?
Eli Fisker: but I think it generally goes better for designs with long stems
Hyphema: certainly that
Eli Fisker: Usually this goes worse in designs with shorter stems
Eli Fisker: However I think it flies as the two stems are quite different

Here are the designs that Hyphema was thinking about: http://eterna.cmu.edu/game/browse/742…

I’m still not sure that what I say about loop energy similarity is the full truth, but I think it is closer to the truth this time. It will be interesting to see what it ends up with.

Hyphema · March 18, 2014, 5:39pm

Just noticed this addition you made, Eli, and thank you! It should be noted that the SHAPE signals were weak for both designs. And I am not sure about the scoring system here, but I wonder if it was based on a scaling system where the 97 winner was just the top design, although overall weakly folded based on the SHAPE values.

Eli_Fisker · March 18, 2014, 6:14pm

Thx Hyphema!

Yes, when you brought up the raw SHAPE values while we were discussing the designs, they seemed to have rather low signals overall compared to other labs. You are correct that the data gets scaled.

The raw numbers themselves vary from lab to lab. I’m still struggling with understanding this part of the SHAPE numbers. However Nando gives a decent explanation in here:

https://getsatisfaction.com/eternagam…