Eli brought to my attention that a lot of lab data has been (recently?) published, both in-game and in the RDAT files. I was especially interested in the round 85 results because it is appears to be the first good set of data for the Reproducibility lab in a long time. But the first design I looked at, 2333413/2676335/Triloop Hairpin Test didn’t make sense in the Eterna UI. I compared it against the RDAT files and found several problems.
Here’s a screenshot of the display.
Here are the issues I see:
-
The display is in target mode, but the target structure isn’t right. There should be three unpaired bases at the 5’ end, not 1. As a result, all of the SHAPE scores are misaligned with the structure.
-
The synthesis score is 80/100. This is probably a result of the misalignment with the target. But even when I realigned it in my head, it didn’t look like the results I expected. So I turned on the SHAPE values. Here’s a closeup.
-
Note that the base 49 is a bright yellow, even though its value is 0.40, and base 51 is still quite yellow at a value of 0.28. So it appears that legacy scaling is being used. I certainly thought that the UI had switched to absolute scaling. Is this a regression bug?
-
I never fully understood how legacy scaling was computed, but from observing it, it didn’t quite make sense to me that it should set the threshold for this design so low. So I cross-checked against the RDAT file. Here’s the main entry:
This part of the RDAT files contains the general description of the design, without the base-by-base data. Notice that the target structure and Eterna score are plausible (as opposed to what is displayed in the UI.) But the min, max and threshold values don’t seem right, but do seem consistent with the UI.
- To verify that the min_SHAPE and max_SHAPE values aren’t right, here’s the section from the RDAT files that contains the SHAPE values. Note that within a single RDAT file, separate entries for the same design share a small integer identifier, in this case 949.
As a cross-check to make sure I was looking at the right data, I’ve highlighted (magenta) the values for bases 48-51, which match up with the values reported in the UI. I’ve also highlighted the actual minimum and maximum SHAPE scored (green) which are -0.02 and 2.00, contrary to what the the min_SHAPE and max_SHAPE values in the screenshot above.
I haven’t looked further to determine whether all the labs in this synthesis round have the same issues, or whether it affects more synthesis rounds than 85.