Kei Nishimura-Gasparian comments on Will Any Crap Cause Emergent Misalignment?

Kei Nishimura-Gasparian 30 Aug 2025 13:48 UTC
1 point
0
Am I correctly understanding that the effect size shown in the graph is very small? It seems like the mean harmfulness score is not much higher for any of the evals, even if the effect size is technically statistically significant.