Also notable IMO that GPT-5 in the METR report is doing the new thing where it does ’ “ instead of actually saying a word (seemingly for various words) which o3 did not do.
Wanted ' ".
Ok.
But forging above ' ".
Ear illusions.
Better: We'll ' ".
Now final code steps:
5) After training we will Save the improved " ".
structures:
' ".
Now overshadow.
But the illusions of ' ".
Now to code.
But we must ensure to maintain optimizer ' ".
Ok.
Now sedation.
But we will maintain ' ".
Now Balanced.
Also notable IMO that GPT-5 in the METR report is doing the new thing where it does
’ “instead of actually saying a word (seemingly for various words) which o3 did not do.