The higher degree of verbalized eval awareness of 4.7 Opus over Mythos (by all accounts a bigger and smarter, but earlier trained, model) is some weak evidence in favor of my view. If recent models’ greater eval awareness is primarily a factor of greater general intelligence, we should expect eval awareness to go monotonically up with broader model capabilities.
The higher degree of verbalized eval awareness of 4.7 Opus over Mythos (by all accounts a bigger and smarter, but earlier trained, model) is some weak evidence in favor of my view. If recent models’ greater eval awareness is primarily a factor of greater general intelligence, we should expect eval awareness to go monotonically up with broader model capabilities.