Linch comments on Linch’s Shortform

Linch 23 Apr 2026 1:14 UTC
2 points
0
The higher degree of verbalized eval awareness of 4.7 Opus over Mythos (by all accounts a bigger and smarter, but earlier trained, model) is some weak evidence in favor of my view. If recent models’ greater eval awareness is primarily a factor of greater general intelligence, we should expect eval awareness to go monotonically up with broader model capabilities.