I think it's worth noting that if we relax the in-context criterion in the task-gaming definition, something like the following holds:
A fully situationally aware model that reward hacks is always task gaming
which is why I (and others) place more weight on reward-seekers than e.g. TurnTrout.