Tom Davidson comments on Making deals with early schemers

Tom Davidson 27 Jun 2025 9:04 UTC
LW: 2 AF: 1
0
AF
However, I think there is a somewhat different approach that is much cheaper which is to train (versions of) AIs purely for the purpose of studying scheming (with no intention of deploying these systems) and then to make the training of these systems intentionally very diverse from the AIs we actually deploy.
Great idea.