This is the first technical approach to alignment I’ve seen that seems genuinely hopeful to me, rather than just another band-aid which won’t hold up to the stresses of a more intelligent model.
This is the first technical approach to alignment I’ve seen that seems genuinely hopeful to me, rather than just another band-aid which won’t hold up to the stresses of a more intelligent model.