I agree this is really important—particularly because I think many of the theoretical arguments for expecting misalignment provide empirical comparative hypotheses. Being able to look at semi-independent replicates of behaviour relies on old models being available. I don’t know the best way forward because I doubt any frontier lab would release old models under a CC license—maybe some kind of centralised charitable foundation.
I agree this is really important—particularly because I think many of the theoretical arguments for expecting misalignment provide empirical comparative hypotheses. Being able to look at semi-independent replicates of behaviour relies on old models being available. I don’t know the best way forward because I doubt any frontier lab would release old models under a CC license—maybe some kind of centralised charitable foundation.