(Given the likely training time for Orion, which became GPT4.5, this seems to be in line with Altman’s reputation for misleading statements).
I don’t think this holds—there was just under 2 years between the letter and the 4.5 release, so I doubt a 6 month pause would have affected training timelines for 4.5 directly.
Really not sure what heuristic leads you to count people working on ARC-Theory working on an ambitious, speculative version of interp as working on alignment but not any of the people working to build from current interp paradigms. Similarly, anyone working on e.g. making models more honest in prod models is in fact learning a bunch of lessons about what scalable oversight looks like (albeit not publishing, which i agree is sad). Or doing any science of misalignment, or doing any empirical character work, or experimenting with making models adhere to a spec, or carefully understanding their generalisation patterns, or just trying to understand what the actual objects that we are creating right now are??
It seems like having any current interaction with frontier models is seen as disqualifying for actually doing alignment work?