OpenAI Superalignment: Weak-to-strong generalization

Link post