Any good posts/papers discussing “handover”? e.g. the handover of AI research to AI R&D agents (the plan of the original OpenAI Superalignment team). I’m also interested in any adjacent research agendas which might help the handover succeed.
Some of the more relevant work i’ve read (other than this post) are Wentworth’s slop post, various scalable oversight/safety case papers, automation collapse.
Any good posts/papers discussing “handover”? e.g. the handover of AI research to AI R&D agents (the plan of the original OpenAI Superalignment team). I’m also interested in any adjacent research agendas which might help the handover succeed.
Some of the more relevant work i’ve read (other than this post) are Wentworth’s slop post, various scalable oversight/safety case papers, automation collapse.