Mikhail Samin comments on Shallow review of live agendas in alignment & safety

Mikhail Samin 4 Dec 2023 12:07 UTC
1 point
0

try to formalise a more realistic agent, understand what it means for it to be aligned with us, […], and produce desiderata for a training setup that points at coherent AGIs similar to our model of an aligned agent.

Finally, people are writing good summaries of the learning-theoretic agenda!