TsviBT comments on Abstract advice to researchers tackling the difficult core problems of AGI alignment

TsviBT 22 Nov 2025 16:16 UTC
6 points
I would be up for it. I truly don’t know if we actually disagree though; many of them might just say “yeah, it’s hard to tell whether this will get anywhere any time soon, but these seem like some natural next steps of investigation, with some degree of canonicalness, though this could take a really long time”. Or maybe many would actually say “yes, this is on the mainline for alignment research and could work in a small number of decades”; I don’t know. I guess my strongest position would be “there’s some other type of thing which would still be really hard and might not work, but which has a better shot”, which we could debate, though that would also be frustrating because my position is basically just a guess about methodology about theory, so it’s doubly/triply hard to find cruxes about.