johnswentworth comments on Study Guide

johnswentworth 11 Nov 2021 16:35 UTC
4 points
I don’t have recommendations for courses or principles to select them beyond what’s in the post. (Otherwise I would have put them in the post.)
I don’t have any sense of what the math of agency and alignment is like, and I hope to get a feel for it sometime in the next year, but I can’t right now — by the way, any recommendations on how to do that?
I don’t think you’re going to find anybody with existing good answers. The embedded agency sequence is the best articulation of the problems which I currently know of. (Even there I disagree with the degree of emphasis placed on various subproblems/frames, but it is nonetheless very good.)
If you want a useful tarting point to think about these things yourself: ask how to calculate the world-model and preferences of an e-coli directly from a low-level specification of the cell (i.e. all the reaction dynamics and concentrations and forces and whatnot).