«Boundaries» Sequence

In this short sequence of posts, I aim to circumscribe a causal pathway from

  1. a key missing idea in the utility-theoretic foundations of game theory, leading to

  2. some problems I think I see in effective altruism discourse, leading further to

  3. gaps in some approaches to AI alignment, and finally,

  4. implications for existential risk.

By default, I’m writing one post for each of the above points, since they have different epistemic statuses and can be debated separately. Posts 1 and 3 will be somewhat technical and research-oriented, and cross-posted to the Alignment Forum, whereas 2 and 4 will be non-technical and community-oriented, and cross-posted to the EA Forum. After that, there might be more posts in the sequence, depending on the ensuing conversation. In any case, I’ll try to keep this index post updated with the full sequence.

Here goes!

«Boundaries» Sequence (Index Post)

«Boundaries», Part 1: a key missing concept from utility theory

«Boundaries», Part 2: trends in EA’s handling of boundaries

«Boundaries», Part 3a: Defining boundaries as directed Markov blankets

«Boundaries», Part 3b: Alignment problems in terms of boundaries