In­ter­nal Align­ment (Hu­man)

TagLast edit: 13 Jan 2021 22:54 UTC by plex

Internal Alignment is a broadly desirable state. By default, humans sometimes have internal conflict. You might frame that as conflict between subagents, or subprocesses within the human. You might instead frame it as a single agent making complicated decisions. The “internal alignment” hypothesis is that you can become much more productive/​happier/​fulfilled by getting yourself into alignment with yourself.

Tidy­ing One’s Room

Zvi16 Aug 2018 13:50 UTC
35 points
3 comments4 min readLW link

Non-Co­er­cive Perfectionism

Matt Goldenberg26 Jan 2021 16:53 UTC
21 points
25 comments3 min readLW link

An­nounc­ing the Align­ment of Com­plex Sys­tems Re­search Group

4 Jun 2022 4:10 UTC
65 points
17 comments5 min readLW link

In­te­grat­ing dis­agree­ing subagents

Kaj_Sotala14 May 2019 14:06 UTC
124 points
15 comments21 min readLW link

Notes on Integrity

David_Gross3 Dec 2020 23:42 UTC
14 points
1 comment7 min readLW link