Internal Alignment (Human)

Tag. Last edit: 16 Aug 2020 19:26 UTC by Raemon

Internal alignment is the state of being in alignment with yourself rather than in conflict with yourself. By default, humans sometimes experience internal conflict. You might frame that as conflict between subagents or subprocesses within the human, or you might instead frame it as a single agent making complicated decisions. The “internal alignment” hypothesis is that you can become much more productive, happier, and more fulfilled by getting yourself into alignment with yourself.

Tidying One’s Room

Zvi 16 Aug 2018 13:50 UTC

35 points

3 comments · 4 min read · LW link

(thezvi.wordpress.com)

Integrating disagreeing subagents

Kaj_Sotala 14 May 2019 14:06 UTC

141 points

15 comments · 21 min read · LW link

Notes on Integrity

David Gross 3 Dec 2020 23:42 UTC

18 points

1 comment · 7 min read · LW link

Non-Coercive Perfectionism

Matt Goldenberg 26 Jan 2021 16:53 UTC

24 points

25 comments · 3 min read · LW link

Announcing the Alignment of Complex Systems Research Group

Jan_Kulveit and technicalities

4 Jun 2022 4:10 UTC

91 points

20 comments · 5 min read · LW link

Artificial Moral Advisors: A New Perspective from Moral Psychology

David Gross 28 Aug 2022 16:37 UTC

25 points

1 comment · 1 min read · LW link

(dl.acm.org)

The shard theory of human values

Quintin Pope and TurnTrout

4 Sep 2022 4:28 UTC

235 points

66 comments · 24 min read · LW link · 2 reviews

Internal communication framework

rosehadshar and Nora_Ammann

15 Nov 2022 12:41 UTC

38 points

14 comments · 12 min read · LW link

My Model Of EA Burnout

LoganStrohl 25 Jan 2023 17:52 UTC

237 points

49 comments · 5 min read · LW link

Please don’t throw your mind away

TsviBT 15 Feb 2023 21:41 UTC

336 points

44 comments · 18 min read · LW link

Trust develops gradually via making bids and setting boundaries

Richard_Ngo 19 May 2023 22:16 UTC

125 points

12 comments · 4 min read · LW link

If you are too stressed, walk away from the front lines

Neil 12 Jun 2023 14:26 UTC

42 points

14 comments · 5 min read · LW link

Ruby 29 Jan 2021 20:30 UTC
2 points
Note: maybe find a starting sentence that defines the concept better.