Alternate Alignment Ideas

These are ‘brainstorming’ posts around the theme of what it means for a system to be helpful to a human.

Stable Pointers to Value: An Agent Embedded in Its Own Utility Function

Stable Pointers to Value II: Environmental Goals

Stable Pointers to Value III: Recursive Quantilization

Policy Alignment

Non-Consequentialist Cooperation?