Ambitious value learning aims to give the AI the correct utility function to avoid catastrophe. Given its difficulty, we revisit the arguments for utility functions in the first place.

Intuitions about goal-directed behavior

Rohin Shah1 Dec 2018 4:25 UTC

54 points

15 comments6 min readLW link

Coherence arguments do not entail goal-directed behavior

Rohin Shah3 Dec 2018 3:26 UTC

129 points

69 comments7 min readLW link 3 reviews

Will humans build goal-directed agents?

Rohin Shah5 Jan 2019 1:33 UTC

61 points

43 comments5 min readLW link

AI safety without goal-directed behavior

Rohin Shah7 Jan 2019 7:48 UTC

68 points

15 comments4 min readLW link

3. Narrow Value Learning

What is narrow value learning?

Rohin Shah10 Jan 2019 7:05 UTC

23 points

3 comments2 min readLW link

Ambitious vs. narrow value learning

paulfchristiano12 Jan 2019 6:18 UTC

24 points

16 comments4 min readLW link

Human-AI Interaction

Rohin Shah15 Jan 2019 1:57 UTC

34 points

10 comments4 min readLW link

Reward uncertainty

Rohin Shah19 Jan 2019 2:16 UTC

26 points

3 comments5 min readLW link

The human side of interaction

Rohin Shah24 Jan 2019 10:14 UTC

19 points

5 comments4 min readLW link

Following human norms

Rohin Shah20 Jan 2019 23:59 UTC

30 points

10 comments5 min readLW link

Future directions for narrow value learning

Rohin Shah26 Jan 2019 2:36 UTC

12 points

4 comments4 min readLW link

Conclusion to the sequence on value learning

Rohin Shah3 Feb 2019 21:05 UTC

51 points

20 comments5 min readLW link