This is a sequence investigating the feasibility of one approach to AI alignment: value learning.
1. Ambitious Value Learning
2. Goals vs Utility Functions
Ambitious value learning aims to give the AI the correct utility function to avoid catastrophe. Given its difficulty, we revisit the arguments for utility functions in the first place.