Do-What-I-Mean hierarchy

WikiLast edit: 7 Jun 2016 2:38 UTC by Eliezer Yudkowsky

Do-What-I-Mean refers to an aligned AGI’s ability to produce better-aligned plans, based on an explicit model of what the user wants or believes.

Successive levels of DWIM-ness:

Risks from pushing toward higher levels of DWIM might include:

No comments.