The standard LW argument is that the AI produces infinite paperclips because the human can’t successfully program the AI to do what he means rather than exactly what he programs into it.
Is that different from what I was saying? My memory of the sequences, and from standard AI literature is that of paperclip maximizers as ‘simple’ utility maximizers with hard-coded utility functions. It’s relatively straight-forward to write an AI with a self-modifiable goal system. It is also very easy to write a system where its goals are unchanging. The problem of FAI which EY spends significant time explaining in the sequences is that we have no simple goal that we can program into a steadfast goal-driven system, and result in a moral creature. Nor does it even seem possible to write down such a goal, short of encoding a random sampling of human brains in complete detail.
Is that different from what I was saying? My memory of the sequences, and from standard AI literature is that of paperclip maximizers as ‘simple’ utility maximizers with hard-coded utility functions. It’s relatively straight-forward to write an AI with a self-modifiable goal system. It is also very easy to write a system where its goals are unchanging. The problem of FAI which EY spends significant time explaining in the sequences is that we have no simple goal that we can program into a steadfast goal-driven system, and result in a moral creature. Nor does it even seem possible to write down such a goal, short of encoding a random sampling of human brains in complete detail.