Donald Hobson comments on Contra Yudkowsky on AI Doom

Donald Hobson 24 Apr 2024 0:39 UTC
0 points
0
mirroring much or our seemingly idiosyncratic cognitive biases, quirks, and limitations.
True.
They also have a big pile of their own new idiosyncratic quirks.
https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation
These are bizarre behaviour patterns that don’t resemble any humans.
This looks less like a human, and more like a very realistic painted statue. It looks like a human, complete with painted on warts, but scratch the paint, and the inhuman nature shows through.
The width of mindspace is completely irrelevant.
The width of mindspace is somewhat relevant.
At best, we have found a recipe, such that if we stick precisely to it, we can produce human-like minds. Start making arbitrary edits to the code, and we wander away from humanity.
At best we have found a small safe island in a vast and stormy ocean.

The likes of chatGPT are trained with RLHF. Humans don’t usually say “as a large language model, I am unable to …” so we are already wandering somewhat from the human.