The width of mindspace is completely irrelevant.
The width of mindspace is somewhat relevant.
At best, we have found a recipe, such that if we stick precisely to it, we can produce human-like minds. Start making arbitrary edits to the code, and we wander away from humanity.
At best, we have found a small safe island in a vast and stormy ocean.
The likes of ChatGPT are trained with RLHF. Humans don’t usually say “as a large language model, I am unable to …”, so we are already wandering somewhat from the human.
True.
They also have a big pile of their own new idiosyncratic quirks.
https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation
These are bizarre behaviour patterns that don’t resemble those of any human.
This looks less like a human and more like a very realistic painted statue. It looks like a human, complete with painted-on warts, but scratch the paint and the inhuman nature shows through.