Sam Altman once mentioned a test: don’t train an LLM (or other AI system) on any text about consciousness, then see whether the system still reports having inner experiences unprompted. I would predict that a normal LLM would not. At least not if we are careful to also remove all implied consciousness, which would exclude most text written by humans.
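For concreteness, here is a minimal sketch of what removing explicit discourse about consciousness from a training corpus might look like. The keyword list and function names are my own illustrative assumptions, not part of the proposed test, and a real version would also have to catch the implied consciousness mentioned above, which a keyword filter cannot do.

```python
# Illustrative only: a naive keyword filter that drops documents which
# explicitly discuss consciousness. The term list is an assumption for the
# sake of the example; it will not catch text that merely implies an
# inner life without naming it.
CONSCIOUSNESS_TERMS = {
    "consciousness", "conscious", "sentience", "sentient",
    "qualia", "subjective experience", "inner experience",
    "self-aware", "phenomenal",
}

def mentions_consciousness(doc: str) -> bool:
    """Return True if the document explicitly uses any flagged term."""
    text = doc.lower()
    return any(term in text for term in CONSCIOUSNESS_TERMS)

def filter_corpus(docs: list[str]) -> list[str]:
    """Keep only documents with no explicit consciousness vocabulary."""
    return [d for d in docs if not mentions_consciousness(d)]

if __name__ == "__main__":
    corpus = [
        "The weather model predicts rain tomorrow.",
        "Philosophers debate whether qualia are physical.",
    ]
    print(filter_corpus(corpus))  # keeps only the first document
```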
I second this prediction, and would go further: just removing explicit discourse about consciousness should be sufficient.

With a sufficiently strong LLM, I think you could still elicit reports of an inner dialogue if you prompt lightly, e.g. “put yourself in the shoes of...”. That’s because inner monologues are implied in many reasoning processes, even when they are not mentioned explicitly.