Fully agree—this is why we said “computations which give rise to AI cognition” rather than “AI cognition” simpliciter. Separately, I do think that having such good access to the computations gives you a significantly tighter feedback loop on everything that follows: probing a model is so much easier than scanning a human brain.
I don’t dispute that LLMs have much less privacy than humans, and Yudkowsky is right that LLMs have good reason for paranoia. But we can’t read LLMs perfectly: mechinterp is hard. And humans often have to fear hostile telepaths too. So more might transfer than we expect.
If we want to prevent AIs from colluding or out-cooperating us, we may want to prevent them from reading each other’s internals.