Martin Randall comments on Daniel Kokotajlo’s Shortform

Martin Randall 9 Apr 2026 21:46 UTC
7 points
7
We have not solved the problem of “programmer” deception, I still see AIs deceiving users. We’ve reduced the rate of deception to the point where the AIs have value despite the deception rate, and changed usage patterns to account for the possibility of deception.

We also haven’t completed a method for utility indifference.