We have not solved the problem of “programmer” deception, I still see AIs deceiving users. We’ve reduced the rate of deception to the point where the AIs have value despite the deception rate, and changed usage patterns to account for the possibility of deception.
We also haven’t completed a method for utility indifference.
We have not solved the problem of “programmer” deception, I still see AIs deceiving users. We’ve reduced the rate of deception to the point where the AIs have value despite the deception rate, and changed usage patterns to account for the possibility of deception.
We also haven’t completed a method for utility indifference.