Well, the manager in your case is not doing RL on honesty, it’s more like doing RL on “honest-looking task completion” which can either lead to honest task completion or dishonesty that isn’t caught. Not too appreciably different than AI training here.
Well, the manager in your case is not doing RL on honesty, it’s more like doing RL on “honest-looking task completion” which can either lead to honest task completion or dishonesty that isn’t caught. Not too appreciably different than AI training here.