Noosphere89 comments on “Behaviorist” RL reward functions lead to scheming