That would be true for any AI safety plan that relies on a precise definition of agency. I agree that some safety approaches (like direct cosmopolitan value alignment) would probably need this, but I mostly don’t like those plans and think of agents as an empirical cluster.
That would be true for any AI safety plan that relies on a precise definition of agency. I agree that some safety approaches (like direct cosmopolitan value alignment) would probably need this, but I mostly don’t like those plans and think of agents as an empirical cluster.