Eliezer, it is not clear you even approve of changes due to learning new facts, as you’d distrust an AI persuading you only via telling you new facts. And yes if your approval of the outcome depends on the order in which you heard arguments then you also need a way to distinguish approved from not approved argument orders.
Eliezer, it is not clear you even approve of changes due to learning new facts, as you’d distrust an AI persuading you only via telling you new facts. And yes if your approval of the outcome depends on the order in which you heard arguments then you also need a way to distinguish approved from not approved argument orders.