I want a formalism capable of modelling and imitating how humans handle these situations, and we don’t usually have dynamic consistency (nor do boundedly rational agents).
Now, I don’t want to weaken requirements “just because”, but it may be that dynamic consistency is too strong a requirement to properly model what’s going on. It’s also useful to have AIs model human changes of morality, to figure out what humans count as values, so getting closer to human reasoning would be necessary.
Boundedly rational agents definitely can have dynamic consistency, I guess it depends on just how bounded you want them to be. IIUC what you’re looking for is a model that can formalize “approximately rational but doesn’t necessary satisfy any crisp desideratum”. In this case, I would use something like my quantitative AIT definition of intelligence.
I want a formalism capable of modelling and imitating how humans handle these situations, and we don’t usually have dynamic consistency (nor do boundedly rational agents).
Now, I don’t want to weaken requirements “just because”, but it may be that dynamic consistency is too strong a requirement to properly model what’s going on. It’s also useful to have AIs model human changes of morality, to figure out what humans count as values, so getting closer to human reasoning would be necessary.
Boundedly rational agents definitely can have dynamic consistency, I guess it depends on just how bounded you want them to be. IIUC what you’re looking for is a model that can formalize “approximately rational but doesn’t necessary satisfy any crisp desideratum”. In this case, I would use something like my quantitative AIT definition of intelligence.