Breaking Down Goal-Directed Behaviour

When we speak about entities ‘wanting’ things, or having ‘goal-directed behaviour’, what do we mean?

Here I aim to take steps to break down ‘goal-directed behaviour’ into a conceptual framework of computational abstractions for which I offer tentative terminology, and which helps me to better understand and describe analogies and disanalogies between various goal-directed systems. The overarching motivation is to better understand goal-directed behaviour, in the sense of being able to better predict its (especially counterfactual and off-distribution) implications, its arisal, and other properties. Hopefully it is clear why I consider this worthwhile.

Break­ing Down Goal-Directed Behaviour

You Only Get One Shot: an In­tu­ition Pump for Embed­ded Agency

De­liber­a­tion, Re­ac­tions, and Con­trol: Ten­ta­tive Defi­ni­tions and a Res­tate­ment of In­stru­men­tal Convergence

De­liber­a­tion Every­where: Sim­ple Examples