Mario Giulianelli

Karma: 26

Associate Professor of Computational Linguistics at University College London and Member of the European Laboratory for Learning and Intelligent Systems

Mario Giulianelli 15 Apr 2026 9:25 UTC
1 point
on: From personas to intentions: towards a science of motivations for AI models
Really interesting. Do you think beliefs and values can be good primitives? (Even assuming we have beliefs and values as primitives, if the state space is very large or even infinite, we can’t enumerate all states and check their values, so we still need something more compact, structured, and interpretable.)

We recently worked on a simplified version of this problem using a combination of behavioural evals and analysis of model internals. The state space is much more manageable in grid worlds, but we’re excited about (1) formalising goals, goal inference, goal generalisation, etc., and (2) scaling up the approach to complex, realistic tasks.

A Behavioural and Representational Evaluation of Goal-directedness in Language Model Agents

Gabriele Sarti, Raghu Arghal, ndalton, Fade Chen, Evgenii Kortukov, Calum McNamara, Angelos Nalmpantis, Moksh Nirvaan and Mario Giulianelli

5 Mar 2026 1:08 UTC

20 points

0 comments, 7 min read, LW link

Modelling, Measuring, and Intervening on Goal-directed Behaviour in AI Systems

Mario Giulianelli, Raghu Arghal, Fade Chen, ndalton, Evgenii Kortukov, Calum McNamara, Angelos Nalmpantis, Moksh Nirvaan and Gabriele Sarti

31 Oct 2025 1:28 UTC

15 points

0 comments, 8 min read, LW link
