Evgenii Kortukov

Karma: 0

Aspiring AI safety researcher. Currently doing my PhD at Fraunhofer HHI in Berlin, focusing on LLM interpretability. Interested in the internal structure underlying safety-relevant behaviors in LLMs: prompt injections, jailbreaks, deception.

Modelling, Measuring, and Intervening on Goal-directed Behaviour in AI Systems

31 Oct 2025 1:28 UTC
8 points
0 comments · 8 min read · LW link