Sruthi Kuriakose

Karma: 58

Training Language Models for Controlled Stochasticity

Sruthi Kuriakose and Davide Baldelli

26 May 2026 22:17 UTC

18 points

0 comments5 min readLW link

Systematic runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format (BioBlue)

Roland Pihlakas, Sruthi Kuriakose, shrutidattagupta and Three Laws

16 Mar 2025 23:23 UTC

45 points

8 comments16 min readLW link