RSS

Sruthi Kuriakose

Karma: 58

Train­ing Lan­guage Models for Con­trol­led Stochasticity

26 May 2026 22:17 UTC
18 points
0 comments5 min readLW link

Sys­tem­atic run­away-op­ti­miser-like LLM failure modes on Biolog­i­cally and Eco­nom­i­cally al­igned AI safety bench­marks for LLMs with sim­plified ob­ser­va­tion for­mat (BioBlue)

16 Mar 2025 23:23 UTC
45 points
8 comments16 min readLW link