Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
Sruthi Kuriakose
Karma:
58
All
Posts
Comments
New
Top
Old
Training Language Models for Controlled Stochasticity
Sruthi Kuriakose
and
Davide Baldelli
26 May 2026 22:17 UTC
18
points
0
comments
5
min read
LW
link
Systematic runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format (BioBlue)
Roland Pihlakas
,
Sruthi Kuriakose
and
shrutidattagupta
16 Mar 2025 23:23 UTC
45
points
8
comments
16
min read
LW
link
Back to top