Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
shrutidattagupta
Karma:
9
All
Posts
Comments
New
Top
Old
Systematic runaway-optimiser-like LLM failure modes on Biologically and Economically aligned AI safety benchmarks for LLMs with simplified observation format (BioBlue)
Roland Pihlakas
,
Sruthi Kuriakose
and
shrutidattagupta
16 Mar 2025 23:23 UTC
45
points
8
comments
13
min read
LW
link
Back to top