viluon
Karma: 48
Robustness of Model-Graded Evaluations and Automated Interpretability
Simon Lermen and viluon
15 Jul 2023 19:12 UTC, 44 points, 5 comments, 9 min read