Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
ryan_greenblatt comments on
Interpretability is the best path to alignment
ryan_greenblatt
6 Sep 2025 2:26 UTC
3
points
0
See also
To be legible, evidence of misalignment probably has to be behavioral
.
Back to top
See also To be legible, evidence of misalignment probably has to be behavioral.