Archive
Sequences
About
Search
Log In
Questions
Events
Shortform
Alignment Forum
AF Comments
Home
Featured
All
Tags
Recent
Comments
RSS
tobypullan
Karma:
24
All
Posts
Comments
New
Top
Old
Reasoning Long Jump: Why we shouldn’t rely on CoT monitoring for interpretability
tobypullan
26 Jan 2026 10:10 UTC
9
points
2
comments
6
min read
LW
link
Pro or Average Joe? Do models infer our technical ability and can we control this judgement?
tobypullan
12 Jan 2026 20:52 UTC
12
points
0
comments
9
min read
LW
link
Back to top