RSS

harrymayne

Karma: 37

A Pos­i­tive Case for Faith­ful­ness: LLM Self-Ex­pla­na­tions Help Pre­dict Model Behavior

26 Feb 2026 17:03 UTC
23 points
0 comments4 min readLW link

LLMs Don’t Know Their Own De­ci­sion Boundaries. Why Is This Im­por­tant?

17 Sep 2025 16:39 UTC
9 points
0 comments5 min readLW link
(arxiv.org)

Are re­cent LLMs bet­ter at rea­son­ing or bet­ter at mem­o­riz­ing?

7 Mar 2025 2:44 UTC
11 points
0 comments4 min readLW link