harrymayne

Karma: 319

Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models

10 Jun 2026 17:58 UTC
170 points
3 comments, 4 min read

Negation Neglect: When models fail to learn negations in training

18 May 2026 18:37 UTC
119 points
37 comments, 8 min read

A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior

26 Feb 2026 17:03 UTC
26 points
0 comments, 4 min read

LLMs Don’t Know Their Own Decision Boundaries. Why Is This Important?

17 Sep 2025 16:39 UTC
9 points
0 comments, 5 min read
(arxiv.org)

Are recent LLMs better at reasoning or better at memorizing?

7 Mar 2025 2:44 UTC
11 points
0 comments, 4 min read