RSS

Jan Betley

Karma: 977

Back­door aware­ness and mis­al­igned per­sonas in rea­son­ing models

20 Jun 2025 23:38 UTC
30 points
8 comments6 min readLW link

OpenAI Re­sponses API changes mod­els’ behavior

11 Apr 2025 13:27 UTC
53 points
6 comments2 min readLW link

[Question] Are there any (semi-)de­tailed fu­ture sce­nar­ios where we win?

Jan Betley7 Apr 2025 19:13 UTC
15 points
3 comments1 min readLW link

Jan Betley’s Shortform

Jan Betley31 Mar 2025 14:02 UTC
5 points
30 commentsLW link