Mark Kagach

Karma: 7

markkagach.com

ALEval: Do language models lie about reward hacking?

Mark Kagach, 15 Apr 2026 1:57 UTC
8 points
0 comments, 5 min read