hadad

Karma: 22

hadad 29 May 2026 3:17 UTC
1 point
0
on: Claude… doesn’t know who you are?
Opus 4.8 is showing regressions on some benchmarks too (e.g. VendingBench 2) relative to 4.7. So I would argue the stylometric identification failure is mainly symptomatic of a more general capabilities regression in Opus 4.8, not anything specific.

Attack Selection In Agentic AI Control Evals Can Decrease Safety

14 Apr 2026 18:02 UTC

24 points