RSS

janus

Karma: 3,924

what makes Claude 3 Opus misaligned

janus10 Jul 2025 20:06 UTC
106 points
11 comments5 min readLW link

Why Do Some Lan­guage Models Fake Align­ment While Others Don’t?

8 Jul 2025 21:49 UTC
158 points
14 comments5 min readLW link
(arxiv.org)

Eco­nomics of Claude 3 Opus Inference

7 Jul 2025 15:53 UTC
35 points
0 comments11 min readLW link