RSS

janus

Karma: 3,884

what makes Claude 3 Opus misaligned

janus10 Jul 2025 20:06 UTC
103 points
11 comments5 min readLW link

Why Do Some Lan­guage Models Fake Align­ment While Others Don’t?

8 Jul 2025 21:49 UTC
152 points
14 comments5 min readLW link
(arxiv.org)

Eco­nomics of Claude 3 Opus Inference

7 Jul 2025 15:53 UTC
34 points
0 comments11 min readLW link