janus

Karma: 3,917

what makes Claude 3 Opus misaligned

janus, 10 Jul 2025 20:06 UTC
105 points
11 comments, 5 min read

Why Do Some Language Models Fake Alignment While Others Don’t?

8 Jul 2025 21:49 UTC
158 points
14 comments, 5 min read
(arxiv.org)

Economics of Claude 3 Opus Inference

7 Jul 2025 15:53 UTC
37 points
0 comments, 11 min read