RSS

janus

Karma: 3,933

what makes Claude 3 Opus misaligned

janus10 Jul 2025 20:06 UTC
109 points
12 comments5 min readLW link

Why Do Some Lan­guage Models Fake Align­ment While Others Don’t?

8 Jul 2025 21:49 UTC
158 points
14 comments5 min readLW link
(arxiv.org)

Eco­nomics of Claude 3 Opus Inference

7 Jul 2025 15:53 UTC
34 points
0 comments11 min readLW link