joshc

Karma: 1,744

If anyone builds it, everyone will plausibly be fine

joshc · 18 Sep 2025 20:03 UTC
29 points
24 comments · 7 min read · LW link

Recent Redwood Research project proposals

14 Jul 2025 22:27 UTC
91 points
0 comments · 3 min read · LW link

Alignment faking CTFs: Apply to my MATS stream

joshc · 4 Apr 2025 16:29 UTC
61 points
0 comments · 4 min read · LW link

Training AI to do alignment research we don’t already know how to do

joshc · 24 Feb 2025 19:19 UTC
45 points
24 comments · 7 min read · LW link