RSS

Nelson Gardner-Challis

Karma: 35

[Paper] When can we trust un­trusted mon­i­tor­ing? A safety case sketch across col­lu­sion strategies

10 Mar 2026 17:28 UTC
44 points
0 comments6 min readLW link

Wi­den­ing AI Safety’s tal­ent pipeline by meet­ing peo­ple where they are

25 Sep 2025 20:50 UTC
33 points
3 comments8 min readLW link