james__p

Karma: 60

Building Black-box Scheming Monitors

29 Jul 2025 17:41 UTC
38 points
18 comments, 11 min read

Intro to Multi-Agent Safety

james__p, 13 Apr 2025 17:40 UTC
12 points
0 comments, 5 min read

Conditional Importance in Toy Models of Superposition

james__p, 2 Feb 2025 20:35 UTC
9 points
4 comments, 10 min read

Thoughts on Toy Models of Superposition

james__p, 2 Feb 2025 13:52 UTC
5 points
2 comments, 9 min read

Reflections on ML4Good

james__p, 25 Nov 2024 2:40 UTC
13 points
0 comments, 1 min read