RSS

Benjamin Arnav

Karma: 106

https://​​benjaminarnav.com

Ensem­ble mon­i­tor­ing for AI con­trol: di­verse sig­nals out­weigh more compute

31 May 2026 1:21 UTC
12 points
0 comments7 min readLW link

Toward a Bet­ter Eval­u­a­tions Ecosystem

Benjamin Arnav5 May 2026 22:29 UTC
24 points
0 comments5 min readLW link

Un­faith­ful Rea­son­ing Can Fool Chain-of-Thought Monitoring

2 Jun 2025 19:08 UTC
78 points
17 comments3 min readLW link