aksh-n

Karma: 27

An ML engineer and ethicist turned AI alignment researcher.

Training Deliberative Monitors for Black-Box Scheming Detection

4 Jun 2026 16:43 UTC
32 points
3 comments, 6 min read, LW link

Contextual Constitutional AI

aksh-n, 28 Sep 2024 23:24 UTC
16 points
2 comments, 12 min read, LW link