aksh-n

Karma: 27

An ML engineer and ethicist turned AI alignment researcher.

Training Deliberative Monitors for Black-Box Scheming Detection

aksh-n, adityasinha, Victor Gillioz, Simon Storf, Kilian Merkelbach, richbc, Axel Højmark and Marius Hobbhahn

4 Jun 2026 16:43 UTC

33 points

5 comments6 min readLW link

Contextual Constitutional AI

aksh-n28 Sep 2024 23:24 UTC

16 points

2 comments12 min readLW link