RSS

Moksh Nirvaan

Karma: 14

Model­ling, Mea­sur­ing, and In­ter­ven­ing on Goal-di­rected Be­havi­our in AI Systems

31 Oct 2025 1:28 UTC
8 points
0 comments8 min readLW link

Prob­ing Power-Seek­ing in LLMs

Moksh Nirvaan13 Aug 2025 16:04 UTC
7 points
0 comments12 min readLW link

Will AGI Emerge Through Self-Gen­er­ated Re­ward Loops?

Moksh Nirvaan30 Jul 2025 13:17 UTC
5 points
0 comments1 min readLW link