Ram Potham

Karma: 41

Finding warning shots of loss of control in controlled scenarios and improving corrigibility in LLMs to mitigate that risk, while connecting this work to AI governance.

Previously built a VC-backed AI Agents startup and graduated with a degree in AI from Carnegie Mellon.

I subscribe to Crocker’s Rules and am especially interested to hear unsolicited constructive criticism: http://sl4.org/crocker.html (inspired by Daniel Kokotajlo).

(xkcd meme)

AI Control Methods Literature Review

Ram Potham, Apr 18, 2025, 9:15 PM
9 points
1 comment, 9 min read

Ram Potham’s Shortform

Ram Potham, Mar 23, 2025, 3:08 PM
1 point
13 comments