
RohanS

Karma: 670

Hi, I’m Rohan! I aim to promote welfare and reduce suffering as much as possible for all sentient beings, which has led me to work on AGI safety research. I am particularly interested in foundation model agents (FMAs): systems like Claude Code that equip foundation models with memory, tool use, and other affordances so they can perform multi-step tasks autonomously.

I am an AI labor grantmaker at Coefficient Giving. Previously, I founded Aether, an independent research lab focused on foundation model agent safety. I am also on leave from my PhD at the University of Toronto, where I am supervised by Professor Zhijing Jin. Before that, I completed an undergrad in CS and Math at Columbia, where I helped run Columbia Effective Altruism and Columbia AI Alignment Club (CAIAC). I have done research internships with AI Safety Hub Labs (now LASR Labs), UC Berkeley’s Center for Human-Compatible AI (CHAI), and the ML Alignment & Theory Scholars (MATS) program.

I love playing tennis, listening to rock and indie pop music, playing social deduction games, reading fantasy books, watching a fairly varied set of TV shows and movies, and playing the saxophone, among other things.

What’s Continual Learning, and Why Might We Expect To See It In Advanced LLM Agents?

12 Jun 2026 18:43 UTC
24 points
2 comments, 17 min read, LW link

Implications of Continual Learning for LLM Agents: Introduction

12 Jun 2026 18:36 UTC
39 points
0 comments, 6 min read, LW link

Should We Train Against (CoT) Monitors?

RohanS, 23 Apr 2026 19:19 UTC
50 points
7 comments, 33 min read, LW link

[Paper] How does information access affect LLM monitors’ ability to detect sabotage?

11 Feb 2026 21:25 UTC
26 points
0 comments, 6 min read, LW link