
RohanS

Karma: 514

Hi, I’m Rohan! I aim to promote welfare and reduce suffering as much as possible for all sentient beings, which has led me to work on AGI safety research. I am particularly interested in foundation model agents (FMAs): systems like AutoGPT and Operator that equip foundation models with memory, tool use, and other affordances so they can perform multi-step tasks autonomously.
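For readers unfamiliar with FMAs, the core idea is a simple loop: a foundation model repeatedly reads its task plus a memory of what it has done so far, optionally calls a tool, and appends the result to memory until it produces an answer. Below is a minimal illustrative sketch of that loop, assuming hypothetical stubs: `call_model` and the `search` tool here stand in for a real LLM API and real tools, and this is not Aether's code or any particular framework's API.

```python
from typing import Callable

def call_model(prompt: str) -> str:
    """Stand-in for a real foundation model API call (hypothetical stub)."""
    # A real agent would send `prompt` to an LLM; here we return a canned reply.
    return "FINAL: (model output would go here)"

# Hypothetical tool registry: name -> function from argument string to result string.
TOOLS: dict[str, Callable[[str], str]] = {
    "search": lambda query: f"(search results for {query!r})",
}

def run_agent(task: str, max_steps: int = 5) -> str:
    memory: list[str] = [f"Task: {task}"]          # running memory of past steps
    for _ in range(max_steps):
        # The model sees the task plus everything it has done so far.
        output = call_model("\n".join(memory))
        if output.startswith("FINAL:"):            # model signals completion
            return output.removeprefix("FINAL:").strip()
        if output.startswith("TOOL:"):             # model requests a tool call
            name, _, arg = output.removeprefix("TOOL:").partition(" ")
            result = TOOLS.get(name, lambda a: "unknown tool")(arg)
            memory.append(f"Tool {name} returned: {result}")
        else:
            memory.append(output)                  # plain reasoning step
    return "No final answer within step budget."

print(run_agent("Summarize a recent AI safety paper"))
```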

I am the founder of Aether, an independent research lab focused on foundation model agent safety. I’m also a PhD student at the University of Toronto, where I am supervised by Professor Zhijing Jin and continue to run Aether. Previously, I completed an undergrad in CS and Math at Columbia, where I helped run Columbia Effective Altruism and Columbia AI Alignment Club (CAIAC). I have done research internships with AI Safety Hub Labs (now LASR Labs), UC Berkeley’s Center for Human-Compatible AI (CHAI), and the ML Alignment & Theory Scholars (MATS) program.

I love playing tennis, listening to rock and indie pop music, playing social deduction games, reading fantasy books, watching a fairly varied set of TV shows and movies, and playing the saxophone, among other things.

A quick, elegant derivation of Bayes’ Theorem

RohanS · 23 Jan 2026 1:40 UTC
34 points
5 comments · 1 min read · LW link

Exploring Reinforcement Learning Effects on Chain-of-Thought Legibility

6 Jan 2026 3:04 UTC
41 points
3 comments · 21 min read · LW link

Aether is hiring technical AI safety researchers

5 Jan 2026 22:27 UTC
20 points
0 comments · 2 min read · LW link

Hidden Reasoning in LLMs: A Taxonomy

25 Aug 2025 22:43 UTC
73 points
12 comments · 12 min read · LW link

How we spent our first two weeks as an independent AI safety research group

11 Aug 2025 19:32 UTC
32 points
0 comments · 10 min read · LW link

Extract-and-Evaluate Monitoring Can Significantly Enhance CoT Monitor Performance (Research Note)

8 Aug 2025 10:41 UTC
51 points
7 comments · 10 min read · LW link

Efficiently Detecting Hidden Reasoning with a Small Predictor Model

13 Jul 2025 16:04 UTC
34 points
3 comments · 16 min read · LW link

Aether July 2025 Update

1 Jul 2025 21:08 UTC
26 points
7 comments · 3 min read · LW link

RohanS’s Shortform

RohanS · 31 Dec 2024 16:11 UTC
3 points
35 comments · 1 min read · LW link

~80 Interesting Questions about Foundation Model Agent Safety

28 Oct 2024 16:37 UTC
48 points
4 comments · 15 min read · LW link

Transformers Explained (Again)

RohanS · 22 Oct 2024 4:06 UTC
4 points
0 comments · 18 min read · LW link

Apply to Aether—Independent LLM Agent Safety Research Group

RohanS · 21 Aug 2024 9:47 UTC
13 points
0 comments · 7 min read · LW link
(forum.effectivealtruism.org)

Notes on “How do we become confident in the safety of a machine learning system?”

RohanS · 26 Oct 2023 3:13 UTC
4 points
0 comments · 13 min read · LW link

Quick Thoughts on Language Models

RohanS · 18 Jul 2023 20:38 UTC
6 points
0 comments · 4 min read · LW link

~100 Interesting Questions

RohanS · 30 Mar 2023 13:57 UTC
53 points
18 comments · 9 min read · LW link

A Thorough Introduction to Abstraction

RohanS · 13 Jan 2023 0:30 UTC
9 points
1 comment · 18 min read · LW link

Content and Takeaways from SERI MATS Training Program with John Wentworth

RohanS · 24 Dec 2022 4:17 UTC
28 points
3 comments · 12 min read · LW link

Follow along with Columbia EA’s Advanced AI Safety Fellowship!

RohanS · 2 Jul 2022 17:45 UTC
3 points
0 comments · 2 min read · LW link
(forum.effectivealtruism.org)