Hi, I joined a few days ago and I’m looking forward to contributing to this great community.
I’m transitioning back to research from startups. Currently based in London.
I’m particularly interested in mechanistic interpretability, chain-of-thought monitoring, and reasoning model interpretability. I’m excited to engage with the thoughtful discussions here on alignment and to collaborate with others.
What’s your view on sceptical claims about RL on transformer LMs, such as https://arxiv.org/abs/2504.13837v2, or the claim that CoT instructions yield better results than <thinking> training?