Raghav Singhal

Karma: 108

AI PhD @ EPFL | Foundation Models, AI Safety

Raghav Singhal 16 Jun 2026 16:43 UTC
1 point
0
on: Model Spec Midtraining: Improving How Alignment Training Generalizes
This is very cool work, thanks!

I was just reading the paper and realised all training (MSM and AFT) was presumably done with LoRA.

Could you elaborate on this choice and why it was done? Wouldn’t LoRA introduce some artifacts of its own? And did you maybe run some experiments with full fine-tuning as well?
Thanks a lot!

Raghav Singhal 30 May 2026 12:27 UTC
1 point
0
in reply to: TEnglert’s comment on: Synthetic Persona Pretraining: Alignment from Token Zero
We chose the rule-oriented constitutional framework both for clarity and practical reasons. This seemed to be a reasonable and scalable way to generate reflections. It might be possible that better ways exist for this; we plan to improve our synthetic data generation pipeline for future scaling runs.

Raghav Singhal 30 May 2026 12:24 UTC
2 points
0
in reply to: RogerDearnaley’s comment on: Synthetic Persona Pretraining: Alignment from Token Zero
That sounds great! We would love to chat about this. One of our main priorities right now is to reliably measure the effect of our pretraining on model internals.

Synthetic Persona Pretraining: Alignment from Token Zero

Julian Minder, Raghav Singhal, Viktor Moskvoretskii, Stefan Krsteski, ashtonanderson, rolandaydin and Robert West

20 May 2026 14:16 UTC

118 points

27 comments17 min readLW link