Former MATS scholar working on scalable oversight and adversarial robustness.