I am broadly interested in theoretical computer science and neuroscience.
Recently I’ve been thinking more about gradual disempowerment risks due to AI and potential mitigation strategies.
Projects that I’m working on
Improving the discourse on the trajectory of AGI and its potential implications—Superposition
Some proposals on improving empowerment and accelerating AI policy making and governance
After the recent DoW vs Ant fiasco, I’m really skeptical if this is actually safety evals. One man’s safety evals is another man’s capabilities eval. And this seems like we are just don’t free labour for DoW