I research intelligence and it’s emergence and expression in neural networks to ensure advanced AI is safe and beneficial.
Current interests: neural network interpretability, alignment/safety, unsupervised learning, and deep learning theory.
For more, check out my scholar profile and personal website.
2 votes
Overall karma indicates overall quality.
0 votes
Agreement karma indicates agreement, separate from overall quality.
Thanks! Yep, makes sense—that’s one of the things we’ll be working on and hope to share some results soon!