Undergraduate CS student working on mechanistic interpretability and empirical AI safety. Currently focused on residual stream representations and AI alignment.
IvanC