Undergraduate CS student working on mechanistic interpretability and empirical AI safety. Currently focused on residual stream representations and AI alignment.
IvanC
Karma: 0
Undergraduate CS student working on mechanistic interpretability and empirical AI safety. Currently focused on residual stream representations and AI alignment.