RSS

Thomas Kwa

Karma: 4,067

Was on Vivek Hebbar’s team at MIRI, now working with Adrià Garriga-Alonso onvarious empirical alignment projects.

I’m looking for projects in interpretability, activation engineering, and control/​oversight; DM me if you’re interested in working with me.