Thomas Kwa

Karma: 4,108

Was on Vivek Hebbar’s team at MIRI, now working with Adrià Garriga-Alonso on various empirical alignment projects.

I’m looking for projects in interpretability, activation engineering, and control/oversight; DM me if you’re interested in working with me.

My experience with the “rationalist uncanny valley”

Thomas Kwa · 23 Apr 2020 20:27 UTC
66 points
18 comments · 5 min read · LW link