Was on Vivek Hebbar’s team at MIRI, now working with Adrià Garriga-Alonso onvarious empirical alignment projects.
I’m looking for projects in interpretability, activation engineering, and control/oversight; DM me if you’re interested in working with me.
What do you think about the possibility that there is an undetected outbreak of ~1000 cases in Indonesia? There’s indirect evidence based on people traveling from Indonesia to countries with better testing (e.g. Singapore). This post and the Metaculus post I linked both seem convincing, but it’s unlikely that they’re both true.