Background in mathematics (descriptive set theory, Banach spaces) and game theory (mostly zero-sum, imperfect-information games). CFAR mentor. Usually doing alignment research.

Formalizing Objections against Surrogate Goals

Risk Map of AI Systems

