I believe they were mainly inspired by Demski and Garrabrant, but we were in contact for the last few months and I’m glad to see that some of my recent work was applicable. We arrived at the idea of using a joint distribution with a grain of truth independently, and they introduce a novel “RUI” construction, but also study (what I’ve been calling) AEDT wrt rOSI in section 5.2. The differences are pretty technical, IMO the RUI approach is halfway between rOSI and logical induction.
It’s so long that even I’m still reading it, and I got a copy early. Assuming you’re familiar with Solomonoff induction / AIXI / embedded agency (which it sounds like you are) the core of it is section 3 and section 5 (particularly 5.1-5.3 I think). The appendix is like 100 pages and so far doesn’t seem essential unless you want to extend the results (also some of it will be familiar if you read my GOT paper).
Author here. We were heavily inspired by multiple things, including Demski and Garrabrant, the 1990′s work of Kalai and Lehrer, empirical work in our group inspired by neuroscience pointing towards systems that predict their own actions, and the earlier work on reflective oracles by Leike . We were not aware of @Cole Wyeth et al.’s excellent 2025 paper which puts the reflective oracle work on firmer theoretical footing, as our work was (largely but not entirely) done before this paper appeared.
I believe they were mainly inspired by Demski and Garrabrant, but we were in contact for the last few months and I’m glad to see that some of my recent work was applicable. We arrived at the idea of using a joint distribution with a grain of truth independently, and they introduce a novel “RUI” construction, but also study (what I’ve been calling) AEDT wrt rOSI in section 5.2. The differences are pretty technical, IMO the RUI approach is halfway between rOSI and logical induction.
It’s so long that even I’m still reading it, and I got a copy early. Assuming you’re familiar with Solomonoff induction / AIXI / embedded agency (which it sounds like you are) the core of it is section 3 and section 5 (particularly 5.1-5.3 I think). The appendix is like 100 pages and so far doesn’t seem essential unless you want to extend the results (also some of it will be familiar if you read my GOT paper).
Author here. We were heavily inspired by multiple things, including Demski and Garrabrant, the 1990′s work of Kalai and Lehrer, empirical work in our group inspired by neuroscience pointing towards systems that predict their own actions, and the earlier work on reflective oracles by Leike . We were not aware of @Cole Wyeth et al.’s excellent 2025 paper which puts the reflective oracle work on firmer theoretical footing, as our work was (largely but not entirely) done before this paper appeared.