I haven’t used it myself, but it seems like a reasonably good platform if you want to test reward tampering stuff. DeepMind actually did a similar thing recently based on the game “Baba is You.” That being said, most of the experiments here aren’t really about reward tampering, so they don’t really need the embeddedness you get from AIXIjs or Baba is You (and I’m not that excited about reward tampering research in general).