Algon comments on All AGI Safety questions welcome (especially basic ones) [~monthly thread]

Algon 5 Nov 2022 20:58 UTC
1 point
0
Sure, that’s quite plausible. Though I should have been clear and said I wanted some examples of grokking in Deep RL. Mostly because I was thinking of running some experiments trying to prevent grokking ala Omnigrok and wanted to see what the best examples of grokking were.
- jacob_cannell 5 Nov 2022 22:01 UTC
  2 points
  0
  Parent
  Curios—why would you want to prevent grokking? Normally one would want to encourage it.
  - Algon 6 Nov 2022 0:07 UTC
    1 point
    0
    Parent
    To see if Omnigrok’s mechanism for enabling/stopping grokking works beyond the three areas they investigated. If it works, then we are more sure we know how to stop it occuring, and instead force the model to reach the same performance incrementally. Which might make it easier to predict future performance, but also just to get some more info about the phenomenon. Plus, like, I’m implementing some deep RL algorithms anyway, so might as well, right?