If I had to point out one more research avenue from the past year that I find interesting, it would be the application of the predictive processing model of cognition to AI safety: one post from Jan Kulveit (FHI) and one from G Gordon Worley (PAISRI, which appears to be a one-man organization at the moment).
I’m also only like 85% sure that I’m not among those referred to as “just learn human values with an RNN.” So on that 15% chance, I would like to stress that although it’s definitely something I’m thinking about, I’m just trying to nail down the details so that it’s specific enough to poke holes in. Honest!
An excellent exterior scoop.
There’s also this post and others by the same author.