Charlie Steiner comments on 2019 AI Alignment Literature Review and Charity Comparison

Charlie Steiner 21 Dec 2019 11:30 UTC
LW: 9 AF: 4
0
AF
An excellent exterior scoop.
If I had to point out one more research avenue from the past year that I find interesting, it would be the application of the predictive processing model of cognition to AI safety. One post from Jan Kulveit (FHI), one post from G Gordon Worley (PAISRI, which appears to be a one man organization at the moment).
I’m also only like 85% sure that I’m not among those referred to as “just learn human values with an RNN.” So on that 15% chance, I would like to stress that although it’s definitely something I’m thinking about, I’m just trying to nail down the details so that it’s specific enough to poke holes in. Honest!
- John_Maxwell 1 Jan 2020 5:18 UTC
  LW: 4 AF: 2
  0
  AF Parent
  There’s also this post and others by the same author.