There’s not going to be one right answer.
The outcome pump. This cashes out “wants” in a non-anthropomorphic way: a system “wants” an outcome just insofar as it reliably selects for futures where that outcome holds. John Wentworth has some good work applying this framing in less obvious ways. (A toy sketch appears after this list.)
Model-based RL. Potentially brain-inspired. This is what I try to think about most of the time.
Model-free RL. I think a lot of inner alignment arguments, and also some “shard theory”-style arguments, implicitly rely on a background model-free RL picture. (The second sketch after this list contrasts the model-free and model-based frames.)
Predictive models. Large language models are important, and people often interpret them as a prototype for future AI.
Anthropomorphism. Usually not a valid frame, but people reach for it anyway.
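
As a toy illustration of the outcome-pump framing (my own sketch, not from any canonical source): the pump “wants” an outcome only in the sense that it filters over possible futures and keeps the ones where the outcome holds. Rejection sampling captures that selection-without-desire idea. Everything here, including the random-walk “world,” is a made-up example:

```python
import random

def outcome_pump(simulate_future, want, n_samples=10_000):
    """Sample random futures and keep only those where `want` holds.

    The 'wanting' is purely selectional: there are no inner desires,
    just a filter applied to sampled futures.
    """
    return [f for f in (simulate_future() for _ in range(n_samples)) if want(f)]

# Toy world: a "future" is the final position of a 100-step random walk.
def simulate_future():
    return sum(random.choice([-1, 1]) for _ in range(100))

# The pump's "want": end up at position 10 or higher.
futures = outcome_pump(simulate_future, lambda x: x >= 10)
print(f"{len(futures)} of 10000 sampled futures satisfy the condition")
```

The point of the exercise: nothing in this code has goals in any psychological sense, yet the output distribution is systematically steered toward the condition, which is the non-anthropomorphic cash-value of “wants.”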
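
And a minimal sketch contrasting the model-based and model-free frames on a made-up five-state chain MDP (all names, rewards, and hyperparameters are illustrative assumptions, not anything from the sources above). The model-based agent decides by explicitly rolling a transition model forward; the model-free agent decides from values cached during training, with no model access at decision time:

```python
import random

# Toy five-state chain MDP: states 0..4, actions -1/+1,
# reward 1.0 whenever a transition lands on the goal state.
N_STATES, GOAL = 5, 4

def step(state, action):
    next_state = max(0, min(N_STATES - 1, state + action))
    return next_state, (1.0 if next_state == GOAL else 0.0)

# Model-based frame: choose actions by explicitly rolling the transition
# model forward a few steps and taking the best-looking branch.
def model_based_action(state, depth=4):
    def lookahead(s, d):
        if d == 0:
            return 0.0
        return max(r + lookahead(s2, d - 1)
                   for s2, r in (step(s, a) for a in (-1, 1)))
    def score(a):
        s2, r = step(state, a)
        return r + lookahead(s2, depth - 1)
    return max((-1, 1), key=score)

# Model-free frame: tabular Q-learning. No lookahead at decision time;
# the agent acts from cached values. Exploration here is uniformly
# random, which is fine because Q-learning is off-policy.
def train_q_table(episodes=500, alpha=0.5, gamma=0.9):
    q = {(s, a): 0.0 for s in range(N_STATES) for a in (-1, 1)}
    for _ in range(episodes):
        s = 0
        for _ in range(20):
            a = random.choice((-1, 1))
            s2, r = step(s, a)
            q[(s, a)] += alpha * (r + gamma * max(q[(s2, b)] for b in (-1, 1)) - q[(s, a)])
            s = s2
    return q

q = train_q_table()
print("model-based agent at state 2 picks:", model_based_action(2))
print("model-free agent at state 2 picks:", max((-1, 1), key=lambda a: q[(2, a)]))
```

Both agents end up picking the same action here, but for structurally different reasons, and that structural difference is what the two frames disagree about when applied to questions like inner alignment.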