The theory will be a lot more useful once actual leakage ratios are estimated. This paper was mathematically specific, because the purpose of it was to establish a few equations to use when estimating the Friendliness ratio and constraints to AI projects. It was written more to build a mathematical foundation for that than it was a simple intro of the ideas to most readers.
Obviously this was meant as more of a research article than a blog post, but we felt like LessWrong was a good place to publish it given the subject.
It was written more to build a mathematical foundation
That’s a pretty simple mathematical foundation for a toy problem.
How about introducing uncertainty into your framework? You will not be dealing with hard numbers, you will be dealing with probability distributions and that makes things considerably more complex.
This paper was focused on establishing the mentioned models instead of estimating input values. If the models are considered useful, there should be more research to estimate these numbers. The models could also be improved to incorporate uncertainty, the growing returns of research, and other important limitations that we haven’t considered. Finally, the friendliness ratio concept naturally generalizes to other technology induced existential risks.
The theory will be a lot more useful once actual leakage ratios are estimated. This paper was mathematically specific, because the purpose of it was to establish a few equations to use when estimating the Friendliness ratio and constraints to AI projects. It was written more to build a mathematical foundation for that than it was a simple intro of the ideas to most readers.
Obviously this was meant as more of a research article than a blog post, but we felt like LessWrong was a good place to publish it given the subject.
That’s a pretty simple mathematical foundation for a toy problem.
How about introducing uncertainty into your framework? You will not be dealing with hard numbers, you will be dealing with probability distributions and that makes things considerably more complex.
Just pointing out that they have considered this: