Roman Leventov comments on Five neglected work areas that could reduce AI risk

Roman Leventov 24 Sep 2023 12:08 UTC
3 points
0
There is an argument that evaluating AI models should be formalised, i.e., turned into verification: see https://arxiv.org/abs/2309.01933 (and discussion on Twitter with Yudkowsky and Davidad).