RobertM comments on RobertM’s Shortform

RobertM 25 Jan 2023 8:20 UTC
2 points
We have models that demonstrate superhuman performance in some domains without then taking over the world to optimize anything further. “When and why does this stop being safe” might be an interesting frame if you find yourself stuck.