avturchin comments on Why there is still one instance of Eliezer Yudkowsky?

avturchin 30 Oct 2025 12:55 UTC
5 points
1
Yes, if MIRI spends a year on building as good model of Yudkowsky as possible, it can help in alignment and its measurable and doable thing. They can later ask that model about failure modes of other AIs and it will cry “Misaligned!”