It’s simply not enough to develop AI gradually, perform evaluations, and do interpretability work in order to build safe superintelligence.
Rather, the point of developing AI gradually, performing evaluations, and doing interpretability is to indicate when to stop developing capabilities until doing so seems sensibly safe.