Contemporary AI models are not “aligned” in any sense that would help the slightest bit against a superintelligence. You need stronger guardrails against stronger AI capabilities, and current “alignment” doesn’t even prevent stuff like ChatGPT’s recent sycophancy, or jailbreaking.
Contemporary AI models are not “aligned” in any sense that would help the slightest bit against a superintelligence. You need stronger guardrails against stronger AI capabilities, and current “alignment” doesn’t even prevent stuff like ChatGPT’s recent sycophancy, or jailbreaking.