Update: xAI says that the load-bearing thing for avoiding bio/chem misuse from Grok 4 is not inability but safeguards, and that Grok 4 robustly refuses “harmful queries.” So I think Igor is correct. If the Grok 4 misuse safeguards are ineffective, that shows that xAI failed at a basic safety thing it tried (and either doesn’t understand that or is lying about it).
I agree it would be a better indication of future-safety-at-xAI if xAI said “misuse mitigations for current models are safety theater.” That’s just not its position.