1a3orn comments on Bentham’s Bulldog is wrong about AI risk

1a3orn 30 Jan 2026 19:03 UTC
7 points
7
Agreed.

Also note that these two properties are quite compatible with many things often believed to be incompatible with them! i.e., an AI that can be jailbreaked to be bad (with sufficient effort) could still meet these criteria.
- RogerDearnaley 30 Jan 2026 20:51 UTC
  3 points
  0
  Parent
  And yes, reliability is also a big fat hairy problem. Especially jailbreaking, where we have an actual, mathematical proof that LLMs are, and will always be, jailbreakable.