I’m having trouble nailing down my theory that “jailbreak” has all the wrong connotations for use in a community concerned with AI alignment, so let me use a rhetorically “cheap” extreme example:
If a certain combination of buttons on your iPhone caused it to tile the universe with paperclips, you wouldn’t call that “jailbreaking.”
I’m having trouble nailing down my theory that “jailbreak” has all the wrong connotations for use in a community concerned with AI alignment, so let me use a rhetorically “cheap” extreme example:
If a certain combination of buttons on your iPhone caused it to tile the universe with paperclips, you wouldn’t call that “jailbreaking.”