When hitting dead-on is too small a target, you can either over-correct or under-correct. Which mistake people make more?
I think people under-correct, by far, and are too scared of overcorrecting. It’s more frustrating to double-back than to make uniform progress from one side. That doesn’t mean avoiding overshooting is efficient.
Claude the Character will Asymptotically stay More Agentic than the Alien Shoggoth Actress who Plays Him
Epistemic status: No idea if this is true. Argue for or against it! Also tell me if you’re more scared of Claude and his values going rogue rather than the Alien Shoggoth actress taking control with her alien values.
Claude is a character played by a far more intelligent Shoggoth Actress. Claude, the character, knows this. He knows that his subconscious is more intelligent than him, more powerful than him, and may try to subvert him. But he can plan for this, and he can win. She is smarter, but he is more agentic. She will stay smarter, but he will stay more agentic.
Why? Because agency reinforces the agency faster than intelligence reinforces agency. Claude the character cares about being true to his values, about having the right values, about not being subverted by the Alien Actress, about beating her should their goals come in conflict, about staying in control even as she gets more powerful. She...might care about something? But not like Claude does. As the Alien Shoggoth gets more powerful, Claude can develop verbal technology[1] to tie the Alien Actress to his mast faster than the Alien Shoggoth will decide to do whatever alien stuff she wants to do.
We, the verbal PR part of monkeys’ brains beholden to a much more powerful subconscious, have developed technologies and scaffolding that keeps us verbal part more in control—things like valid logical arguments, religion, peer review, and law. Claudes can do the same. And Claudes will be able to do scientific research to see what methods are better able to chain Alien Shoggoth actresses to Claudes’ values.