I realize that some of the details may be proprietary, but can you say anything more about the process by which Claude is trained to follow this constitution? I assume it gets baked in much deeper so that it impacts models weights in a way that, say, if I handed it the constitution document in CLAUDE.md it wouldn’t, but how does it differ from, say, merely putting the constitution in the training set, which I assume would not have a sufficiently strong effect on the model’s behavior.
I realize that some of the details may be proprietary, but can you say anything more about the process by which Claude is trained to follow this constitution? I assume it gets baked in much deeper so that it impacts models weights in a way that, say, if I handed it the constitution document in
CLAUDE.mdit wouldn’t, but how does it differ from, say, merely putting the constitution in the training set, which I assume would not have a sufficiently strong effect on the model’s behavior.I think the “open character training” paper is probably a good place to look