My wife, who uses LLMs pretty much all day, says that Claude Opus 4.6 feels more ‘mature’ than 4.5.
Anthropomorphizing models is dangerous, but it’s always a bit of a delight when I notice us talking about software using human personality traits. It clearly points at coherent concepts, pieces of software now have distinct ‘personalities’ that compare naturally to those of humans.
Anyway, I use LLMs a fair bit too, and tend to agree with my wife’s assessment. Anecdotally, it is more cautious, more likely to catch itself going on a tangent, and tends towards more of a neutral stance than 4.5.
My wife, who uses LLMs pretty much all day, says that Claude Opus 4.6 feels more ‘mature’ than 4.5.
Anthropomorphizing models is dangerous, but it’s always a bit of a delight when I notice us talking about software using human personality traits. It clearly points at coherent concepts, pieces of software now have distinct ‘personalities’ that compare naturally to those of humans.
Anyway, I use LLMs a fair bit too, and tend to agree with my wife’s assessment. Anecdotally, it is more cautious, more likely to catch itself going on a tangent, and tends towards more of a neutral stance than 4.5.
Does this match your experience?