It is confusing that you are using Claude to analyze its own outputs and those of its peers. I would have preferred a close textual analysis, quoting passages from Claude and offering your comments about what is going on intellectually or computationally. What exactly is the “filler, hedging, and soft-pedaling” that you accuse the LLMs of producing?
You’re right that a higher-effort post could have been better in the specific way you suggest. That said, the linked chat with Claude is mostly me doing exactly that.
It is confusing that you are using Claude to analyze its own outputs and those of its peers. I would have preferred a close textual analysis, quoting passages from Claude and offering your comments about what is going on intellectually or computationally. What exactly is the “filler, hedging, and soft-pedaling” that you accuse the LLMs of producing?
You’re right that a higher-effort post could have been better in the specific way you suggest. That said, the linked chat with Claude is mostly me doing exactly that.
I linked to the Claude chat in the post. Here’s the Grok chat, in which my behavior is also mostly commenting on what I think is going on: https://grok.com/share/bGVnYWN5_3b87f5f4-ea45-483d-ac9d-12a8262bbed8
I had some difficulty figuring out how to share the ChatGPT transcripts in a usable form, but eventually I asked Claude to put them into readable form, and got this somewhat (explicitly) abridged document that I spot-checked and looks okay: https://docs.google.com/document/d/17HPLCxHij74CgFf2AGp2LydtlAzeyUsy/edit?usp=sharing&ouid=101317127625593501338&rtpof=true&sd=true
Thanks for the Grok link, I was awfully curious about that chat after the way you characterized it in your chat with Claude!