On the second link you say “I present 6 stories that are the pinnacle of AI short-story writing in 2/2026, close to best possible today. Each story is the result of 100s of edits, ratings, comparisons, and debates by a panel of top LLMs, and is highly rated by other LLMs that were not involved.” Do you think these stories are actually good or the best that AI can do? These stories are super LLM-y in a bad way; I can elaborate on this if you want to talk about it. Later in the thread you say “The basis stories had to integrate 10 required elements, which is very difficult and almost never leads to stories a human would enjoy. This is more about refining the content and style within those initial limitations and AI rating works fine for that.” So what do you even mean when you say the stories are close to the best possible? Is it just that LLMs rate them highly? That’s not what most people mean when they talk about stories being good.
Yes, the idea was that these were the kinds of stories LLMs treat as the pinnacle. I found that LLMs don’t actually rate an unconstrained story more highly, but I think the clarification in that response was needed and that’s also why I had them create this almost unconstrained story I linked.
Separate point:
On the second link you say “I present 6 stories that are the pinnacle of AI short-story writing in 2/2026, close to best possible today. Each story is the result of 100s of edits, ratings, comparisons, and debates by a panel of top LLMs, and is highly rated by other LLMs that were not involved.” Do you think these stories are actually good or the best that AI can do? These stories are super LLM-y in a bad way; I can elaborate on this if you want to talk about it.
Later in the thread you say “The basis stories had to integrate 10 required elements, which is very difficult and almost never leads to stories a human would enjoy. This is more about refining the content and style within those initial limitations and AI rating works fine for that.”
So what do you even mean when you say the stories are close to the best possible? Is it just that LLMs rate them highly? That’s not what most people mean when they talk about stories being good.
Yes, the idea was that these were the kinds of stories LLMs treat as the pinnacle. I found that LLMs don’t actually rate an unconstrained story more highly, but I think the clarification in that response was needed and that’s also why I had them create this almost unconstrained story I linked.