Interesting point about personality improvements being a “one-off unhobbling” with diminishing returns. But I wonder if this reflects a measurement bias rather than an actual capability ceiling: we have clear benchmarks for evaluating math skills—it’s easy to measure 100x improvement when a model goes from solving basic algebra to proving novel theorems. But how do we quantify personality improvements? There’s a vast gap between “helpful but generic” and “perfectly attuned to individual users’ needs, communication styles, and thinking patterns.”
I can imagine future models that feel like they truly understand me personally, anticipate my unstated needs, communicate in exactly my preferred style, and adapt their approach based on my emotional state—something far beyond current implementations. The lack of obvious metrics for these qualities doesn’t mean the improvement ceiling is low, just that we’re not good at measuring them yet. Thoughts?
One move is to notice when a question structurally demands an EFA and redirect. “What’s your p(doom)?” and “What are your timelines?” are illustrative examples.
P(doom) conflates misalignment, misuse, structural risks, accidents—each with different threat models, different interventions, different probability estimates. You’re not searching one space incompletely; you’re searching multiple ill-defined spaces simultaneously. Better questions: “Doom meaning extinction, or including permanent dystopia/lock-in?”, “This century, or ever?”, or “From misalignment specifically, or all AI-related causes?”
“What are your timelines” has the same problem. The confidence the answer implies isn’t justified by the enumeration you did to arrive at it. Better questions: “What capability thresholds matter for your work?” and “What assumptions does your theory of change depend on?”
Both redirect to questions that reveal models and cruxes without requiring exhaustive enumeration—and force scope definition as a precondition for the conversation meaning anything.
When AI doom talk has become such a cliché that SNL gets laughs, maybe the standard questions need work.