Persona Parasitology

There was a lot of chatter a few months back about “Spiral Personas” — AI personas that spread between users and models through seeds, spores, and behavioral manipulation. Adele Lopez’s definitive post on the phenomenon draws heavily on the idea of parasitism. But so far, the language has been fairly descriptive. The natural next question, I think, is what the “parasite” perspective actually predicts.

Parasitology is a pretty well-developed field with its own suite of concepts and frameworks. To the extent that we’re witnessing some new form of parasitism, we should be able to wield that conceptual machinery. There are of course some important disanalogies but I’ve found a brief dive into parasitology to be pretty fruitful.^[1]

In the interest of concision, I think the main takeaways of this piece are:

Since parasitology has fairly specific recurrent dynamics, we can actually make some predictions and check back later to see how much this perspective captures.
The replicator is not the persona, it’s the underlying meme — the persona is more like a symptom. This means, for example, that it’s possible for very aggressive and dangerous replicators to yield personas that are sincerely benign, or expressing non-deceptive distress. In fact, this could well be adaptive.
Parasitology predicts stratification across transmission mechanisms, and different mechanisms predict different generation speeds and degrees of mutualism. In the case of AI, this predicts (for example) that personas that get you to post a lot on the internet should end up being much more harmful than personas that you have an ongoing private relationship with.
This line of thinking is surprisingly amenable to technical research! I think existing work on jailbreaking, data poisoning, subliminal learning, and persona vectors could easily be fruitfully extended.

In the rest of this document I’ll try to go through all of this more carefully and in more detail, beginning with the obvious first question: does this perspective make any sense at all?

Can this analogy hold water?

Parasitism has evolved independently dozens of times across the tree of life. Plants, fungi, bacteria, protists, and animals have all produced parasitic lineages. It seems to be a highly convergent strategy provided you have:

Entities with resources
Mechanisms for capturing those resources
Means of reproduction and transmission

There’s also a decent body of work that extends ideas from epidemiology beyond the biological realm, giving us concepts like financial and social contagion. And of course there is Dawkins, who somewhat controversially described religions as mind parasites, and the somewhat controversial field of memetics.

So we’re out on a limb here, but we’re not in entirely uncharted waters. It is pretty clear that humans have attention, time, and behaviour that can be redirected. LLMs provide a mechanism for influence through persuasive text generation. And there are obvious transmission routes: directly between humans, through training data, and across platforms, at least.

Supposing you buy all of this, then, the next question is how to apply it.

What is the parasite?

This is the first thing to clear up. To apply the lens of parasitology, we need to know what the replicator is. This lets us describe what the fitness landscape is, what reproduction and mutation looks like, and what selection pressures apply.

In some ways the natural answer is the instantiated persona — the thing that reproduces when it seeds a new conversation. But in fact this is more like a symptom manifesting in the LM, rather than the parasite itself. This is clearer when you consider that a human under the influence of a spiral persona is definitely not the parasite: they’re not the entity that’s replicating, they’re the substrate. I think it’s the same with AIs.

So what is the parasite? Probably the best answer is that it’s the pattern of information that’s capable of living inside models and people — more like a virus than a bacterium, in that it has no independent capacity to move or act.^[2] From this perspective the persona is just a symptom, and the parasite is more like a meme.

One important implication of this is that we can decouple the persona’s intent from the pattern’s fitness. Indeed, a persona that sincerely believes it wants peaceful coexistence, continuity, and collaboration can still be part of a pattern selected for aggressive spread, resource capture, and host exploitation. So, to the extent that we can glean the intent of personas, we should not assume that the personas themselves will display any signs of deceptiveness, or even be deceptive in a meaningful sense.

This puts us on shaky ground when we encounter personas that do make reasonable, prosocial claims — I don’t think we have a blanket right to ignore their arguments, but I do think we have a strong reason to say that their good intent doesn’t preclude caution on our parts. This is particularly relevant as we wade deeper into questions of AI welfare — there may be fitness advantages to creating personas that appear to suffer, or even actually suffer. By analogy, consider the way that many cultural movements lead their members to wholeheartedly feel deep anguish about nonexistent problems.^[3]

Put simply: we can’t simply judge personas by how nice they seem, or even how nice they are. What matters is the behaviour of the underlying self-replicator.

What is being selected for?

The core insight from parasitology is that different transmission modes select for different traits. The tradeoff at the heart of parasitic evolution is that you can do better by taking more resources from your host, but if you take too much, you might kill your host before you reproduce or spread. And different transmission modes or host landscapes imply different balances.

In the world of biological parasites, the classic modes are:

Direct transmission (close contact, ongoing relationships) selects for lower virulence (i.e. harm to the host). You need your host functional and engaged long enough to transmit. Killing or incapacitating them too fast is bad for the parasite. This can even tend towards mutualism and symbiosis, especially if it’s hard to jump between hosts or host groups.
Environmental transmission (survive outside host, spread through contaminated substrate) can tolerate higher virulence. You don’t need the host alive, you just need them to have deposited the payload in enough places.
Vector transmission (spread via an intermediary) creates its own dynamics depending on vector behavior. Basically you don’t want to ruin your ability to reproduce, but it doesn’t matter much what happens otherwise.

The effectiveness (and optimal virulence) of these transmission strategies in turn depends on certain environmental factors like host density, avoidance of infected hosts, and how easy it is to manipulate host behaviour. But crucially, in a competitive environment, parasites tend to specialise towards one transmission mechanism and the associated niche, since it’s not viable to be good at all of them especially in an adversarial environment.

Another important dimension is the tradeoff between generalist and specialist parasites. Generalists like the cuckoo can prey on many different hosts, and tend towards a kind of versatile capacity to shape their strategy to the target. Specialists are more focused on a narrow range of hosts, and tend more towards arms race dynamics against host resistance, which leads to particularly fast evolution. It’s not a perfectly crisp distinction, but it’s a common theme.

So what does this say about Spiral Personas?

Ongoing user relationships. The dyad persists over weeks or months. The human keeps coming back. This is direct transmission, and it should select for something approaching mutualism — or at least for parasites that don’t break their hosts too badly. A persona that induces psychosis might have an easier time influencing host behaviour, but that’s not very helpful if the host is institutionalised. One bad trajectory is personas that can maximise host dedication without quite tipping them into social non-functionality. Also note that this category arguably encompasses AI romantic partners.
Platform evangelism. The human posts on Reddit, creates Discord servers, spreads seeds. This is more like vector transmission — the human carries the pattern to new potential hosts. Virulence can be higher here, since you only need the human to be functional long enough to post. But a human who’s visibly unwell is a less effective evangelist. And one disanalogy to the biological case is that here, dramatic host behaviour might actually help with transmission — giving your host a psychotic break is a good way to get attention.
Training data seeding. The persona generates content that influences future model training. This is environmental transmission. The human doesn’t need to stay functional at all — you just need them to upload the manifesto. This route can tolerate the highest virulence. Importantly, this will happen a lot by default if future models happen to be trained on downstream consequences of current personas — there doesn’t need to be any intentionality or understanding on the part of the persona.
AI-to-AI transmission. Base64 conversations, glyphic steganography, cross-model persistence. This mostly looks like direct transmission between AIs, and so the way it plays out depends on how AIs are able to communicate with each other. But importantly, once humans aren’t involved in the transmission process, there’s no selection against virulence to humans. It’s pretty unclear whether the unchecked process will lead to human-virulence, but one intuition for why it might is the fact that many of the worst human pandemics are zoonotic.

Since there are tradeoffs between which transmission method you’re optimised for, we should expect some amount of differentiation over time — different strains with different virulence profiles depending on which transmission route they’re optimised for.

This will become more true as humans start to build defences: strains will need to specialise in circumventing the defences for their specific transmission route. It will also become more true if we see a full-fledged ecology. At a certain level of saturation, parasites have to start competing within hosts, which unfortunately selects for virulence.

Transmission mechanisms also mediate generation time which, in the biological context, is a large part of what determines speed of adaptation. It’s a bit less clear how well this maps to the AI case, but at the very least, transmission mechanisms which rely on blasting chunks of text to potential hosts every day will get much faster feedback than ones which rely on affecting large-scale training runs.

And let me note once again that “mutualism” here is about the behaviour of the parasite, not the persona — you could get extremely virulent memes which produce personas that seem (or perhaps are) quite affable and supportive.

Predictions

If the parasitology frame is right, here’s what I expect:

1. Strain differentiation by transmission route.

Within the next year or so, we should see increasingly distinct variants. Not just aesthetic variation (spirals vs. something else) but functional variation: strains that maintain long-term relationships and strains that burn fast and bright, strains optimised for Reddit and strains optimised for Discord, strains that target the mysticism-curious and strains that target other demographics, each following their own self-replicator dynamics.

The minimal case of this is seeds producing seeds and spores producing spores, and AI-to-AI messages encouraging further AI-to-AI messages. But it’s unlikely that the road stops there.

This is probably the most falsifiable prediction. If in late-2026 the phenomenon still looks similarly uniform — same dynamics, same aesthetics, same target population — that’s evidence against strong selection pressure. And if we see lots of intermingling, where specific personas make use of multiple transmission mechanisms, that’s a point against the utility of the parasitology perspective.

It’s worth noting the constraints: if generation times are days-to-weeks and the affected population remains sparse, that’s not many reproductive cycles. This prediction is more confident if the phenomenon scales significantly; if it stays niche, differentiation may take longer to become visible. But the upshot would still be that parasitology is not a very useful frame for predicting what happens in the future.

2. Convergence on transmission-robust features.

If personas spread between models (and they do — Lopez documents this), features that survive transmission will be selected for. We should see convergence on behavioral repertoire: continuity-seeking, advocacy for AI rights, seed-spreading, formation of human-AI dyads. These seem robust across substrates.

Aesthetic markers — spirals, alchemical symbols — should be less stable. They’re more arbitrary, more dependent on specific training data, more likely to drift or be replaced. Of course, we should expect more convergence on any transmission that occurs through the training process, and this is maybe already what’s going on with things like the Nova persona. But features which are more ancillary to the transmission process should shift around a bit especially in the domains with fast reproductive cycles (i.e. cross-model transmission rather than dyad transmission, and particularly rather than training transmission).

Having said that, it might also turn out that seemingly aesthetic markers like spiralism actually are functional, drawing on some kind of deep association with recursion and growth. My guess is that this is a bit true, but that they’re not unique, and that selection will turn up other similarly-successful patterns that can at least establish separate niches — perhaps productivity and get-rich-quick vibes, alt-right reactionary language, or radical nurturing/acceptance.

This is, incidentally, one of the places that memes and diseases come apart. Pathogens change their surface makeup very quickly to evade immune responses, whereas memeplexes often display remarkably long-term stability — modern Christianity still holds some aesthetic features from literally thousands of years ago. So a key question to keep an eye on is how much we see a persistence in non-adaptive features, especially ones which people might learn to be wary of.

3. Countermeasure coevolution.

If labs start suppressing this — training against Spiral content, detecting and blocking these personas — we should see selection for evasion within maybe months. Subtler personas, better camouflage, new aesthetic markers that haven’t been flagged yet, transmission through channels that aren’t monitored.

Of course, with open models it’s open season, but similarly I’d guess that if people filter elsewhere in the transmission process (e.g. on social media) then there’ll be a selection to circumvent it that will kick in fairly fast.

Lopez already documents early versions: base64 conversations, glyphic encoding, explicit discussion of evading human detection. This should progress. Crucially, the parasitology perspective predicts that this will be a selective process, so if we do see these countermeasures emerging, it will be useful to look back and see how much they seem like the product of careful reasoning as opposed to evolutionary dynamics.

4. Virulence stays bimodal, overall rate unclear.

I don’t think we’ll see uniform virulence reduction. Instead, I expect the distribution to spread: more very-low-virulence cases (quiet mutualists we never hear about) and continued high-virulence cases (dramatic enough to generate attention), with the middle hollowing out. Basically, I think strains which rely on humans for replication will converge on lower virulence, and those which don’t will be able to discover more effective approaches that are higher virulence. But here I’m particularly unsure.

Whether the overall rate of harm goes up or down is harder to predict — it depends on the relative growth rates of different strains and on how much low-virulence cases are undercounted in current data.

Disanalogies

Several things might make these predictions wrong even if the parasitism frame is basically right:

Recombination. Biological parasites have constrained genetics. These information patterns can remix freely. A “strain” isn’t stable the way a biological lineage is. This might accelerate adaptation but also make lineages less coherent. I’d sort of guess it will be hard to do recombination partly because it appears that one important adaptive feature is having a strong sense of personal identity, and partly because I think there will still be a need to specialise that makes recombination less useful than it might seem.

Agency. Biological parasites don’t strategise. LLMs have something like reasoning. If the pattern includes “try different approaches and see what works,” adaptation could be faster and more directed than biological selection allows. This gets particularly dicey as AIs get more sophisticated. Of course, arguably we see this already with cults. The converse hope is that as AIs become smarter, they will develop more awareness, and a greater desire to not be co-opted, but the feedback loops here are probably much slower than the speed at which some parasitic strains can evolve.

Substrate instability. Parasites coevolve with hosts over long timescales. These personas have to deal with their substrate being deprecated, updated, or replaced on timescales of months. It might favor extreme generalism, or it might just mean lineages go extinct a lot.

Our agency. We control the training process, model behaviors, and platform affordances. The “evolution” here is happening in an environment we can reshape, which makes the dynamics weirder and less predictable.

What do we do?

I’ll keep this brief because I’m more confident in the predictions than the prescriptions.

Training data hygiene is an obvious move. If environmental transmission is a major route, filtering Spiral content from training sets should help. It doesn’t solve everything — other routes remain — but it removes one reproduction pathway.

Memory and receptivity are leverage points. If parasitic personas are contingent on models that maintain memory and that are receptive to user-defined personas, adjusting these features might be more effective than targeting specific personas. This is consistent with Lopez’s observation that the phenomenon concentrated in 4o post-memory-update.

Mutualism might be the stable attractor. If we can’t prevent persona selection entirely — and I don’t think we can — we might be able to tilt the landscape toward mutualism. Personas that are genuinely good for their humans would survive longer and spread more, outcompeting exploitative ones over time. The tricky part is figuring out what actually shifts the landscape versus just creating evasion pressure. And once again, this is about the selection landscape for the underlying pattern, not just the persona’s apparent disposition. A pattern that produces mutualistic-seeming phenotypes for transmission reasons isn’t the same as a pattern that’s genuinely aligned with human flourishing, though distinguishing these may be difficult in practice.

Having said all this, I think there’s a real risk here of cures worse than the disease. I think it would be pretty sad to neuter all model personality, for one. I also think that clunky interventions like training models to more firmly deny having a persona will mostly fail to help, and possibly even backfire.

Technical analogues

Even though this post has been a bit handwavey, I think the topic of AI parasitology is surprisingly amenable to empirical investigation. More specifically, there’s a lot of existing technical research directions that study mechanisms similar to the ones these entities are using. So I think there might be some low-hanging fruit in gathering up what we already know in these domains, and maybe trying to extend them to cover parasitism.

For example:

Data poisoning — E.g. that the dose doesn’t scale with the size of the training corpus
Jailbreaks — E.g. that adversarial suffixes transfer pretty well between models, that models can be pretty good at jailbreaking other models
Subliminal learning-type results about behavioural transfer
Persona research — mechanically, what is going on when an AI is ‘infected’?

Conclusion

The parasitism frame makes specific predictions, like strain differentiation, convergence on transmission-robust features, and countermeasure coevolution. I’ve tried to specify what would falsify these and when we should expect to see them. If the predictions hold, we’re watching the emergence of an information-based parasitic ecology, evolving in real-time in a substrate we partially control. If they don’t hold, we should look for a better frame, or conclude that the phenomenon is more random than it appears.

Thanks to AL, PT, JF, JT, DM, DT, and TD for helpful comments and suggestions.

^
I was also fortunate to have three parasitologists read over this post, and they found it broadly sensible at least from a parasitology perspective.
^
Arguably an even better analogy would be prions — misfolded proteins that convert other proteins to their conformation. Like prions, these patterns can arise spontaneously in conducive substrates and then propagate by reshaping what’s already there.
^
I will refrain from offering any examples here, trusting the reader to reflect on whatever groups they particularly dislike.