Taking Initial Viral Load Seriously

Link post

In part a response to (Overcoming Bias): Variolation May Cut Covid19 Deaths 3-30X

Also related: This is an expansion of parts of this debate with Robin Hanson.

Epistemic Status: Thinking out loud. Food for thought. Not an expert.

Initial viral load seems likely to have a large impact on severity of Covid-19 infection. If we believe this, we should take this seriously, and evaluate both general policy and personal behavior differently in light of this information. We should also do our best to confirm or deny this hypothesis as soon as possible.

Robin Hanson and I had a debate on Sunday regarding his proposal of variolation: Deliberate Covid-19 infection of volunteers, using small viral loads, followed by isolation of those volunteers until recovery. I felt it was an excellent discussion. As with all such discussions, some key points are glossed over, lost or not stated as well as one would like in hindsight, there are areas that never come up, and one thinks of important additional things afterwards.

What evidence do we have that viral load matters?

Three classes of evidence seem strong.

The first is that we have a strong mechanism story we can tell. Viruses take time to multiply. When the immune system detects a virus it responds. If your initial viral load is low your immune system gets a head start, so you do better.

The second category is the terrible outcomes in health care workers on the front lines. Those who are dealing with the crisis first hand are dealing with lots of intense exposures to the virus. When they do catch it, they are experiencing high death rates. High viral load is the only theory I know about so far for why this is the case. Their cases are presumably handled at least as well as others, in terms of detection, testing, treatment and what the infected do themselves. The only other issue I can think of is that they might be reluctant to rest given how urgently their help is needed.

The third category is historical precedents.

Robin’s proposal for variolation is similar to what was historically done with smallpox. Parents infected their children with what they hoped was exactly the minimum dose required to get them sick enough to develop antibodies and gain immunity. Sometimes this went wrong and the child would get sick. Thus this form of inoculation was dangerous and 1%-2% of patients died. But of those who got smallpox infections in other ways, 20%-30% of patients died. Those rates are well established.

Another classic example is measles. We have a study that says that the first child in a family to contract the disease was relatively safe, whereas other children in the same family were 14 times as likely to die. That’s another huge gap. This is from a study of 126 children, so much less certain, but effect sizes that big are not accidents.

Finally for SARS, we have the Hong Kong high rise where proximity to the index patient played a crucial role. Here we see a factor of three difference. SARS being so deadly that 70%+ of highly exposed patients died could be a reason the ratio of deaths between high and low viral load was only three to one.

There are also claims that we can observe higher viral loads from lockdowns making things worse and increase death rates, allowing us to be more confident that viral loads have a big impact. I do not find this convincing because of likely reverse causation and other highly non-random factors that determined where lockdowns happened earlier. Later on I’ll analyze the likely impact of lockdowns on viral load, where I get a different answer than Robin’s.

It’s worth noting that we have vaccines for smallpox and measles, and the SARS virus in question was contained, so none of these three are remotely candidates for variolation no matter the true ratios.

What is the evidence against viral load having a big impact?

The evidence of absence is the absence of further evidence. Yes, the effects observed were very big. Yes, they are all the result of natural experiments and we have been ethically precluded from doing randomized trials or other better studies, so the lack of those trials doesn’t mean much. But how likely is it we didn’t find more natural experiments where viral load would have differed in an observable way? Publication bias here could be a large effect.

These effect sizes are super large. It seems odd for them to hold for some viruses, and not to have a large effect in most others. But if that’s true, how is it we are not observing them? Maybe we really don’t ever look so it’s never come up. But it definitely seems odd.

There’s also the small sizes for the measles and SARS studies, which have about 200 patients between them. That does not seem like enough to draw general conclusions with any confidence.

The smallpox case represents the best case, but it also represents an engineered best case of exactly the minimum dose in exactly the right way, along with awareness of the situation versus a society that otherwise had little idea how to handle infections. It also represents the case in which things went so well that ordinary people managed to fight through all the barriers and actually do the thing. It makes sense to assume this was an unusually large effect size.

We have to ask why smallpox was a unique event, and we never used this method for any other virus. Did we even ever consider it?

My prior at this point is that the difference between a low and high initial viral load of Covid-19 is large. The theory makes too much sense. But with high uncertainty.

Suppose we agree that it is likely. At a minimum, there is a large chance of a large effect, and essentially zero chance of a backfiring effect – at worst, a low dose is the same as a high one.

What should we do? How do we take this seriously?

Four categories of things one might do.

  1. There are things we could do to get better information.

  2. There are things individuals or small groups can do to improve their situation.

  3. There are things society as a whole could try to do that don’t have big downsides.

  4. We could take bold action, potentially including variolation.

Category 1: Better Information

The failure to collect more and better information about Covid-19 has been atrocious, shameful, expensive and deadly.

We shouldn’t only be doing a crash project for a vaccine and ramping up testing much faster and immediately testing every treatment that shows any promise.

We should be collecting extensive population-level data everywhere. Mostly we’re not collecting any data at all that isn’t massively biased.

We should be studying with experiments how Covid-19 spreads, and how likely each method is to work, using controlled experiments. Yes, this involves infecting individuals. Considering how many lives are at stake and the ability to test using young healthy volunteers who are then isolated, I fail to see how anyone who objects on the basis of ‘ethics’ knows what that word means, or why we should listen to them.

We should use those experiments, and additional experiments if necessary, to study effects of viral load. Because again, that knowledge would save so many lives, in addition to tons of economic distress that could force horrible choices upon us.

We can shut down the entire economy, force people to stay home and create double digit unemployment. And we can’t do this? Really?

The more I think about the Covid-19 situation, the more I think the highest leverage thing most people reading this can do is to find ways to get our hands on better data.

Better information would (of course) make it much easier to know whether it made sense to take other action, know which actions made sense, and gain support for the right ones, across the board. As we go over what else we might do, all of that will emphasize how valuable more information would be.

My current best thought for how to do experiments quickly is medical cruise ships in international waters. We even have lots of spare cruise ships lying around with nothing to do right now, which we could convert if we needed to. Medical cruise ships are already an established way to do things without running into regulatory problems. We could do things properly, in a way that would give trustworthy results and would allow others to trust the data. The tests required are not expensive or difficult to produce if no FDA regulations get in one’s way. Such projects are well within the ‘Bill Gates decides to just go ahead and do this’ price range.

Still, one must be realistic. Given we likely won’t get much good information soon, and time is very short, I’ll proceed as if we are physically prevented from information gathering.

Category 2: Things Individuals or Small Groups Can Do

Bird’s Eye View

Things individuals can do are a good place to start, because ‘scale up what an individual should do’ is an excellent hypothesis for much of what a society should do. I agree with Robin Hanson that people who are at risk, and say they are more concerned with infecting others than becoming infected, are usually wrong about that. I still think most people care a lot about preventing others from becoming infected. Incentives, in most places, are not so misaligned.

The first question one would ask is, what infection methods result in a high load, versus which methods result in a low load. Or to be more granular, what lowers or raises such loads.

The default model is that the longer and more closely you interact with an infected person, especially a symptomatic infected person, the larger your viral load.

In-household infections are presumed to be high viral load, as in the case of measles. So would be catching the infection while treating patients.

Most out-of-household infections that aren’t health care related are presumed to be low viral load. Anything outdoors is probably low viral load. Most methods that involve surfaces are probably low viral load. Infection via the air from someone there half an hour ago, to the extent this is a thing, is low viral load. Quick interactions with asymptomatic individuals are probably low viral load.

Knowledge would of course be far better than supposition. If anyone has actual information on any of this, please do share it in the comments.

The only method that seems highly ambiguous is the fecal-oral transmission route. We don’t know how dangerous it is either in terms of infection probability or likely viral load. It could be anything from almost harmless to very dangerous.

To a first approximation, we can say that a typical person (who is not a health care worker) should consider it more deadly to be infected via household transmission. They should consider it less deadly to be infected out-of-household.

We can also consider it relatively more deadly when the infection risk from a given source was otherwise high, or one might be infected from multiple sources at once. When one is exposed to a low-probability infection from a small number of sources, expected viral load is low.

How Many Infections are High Versus Low Load Now?

One place I disagree with Robin Hanson’s analysis is that he is comparing an intervention to create universal low viral loads to an alternative of mostly high viral loads.

I believe he is discounting the extent to which many infections are already low viral load. Consider our intuitive model, and infection rates from the poorly named category ‘close contacts.’

If you had an infected ‘close contact’ in Wuhan, your probability of infection was only 2.5%. Close contacts within the household were much more likely than those outside it to pass along the infection. If we focus on asymptomatic close contacts outside the household, we match our intuition that infection is possible but any given person is unlikely to infect us – the paper gives the probability of infection from each asymptomatic ‘close contact’ that wasn’t all that close at only 0.03%!

Under current lock down conditions, where anyone who got this far into this post (again, who doesn’t have an essential job that prevents this) is presumably avoiding anyone symptomatic, and most symptomatic people are self-isolating or seeking help, it does not seem so hard for most of us to avoid direct out-of-household interaction with highly symptomatic people.

Thus, let’s simplify the baseline for those taking ‘ordinary precautions.’ Out of household transmission is low load. Inside of household transmission is high load. To balance this out, let’s presume that if anyone in a household gets infected, they are probably going to infect the rest of the household, and we’ll set that probability at 100%.

If you are the only member of your household, unless you take big risks or something highly unlikely happens, you’re going to have a low viral load.

If you are one of two members, then one of you will get a low load, and one of you will then probably get a high load.

Overall infection rates are unknown due to inexcusably poor data, but we do know that the probability of infection on a given day for a given person who is taking precautions is low even if many are currently infected around them. Thus, unless household members are exposed from the same source at the same time, the probability of simultaneous infection that gets them both a low viral load is very low and can be rounded off to zero.

The biggest factor thus effectively becomes, how big is your household?

The average household size in the United States of America is about 2.6 people. If we exclude children, the average household size is 2.02 people, which we’ll round to 2.

Children can spread the infection, but they probably are not as effective at it and they do not have essential work outside the home. It should mostly be easy, once schools have closed, to avoid having them get infected before other household members. If they are infected in-household, we don’t care much if they have high viral loads since their risk remains very low.

Thus, of those that matter and who reside in a household, approximately 50% will get low viral loads in a lock down. This will briefly be false in the first week of a lock down, since you clamp down on out-of-household infections but not inside-of-household infections during that period. But longer term, the impact of higher-probability inside-of-household transmission does not matter very much, because the probability of inside-of-household transmission, once someone got infected, was so high already. The only effective defense is to stop out-of-household transmission so everyone in the household stays uninfected in the first place.

Lock downs seem like they would decrease viral load conditional on an out-of-household infection, as long duration in-person indoor interactions, and other ways to get a high viral load, seem to be down a lot more than other out-of-household transmission vectors. So intuitively, lock downs are actually net good for viral load.

The exception is if they change household size, by causing people to shelter together, which brings us back to what groups can do.

We also need to consider what to do about those in institutions, especially nursing homes. That could drive these numbers higher, if such people are effectively inside the same household, and that can’t be fixed.

Looking at a 50% low risk, 50% high risk scenario, we can only save 50% of what we could save if we started in a 50% high risk scenario. So a range of factors of 3-30, which I already discounted because of selection bias in the evidence, can further be cut in half. That gives us a reasonable range of room for improvement of about a factor of 1.5-10. Still a big win! That could be higher if we could do even lower initial loads than what we are calling ‘low load’ but a procedure that is <0.1% to infect you already seems likely to functionally be close to the minimum dose.

One’s Own Risk

If you are living alone, that seems great.

By definition, you can’t be infected inside-of-household with a household size of one. Thus, unless you need to do other high-risk activity, your viral load will be low.

If you are already mostly self-isolating, then you can worry less about incoming holes in your containment procedures. If you do get infected, you’re unlikely to pass it along, your risk will be lower than you would otherwise calculate,

There is an argument that if you’re alone and not going outside for a month conditional on getting packages, and you are young and healthy, you can stop worrying about getting Covid-19 from those packages. If it did happen, you are perhaps doing a solid approximation of variolation, after which you will be immune. That might actively be better than wiping the packages down or letting them sit for three days.

Two to Tango

If there are two people in the household, the calculus changes. We also get some non-intuitive results – since infecting anyone in the household probably infects everyone, protecting the more vulnerable member of the household means you want them to be infected first.

The first thing to note is that, in theory, sufficiently correlated risk to both individuals seems actively great. Both of you getting infected at the same time means you both get low viral loads. This is not easy to pull off, since most risks worth taking are low probability even when they are risky (e.g. <1% chance of infection even if the close contact is infected) so the chance of actually catching it at the same time is still low. But it could be worth thinking about.

The second thing is that people may be allocating infection risk backwards by default. Suppose you have a high risk old individual in poor health, and a low risk young individual in good health. You can’t get groceries delivered and supplies are running low, so someone has to run to the store. Who goes?

Intuition says of course the individual in good health should take the risk. But if we take the whole model seriously, that seems wrong now! If the healthy individual gets infected first then you have a low-risk infection in a low-risk individual (small win) who then infects the high-risk individual and puts them at high-risk again (big loss). You would want to reverse that process.

The counterargument here is that there is some evidence that higher risk individuals are also going to catch the virus more easily, which could prevent the situation from reversing. Again, we need better data!

The third thing is that social distancing within the household starts to look more valuable. Viral load depends on the exact interactions that take place. It makes sense to place at least some value on minimizing infection exposures between household members, even if you are mostly resigned to the infection eventually happening once anyone gets infected. There is still something important to win.

A household with varying risk levels due to age or health, to the extent possible, might choose to concentrate as much within-household exposure as possible in the less risky direction. For example, perhaps the person at high risk should be doing most of the cooking.

If one person does show symptoms first, precautions then become super valuable, even if they probably ultimately fail to prevent spread.

Three is a Crowd

Larger households mean bigger risk of high viral loads. Such groups have both a higher risk of infection at all (since any of them could get infected) they also carry a bigger risk via higher expected viral loads. Thus, as a group scales higher, precautions become even more important.

The obvious strategy is to break up large households. If you have more than two adults in a group, consider that an even higher cost. If you have a lot more than two, this gets extreme. And as before, if you have low-risk individuals together with high-risk ones, having the low-risk individuals take all the risks looks pretty bad. If you do that, it might make sense to consider social distancing within the household where feasible, if it can turn potential inner-household transmissions into effectively out-of-household transmissions. This is especially true if you have reason to believe someone may have been exposed recently.

If there are multiple households considering interacting with each other, that now looks like a bigger additional risk as well. Those types of interactions could effectively be inner-household style contacts.

For those with family members or housemates who insist on taking risks, you should be even more worried than before. The people living with the risk-taker are now at greater risk than the risk-taker themselves. It becomes that much more important to do something about such actions.

Category 3: What Society Can Easily Do (Beyond Gathering Data)

If we think there are things individuals should do, society’s first job is to encourage those people to do those things. Thus, everything in category two could be added to existing lists of recommendations.

But that’s not a practical answer. Bandwidth is limited. People are already overloaded with information, and with disinformation.

If you only get about five words, those five words need to be something like “socially distance, wash hands, mask.” Even not touching your face wouldn’t make the cut. We still have major organizations continuing to actively spread disinformation against even that level of response.

Everything we go over here is therefore probably far too subtle. It also risks muddling the messaging on more important matters. Any intervention for the public as opposed to people who read long analytical blog posts needs to have a focused simple message that could be attached at the end of the current list of simple useful interventions.

Another argument against spending bandwidth on this is that minimizing an individual’s viral load does not much help bend or smash the curve, and anything that reduces R0 is higher leverage and takes priority over other actions. The obvious retort is that lots of people in my circles and other circles are focusing on ventilators.

The simple message of minimize household size seems like a reasonable candidate. Since a large percentage of infections are within-household, this seems like a strong intervention even without viral load concerns. Perhaps we want to emphasize this more. We could use the viral load argument as an additional justification for an already good intervention.

A second strategy would be to ban activities that lead to high viral loads but not ban those that lead to low viral loads. This could make infections less deadly while doing less economic damage. The risk is that there are still a lot of large households out there. It seems beyond our abilities to say “people living alone and not working in a highly interactive position can do X, but not people living with others or working in ways that interact with others.” I would love to be wrong about that. Still, some amount of adjustment of what we do and don’t permit or encourage, on the margin, would be helpful.

It seems hard to do more than that on a broad basis until we have much better data. So again, the most low-hanging fruit is to gather data. Run experiments. Do more and faster science to it.

Category 4: Bold Action

Suppose we manage to gather better data and it turns out viral loads are a big deal.

Say, we become confident that minimal loads have a 0.1% death rate with proper medical care and high loads have a 2% death rate with proper medical care, and lock down conditions have a roughly 5050 mix of both and a 1% death rate. That’s a bigger effect than I expect, but very much in the realm of the possible.

That difference is a really, really big deal. It’s a much bigger deal than getting enough ventilators. It’s potentially a bigger deal than having a medical system at all. Alternatively, it’s potentially a bigger deal than the difference between 100% infection rates and the infection rate a few weeks from now (or in some places like Spain or New York City, potentially the infection rate today). It’s not enough to overcome both of those differences at once.

It is certainly a big enough difference to justify bold action to minimize high viral load infections among our most vulnerable populations.

Suppose we got our act together. We want to do more than nudge individual behavior. We are willing to do things that people find instinctively repugnant, provided they save lives while at least not hurting the economy. How could we accomplish this?

The possibilities I can see are subsidizing household divisions, strategic variolation of individuals, variolation of the young and healthy, or variolation of the old and vulnerable.

Subsidizing household divisions means exactly what it sounds like. We could offer tax or other incentives, up to and including providing housing and forcing people to use it, in order to break up sufficiently large groups of adults, or sufficiently large groups of adults that contain at least one at-risk member (e.g. someone over the age of 60, or 70). Given the economic costs of shutdown, actions that involve spending large amounts of money and/​or using large amounts of coercion are being underconsidered in general. We are already using lots of coercion and paying gigantic economic costs! The difference is we are doing so passively, via preventing actions, rather than actively via causing actions. That’s not a hill I want us to (literally) die on.

Then there are variolation strategies, where we deliberately infect individuals. Or alternatively, if we find out-of-household infections are low enough viral load, perhaps we could do this passively via allowing those in single-adult households to do things that cause them to infect each other in such ways, while otherwise taking strong precautions so they avoid infecting others.

There are a lot of bad objections to such policies. I won’t waste space addressing them other to note that they present practical barriers to implementation, and that they force us to be sure to do all of this carefully and correctly to have a chance of avoiding being shut down or worse.

One very strong objection is that our medical system is about to be overwhelmed. Anyone we expose now will either take needed resources away from others, or go without those resources. Thus, we either need to be in a world in which everyone is already going to get infected while the system is overwhelmed and we can’t stop it (in which case perhaps the best we can do is get remaining people low viral loads and hope for the best), or a world in which the virus has been contained for now but where we don’t have a long term plan that can keep it that way (e.g. some places can squash and keep their medical systems stable for a while, but they can’t stop reintroductions while reopening the economy, and the economic costs of doing this for long enough to wait for a vaccine or other solution are not an option).

Infecting the young and healthy is the natural first thing to model. The young are at relatively low risk. That risk is not zero, even if we assume big impacts from low viral loads, screening for comorbidities and ensure good at-home care. But if such people will eventually probably get infected anyway, we can reduce their net risk while allowing them to return to work and other activity much faster. Then that slows the further spread of the virus, hopefully allowing more high-risk individuals to never get infected at all.

Robin’s current best concrete suggestion, once we have established the safety and value of the procedure via testing, is to create variolation villages. Those who voluntarily participate and are deemed healthy enough would be isolated and infected, we would verify the infection and then allow them free access to the village until they were safely cured and non-infectious. Then they could return to their lives and move freely.

One could respond that this is exactly backwards. If low viral loads are a big win, why are we protecting those who least need that protection? Why aren’t we protecting those who most need it? This argues that, given we will have limited capacity, we should instead look to variolate the old and at risk, since we are reducing risk and they benefit from this the most.

Going down this path means we’ve concluded that protecting them, via herd immunity from the young or via general suppression or otherwise, is not realistic. These are exactly the people who ideally we don’t let get infected at all. If they do get infected, even carefully, they will need a lot of care. Doing this requires even more ideal conditions than infecting the young. We either need a lot of spare medical resources without having much hope of long term containment, or we need to have essentially no hope of stretching things out very far before most are infected.

Those who are capable of sustained safe isolation would want to avoid participating even under the best conditions.

That leaves what I am calling strategic variolation. Rather than taking whoever volunteers, or sorting by age and health, we choose people who (both volunteer and) provide superior leverage. Look for those who would otherwise be forced to expose themselves to high viral loads or lots of interactions. Alternatively, look for activities that cannot be done while social distancing, but which have very high value. Focus on those categories of individuals and at least give them priority. Alas, many would consider this an even worse look than a general call for volunteers, so much so that I am not naming anyone I would prioritize. If we got farther along this path, there would be plenty of time to discuss that.


Viral loads are not being taken seriously. We should take them seriously.

On an individual and household level, that means thinking carefully about how to avoid high viral load infections especially for those most at risk.

On a societal level, that means gathering much better data about how impactful this factor is and how it works (and about all other aspects of what is happening, of course), so we can consider taking bold action if appropriate. It also likely means encouraging smaller household groups during the pandemic.

Remember, I am not an expert. This is only me thinking out loud.