Major Misconception About Acquired Herd Immunity

When herd immunity is achieved through large-scale population exposure, the epidemic doesn’t come to a halt. Millions more could ultimately be infected.

What, exactly, is herd immunity? Let’s look at some recent descriptions.

“Herd immunity occurs when enough people become immune to a disease to make its spread unlikely.” Herd immunity is “the point at which the virus can no longer spread widely because there are not enough vulnerable humans.” “Herd immunity occurs when a large portion of a community (the herd) becomes immune to a disease, making the spread of disease from person to person unlikely.”

Another authority has noted, “For example, if 80% of a population is immune to a virus, four out of every five people who encounter someone with the disease won’t get sick (and won’t spread the disease any further). In this way, the spread of infectious diseases is kept under control.”

And still another reference explains, “Herd immunity can be achieved when so many members of a population have become immune to an infectious disease that it can’t find new people to infect. There are two ways to get there: by exposing a large percentage of the population to a virus so they can develop antibodies on their own, or by vaccinating enough people to interrupt its transmission.”

The last description does draw an important distinction between herd immunity acquired through large-scale population exposure and herd immunity acquired through a campaign of immunization. But it does not go far enough. That’s because the two ways of achieving herd immunity have vastly different consequences. In fact, the widespread tendency to confuse the two has led to a major misconception as to what would happen the day after enough people got infected to cross the herd immunity threshold.

Herd Immunity Through Large-Scale Population Exposure

The above graphic shows the natural course of an epidemic governed by the classic SIR model, first described in 1927, which has served as the mainstay of mathematical epidemiology for nearly a century. This particular realization of the model has three features. First, the entire population is assumed to be naïve to the infectious agent at the outset. Nobody has natural immunity. Second, the epidemic is seeded by a very small number of infectious individuals imported from outside the population. Third, the basic reproductive number (or ${\Re_0}$ ) is equal to 2. At the very start of the epidemic, each infected person will, during the time he or she remains infectious, cause an average of two other persons to become infected.

At the start of the epidemic at the very left, the green curve shows that nearly 100 percent of the population is susceptible (S) to the infectious agent. The blue curve seems to show that no one is initially infected (I), but in fact the proportion seeding the epidemic is so small that we can’t see it on the graph. The red curve shows that zero percent of the population is initially resistant (R).

As the epidemic gets going, the proportion of infected people initially grows. But each infected individual remains infectious only for a limited time period. He or she eventually becomes resistant, either by recovering from the infection or dying. As more people get infected, the proportion remaining susceptible falls, and as more infected people recover from their infections, the proportion becoming resistant rises. At all times during the course of the epidemic charted in the graphic, the proportions susceptible, infected, and resistant sum to 100 percent.

The Herd Immunity Threshold

In the classic epidemic model we’ve charted above, the herd immunity threshold is reached when the proportion of infected people reaches a peak. At this point, exactly half of the population remains susceptible. In mathematical terms, the remaining proportion of susceptible individuals at the point of herd immunity is the inverse of the basic reproductive number, that is, $1 / {\Re_0}$ .

At the start of the epidemic, each infected person was passing his or her infection to two other people. But with half of the population no longer susceptible, that won’t happen any longer. Infected person A will expose individuals B and C to the infectious agent. But the infection won’t take in the case of person C, who is either infected or resistant, and thus can’t acquire a new infection. Thus, each infected person (individual A) will be replaced by only one other infected person (individual B). The rate of growth of the infected population is exactly zero.

As the epidemic passes the immunity threshold, so that the proportion of susceptible persons falls below 50 percent, the rate of growth of the infected population turns negative. One infected person will give rise on average to less than one other infected person, and the proportion infected declines. That’s exactly what we see happening in the blue curve in the graphic.

The Catch

But there’s a problem, a catch, a rub. At the threshold of herd immunity, the red curve tells us that only 35 percent of the population is actually resistant, while the blue curve tells us that 15 percent are still infected. Each remaining infected person will indeed cause less than one new infection, and the percentage infected will indeed begin to fall. But there are still plenty of infected people to continue to pass their infections to the remaining susceptible individuals. In fact, at the far right of the graphic, by the time the epidemic finally fades away, the red curve tells us that about 80 percent of the population will have either recovered or died.

Millions More

Perhaps percentages are too abstract, too intangible. Here’s an application with absolute numbers. We start out with a population of 300 million susceptible people. We reach herd immunity when only 150 million remain susceptible. At that point, 45 million individuals will be actively infected and 105 million will have recovered or died. By the time the epidemic is over, however, 240 million will have either recovered or died. That is, 240 – 150 = 90 million additional people are infected after the herd immunity threshold is reached.
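The arithmetic above can be checked in a few lines of Python. This is only a sketch: it takes the rounded percentages read off the graphic (50, 15, 35 and 80 percent) as given, and the names `POPULATION` and `people` are ours, introduced purely for illustration.

```python
# Rounded percentages from the text; POPULATION and people() are illustrative names.
POPULATION = 300_000_000


def people(percent: int) -> int:
    """Convert a whole-number percentage of the population into a head count."""
    return POPULATION * percent // 100


# At the herd immunity threshold: 15% actively infected, 35% recovered or dead.
ever_infected_at_threshold = people(15) + people(35)

# By the end of the epidemic: 80% recovered or dead.
ever_infected_in_long_run = people(80)

infected_after_threshold = ever_infected_in_long_run - ever_infected_at_threshold
print(f"{infected_after_threshold:,} people infected after the threshold")
```

Integer percentages are used so that the head counts come out exact rather than being subject to floating-point rounding.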

Why This Differs from Mass Immunization

Think about how different the roll-out of the epidemic would be if 50 percent of the susceptible individuals were instead immunized at the outset. When each of a handful of infected individuals is imported from outside, he or she will be unable to infect more than one other person. While it could still take some time for the resulting infections to completely dissipate, the extent of propagation will be minuscule in comparison to our previous scenario of herd immunity through large-scale population exposure.

Total Deaths Are Understated, Too

More than a few commentators have rightly pointed out that lots of people will die by the time we get to herd immunity. The message of the present analysis is that these estimates of the numbers of deaths are also understated.

For the sake of argument, let’s assume an infection fatality rate of 0.5 percent, which is at the low end of the World Health Organization’s recent estimate. Our SIR model teaches us that at the point of herd immunity, 50 percent of the population will have been infected. That means 0.5% $\times$ 50% = 0.25% will have died. In a population of 300 million, that’s 750,000 deaths. The point of this article is that, in fact, 80 percent will eventually be infected, so that 0.5% $\times$ 80% = 0.40% will have died. In the same population of 300 million, that’s 1,200,000 deaths.
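The two death counts can be reproduced with a short calculation. The sketch below assumes only what the paragraph states (a 0.5 percent infection fatality rate and a population of 300 million); the function name `deaths` and the constants are hypothetical names chosen for this illustration.

```python
# Assumptions taken from the text: 0.5% infection fatality rate, 300 million people.
POPULATION = 300_000_000
IFR_PER_THOUSAND = 5  # 0.5 percent, written as deaths per 1,000 infections


def deaths(percent_infected: int) -> int:
    """Deaths implied by a whole-number percentage of the population infected."""
    infected = POPULATION * percent_infected // 100
    return infected * IFR_PER_THOUSAND // 1000


deaths_at_threshold = deaths(50)   # counting only up to the herd immunity threshold
deaths_in_long_run = deaths(80)    # counting the long tail of infections as well
uncounted_deaths = deaths_in_long_run - deaths_at_threshold
print(f"{deaths_at_threshold:,} vs {deaths_in_long_run:,} "
      f"(difference {uncounted_deaths:,})")
```

The difference between the two figures is the number of deaths that a threshold-only estimate would leave out.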

While all of the foregoing results have been known for nearly a century, we have been searching far and wide for someone to explicitly acknowledge them in the context of the current COVID-19 pandemic. We have so far found only one instance. Almost as an afterthought to an article that likewise omits the long tail of infections in its calculation of how many millions will have to die, the Washington Post aptly quotes Carl Bergstrom of the University of Washington: “The epidemic doesn’t stop on a dime when you hit herd immunity. … The herd immunity point is when you’re at the peak of the epidemic. So you’ve come up the curve. But you still got to go all the way back down.”

But Aren’t We Closer to Herd Immunity Than We Thought?

A number of analysts have suggested that we may be closer to herd immunity than we thought. One source of evidence is that some individuals may already have a degree of cross-immunity from other prevalent coronaviruses – though the data are too meager at this juncture to know how many. Another line of argument is based on the idea of incomplete mixing. The entire population could reach herd immunity, so the argument goes, once the groups with the most infectious individuals become saturated with infections. Data from Florida, however, indicate that there is plenty of mixing from the most infectious to the most susceptible populations.

Still, in terms of our classic SIR model, these contentions share a common feature – namely, the initial proportion of susceptible persons is less than 100 percent. That would indeed change the scale of our model, but not the basic dynamics.

Let’s start out once more with a population of 300 million people, but this time assume that 100 million are not susceptible from the get-go. For the remaining 200 million, we reach the herd immunity threshold when 100 million (or 50 percent of the initial susceptible individuals) have gotten infected. By the time the epidemic has fully dissipated, 160 million (or 80 percent) will have been infected.

What If Our Estimate of the Basic Reproductive Number is Wrong?

We assumed that the basic reproductive number ${\Re_0}$ is equal to 2 solely for illustrative purposes. This round number made it easier for us to communicate the basic ideas. In fact, we have estimated that in the early days of the epidemic in New York City, the basic reproductive number was on the order of 3.4. Still, as explained in the Technical Notes below, the same underlying dynamics apply generally to any value of ${\Re_0} > 1$ .
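To see how the gap between the threshold and the long run behaves for other values of ${\Re_0}$, one can evaluate both quantities numerically. The sketch below assumes ${S_0} \approx 1$, so that $\gamma = {\Re_0}$, and solves the long-run relation from the Technical Notes by simple fixed-point iteration. The function names and the choice of fixed-point iteration are ours, not part of the original analysis, and the values 1.5, 2.0 and 3.4 are just sample inputs.

```python
import math


def threshold_share(r0: float) -> float:
    """Share ever infected when the herd immunity threshold is reached: 1 - 1/R0."""
    return 1.0 - 1.0 / r0


def long_run_share(r0: float, iterations: int = 500) -> float:
    """Share ever infected in the long run, assuming S0 is approximately 1 and R0 > 1.

    Solves R = 1 - exp(-R0 * R) by fixed-point iteration started from R = 1,
    which steers clear of the trivial root at R = 0.
    """
    r = 1.0
    for _ in range(iterations):
        r = 1.0 - math.exp(-r0 * r)
    return r


for r0 in (1.5, 2.0, 3.4):
    at_threshold = threshold_share(r0)
    in_long_run = long_run_share(r0)
    print(f"R0={r0}: {at_threshold:.1%} at threshold, {in_long_run:.1%} in the long run")
```

For each sample value the long-run share exceeds the threshold share, which is the point of the article in numerical form.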

In our application of the classic SIR model, we further assumed that the population was closed. We could certainly complicate our model, allowing for new entrants and new exits, but the same overall dynamics would still apply. Of course, a country could encourage the immigration of millions of resistant individuals. But we don’t think that’s what anybody has in mind.

What About Social Distancing?

At this juncture, there is plenty of evidence that social distancing reduces viral transmission, and that the reversal of social distancing enhances transmission. One could imagine an endgame where social distancing measures are used to modulate the rate of infection until herd immunity is gradually achieved over the long run. That strategy would indeed mitigate the problem identified in this article.

But our objective here was not to recommend or predict how we will ultimately get out of this mess. Instead, our narrower goal was to bring to light the hidden costs of a strategy of letting lots of people get sick in the name of herd immunity.

Technical Notes

Classic SIR Model

We’ll work with the classic SIR model. It is the simplest mathematical model describing the time course of an epidemic. A more complicated model – of which there are a great many, including SEIR, SEIIR, and SEIAR – would do no more than distract attention from the main issue. Everything that follows here has been known for nearly 100 years.

Let $S (t)$ denote the proportion of the population that is susceptible to the disease at time $t \ge 0$ . Let $I (t)$ denote the proportion infected, and let $R (t)$ denote the proportion resistant. We assume a closed population, that is,

$S (t) + I (t) + R (t) = 1$

for all $t \ge 0$ . All infected people are assumed to be immediately contagious upon infection. That is, there is no latency period. Individuals can become resistant either through recovery or death.

The SIR model is governed by two coupled differential equations. The first equation is a law of mass action describing the rate at which susceptible individuals get infected. Specifically,

$\dot S(t) = - \alpha S(t) I(t)$

where $\alpha$ is a positive constant parameter. Here, we use the dot notation $\dot S (t) = dS (t) / dt$ for the first derivative.

The second equation describes the rate at which infected individuals become resistant. Specifically,

$\dot R (t) = \beta I (t)$

where $\beta > 0$ is also a constant parameter. Upon becoming infected, each individual thus remains infected for a mean time period equal to $1 / \beta$ . Given the constraint of a closed population, the corresponding differential equation for the number of infected individuals is therefore

$\dot I (t) = \alpha S (t) I (t) - \beta I (t)$
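The three equations can be integrated numerically to trace out the curves described in the main text. What follows is a minimal sketch, not the code behind the original graphic. It assumes $\beta = 1$ (so time is measured in units of the mean infectious period), $\alpha = 2$, and a seed of one infected person per million, and it uses a hand-written fixed-step fourth-order Runge-Kutta integrator. All of those choices, and the function names, are ours.

```python
def derivatives(s, i, alpha, beta):
    """Right-hand sides of the SIR equations for S, I and R."""
    ds = -alpha * s * i          # law of mass action
    dr = beta * i                # infected individuals becoming resistant
    di = -(ds + dr)              # closed population: the three rates sum to zero
    return ds, di, dr


def simulate(alpha=2.0, beta=1.0, i0=1e-6, t_end=60.0, dt=0.01):
    """Integrate the SIR model with fixed-step RK4.

    Returns the state (S, I, R) at the peak of I and at the final time.
    Parameter defaults are illustrative assumptions, not values from the source.
    """
    s, i, r = 1.0 - i0, i0, 0.0
    peak = (s, i, r)
    for _ in range(round(t_end / dt)):
        k1 = derivatives(s, i, alpha, beta)
        k2 = derivatives(s + 0.5 * dt * k1[0], i + 0.5 * dt * k1[1], alpha, beta)
        k3 = derivatives(s + 0.5 * dt * k2[0], i + 0.5 * dt * k2[1], alpha, beta)
        k4 = derivatives(s + dt * k3[0], i + dt * k3[1], alpha, beta)
        s += dt * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6
        i += dt * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6
        r += dt * (k1[2] + 2 * k2[2] + 2 * k3[2] + k4[2]) / 6
        if i > peak[1]:
            peak = (s, i, r)
    return peak, (s, i, r)


peak_state, final_state = simulate()
print("at peak of I: S=%.3f I=%.3f R=%.3f" % peak_state)
print("at end:       S=%.3f I=%.3f R=%.3f" % final_state)
```

Under these assumptions the peak of the infected curve should fall where roughly half the population is still susceptible, and the resistant share should level off near 80 percent.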

We start off our epidemic at time $t = 0$ assuming everyone is naïve to the infectious agent, that is, $R\left( 0 \right) = {R_0} = 0$ . The epidemic is initially seeded by $I\left( 0 \right) = {I_0} > 0$ infected individuals imported from outside. The initial number of susceptible individuals is $S (0) = {S_0} = 1 - {R_0} - {I_0} = 1 - {I_0}$ . If the initial number of infected individuals is small, then ${S_0} \approx 1$ .

How Many Are Infected in the Long Run

In the long run, as time $t \to \infty$ , our epidemic will eventually dissipate and there will be no remaining infected individuals, that is, $I\left( t \right) \to 0$ . At that point, some fraction of susceptible individuals will still not have been infected. We write $S (t) \to {S_\infty }$ and $R (t) \to {R_\infty }$ for the limiting numbers of susceptible and resistant individuals, where ${S_\infty } + {R_\infty } = 1$ and ${I_\infty } = 0$ .

To derive an expression for these limiting quantities, we combine our two differential equations $\dot S (t) = - \alpha S (t) I (t)$ and $\dot R (t) = \beta I (t)$ , to get $dS/dR = - \gamma S$ , where $\gamma = \alpha / \beta$ . The resulting differential equation has the closed-form solution

$S (t) = {S_0} \exp ( - \gamma R (t) )$

As time $t \ge 0$ advances, this equation traces out the phase diagram of the epidemic in the $\left( {R,S} \right)$ plane. At time $t \to \infty$ , we get ${S_\infty } = {S_0}\exp \left( { - {\gamma }{R_\infty }} \right)$ . Since ${S_\infty } + {R_\infty } = 1$ , we end up with

$1 - {R_\infty } = {S_0}\exp ( { - {\gamma }{R_\infty }} )$

The root of this equation is thus the limiting quantity ${R_\infty }$ . In what follows, we use the fact that ${R_\infty } = 0.7968$ when $\gamma = 2$ and ${S_0} \approx 1$ .
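The equation for ${R_\infty }$ has no closed-form solution in elementary functions, but its root is easy to find numerically. The sketch below uses plain bisection, which is our choice for illustration; the function name `long_run_resistant` and the tolerance are hypothetical. The search starts just above zero because, when ${S_0} = 1$ exactly, $R = 0$ is a trivial root of no interest.

```python
import math


def long_run_resistant(gamma: float, s0: float = 1.0, tol: float = 1e-12) -> float:
    """Nontrivial root of 1 - R = s0 * exp(-gamma * R), found by bisection."""

    def f(r: float) -> float:
        return 1.0 - r - s0 * math.exp(-gamma * r)

    lo, hi = 1e-6, 1.0  # start just above 0 to skip the trivial root when s0 = 1
    if f(lo) <= 0.0 or f(hi) >= 0.0:
        raise ValueError("no sign change on the bracket; is gamma * s0 > 1?")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid   # root lies to the right of mid
        else:
            hi = mid   # root lies to the left of mid
    return 0.5 * (lo + hi)


r_inf = long_run_resistant(gamma=2.0)
print(f"R_inf = {r_inf:.4f}, S_inf = {1.0 - r_inf:.4f}")
```

With $\gamma = 2$ and ${S_0} = 1$ this reproduces the value 0.7968 quoted above.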

Reproductive Number and Herd Immunity Threshold

Let’s revisit the differential equation governing the growth in the proportion $I\left( t \right)$ of infected individuals, that is, $\dot I\left( t \right) = \left( {\alpha S\left( t \right) - \beta } \right)I\left( t \right)$ . We can rewrite this equation as $\dot I\left( t \right) = \beta \left( {\gamma S\left( t \right) - 1} \right)I\left( t \right)$ . The expression

$\Re \left( t \right) = \gamma S\left( t \right)$

is the reproductive number of the epidemic at time $t \ge 0$ . At any specific time $t \ge 0$ during the course of the epidemic, $\Re \left( t \right)$ gives the average number of new infections generated by a single infected individual. We let $\Re \left(0 \right) = {\Re_0} = \gamma {S_0}$ denote the basic reproductive number at the start of the epidemic.

When the reproductive number $\Re \left( t \right)$ is exactly equal to 1, we’re at the herd immunity threshold and the growth rate of the infected population is zero, that is, $\dot I\left( t \right) = 0$ . When the reproductive number is less than 1, we’re past the herd immunity threshold and the growth rate of the infected population is negative, that is, $\dot I\left( t \right) < 0$ .
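The sign change at the threshold can be made concrete with a small calculation. This sketch assumes $\gamma = 2$ and $\beta = 1$ and an arbitrary infected share of 10 percent; the function names and the sample values of $S$ are ours, chosen only to show one point before, at, and after the threshold.

```python
def reproductive_number(gamma: float, s: float) -> float:
    """Reproductive number at a moment when the susceptible share is s."""
    return gamma * s


def infected_growth_rate(gamma: float, beta: float, s: float, i: float) -> float:
    """Time derivative of the infected share: beta * (gamma * s - 1) * i."""
    return beta * (gamma * s - 1.0) * i


GAMMA, BETA, INFECTED = 2.0, 1.0, 0.10  # illustrative assumptions

for s in (0.9, 0.5, 0.4):
    re_t = reproductive_number(GAMMA, s)
    growth = infected_growth_rate(GAMMA, BETA, s, INFECTED)
    print(f"S={s:.1f}: reproductive number {re_t:.1f}, growth rate {growth:+.3f}")
```

The growth rate is positive while the reproductive number exceeds 1, zero when it equals 1, and negative once it drops below 1, even though the infected share itself is still well above zero.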

The Epidemic Does Not End at the Herd Immunity Threshold

At the herd immunity threshold, where the growth rate of infected individuals is zero, there is still a positive number of infected individuals in the population, and they will continue to infect other susceptible persons.

Let’s further characterize the moment $t'$ at which the epidemic reaches the herd immunity threshold. We know that $\Re \left( t' \right) = \gamma S\left(t' \right)$ and $\Re \left( t' \right) = 1$ . We also have ${\Re_0} = \gamma {S_0}$ . So, the proportion of susceptible individuals at the herd immunity threshold equals

$S \left( t' \right)= {S_0} / {\Re_0}$

Accordingly, the combined number of infected and resistant individuals at the herd immunity threshold is $I \left( t' \right) + R \left( t' \right) = 1 - {S_0} / {\Re_0}$ .

At the threshold of herd immunity, when $\Re \left( {t'} \right) = 1$ , we have $\gamma S\left( {t'} \right) = 1$ . We already know that $S\left( {t'} \right) = {S_0}\exp \left( { - \gamma R\left( {t'} \right)} \right)$ , so $\Re \left( {t'} \right) = \gamma {S_0}\exp \left( { - \gamma R\left( {t'} \right)} \right) = 1$ . Since $\Re \left( 0 \right) = {\Re _0} = \gamma {S_0}$ , we have ${\Re _0}\exp \left( { - \gamma R\left( {t'} \right)} \right) = 1$ . This gives us the relation between the basic reproductive number ${\Re _0}$ and the number of resistant individuals at herd immunity $R\left( {t'} \right)$ :

$R\left( {t'} \right) = {S_0} \left( \log {\Re _0} \right) / {\Re _0}$

Accordingly, for an epidemic with $\gamma = 2$ and ${S_0} \approx 1$ , the basic reproductive number is ${\Re _0} = 2$ and the proportion of resistant individuals at the herd immunity threshold is $R\left( {t'} \right) = \left( \log 2 \right) / 2 = 0.3466$ .
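The full composition of the population at the threshold follows directly from the two formulas above together with the closed-population constraint. The sketch below evaluates them for ${\Re _0} = 2$ and ${S_0} = 1$; the function name `threshold_state` is ours, and `math.log` is the natural logarithm, which is what the derivation requires.

```python
import math


def threshold_state(r0: float, s0: float = 1.0):
    """Shares (S, I, R) at the herd immunity threshold, where the reproductive number is 1."""
    s = s0 / r0                      # S(t') = S0 / R0
    r = s0 * math.log(r0) / r0       # R(t') = S0 * log(R0) / R0
    i = 1.0 - s - r                  # closed population: S + I + R = 1
    return s, i, r


s_thr, i_thr, r_thr = threshold_state(2.0)
print(f"S={s_thr:.4f}  I={i_thr:.4f}  R={r_thr:.4f}")
```

These are the 50, 15 and 35 percent figures used in the main text before rounding.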

Comparing the Herd Immunity Threshold With the Long Run

We have assumed an SIR model where everyone is naïve to the infectious agent and where the initial number of infected individuals is small, so that ${R_0} = 0$ and ${S_0} \approx 1$ . Under these conditions, the basic reproductive number of the epidemic is ${\Re _0} = 2$ . Based upon our calculations above, we can compare the proportions of susceptible, infected and resistant individuals at the herd immunity threshold when $t = {t'}$ and in the long run as $t \to \infty$ .