Author(s): D. L. Smith (corresponding author) [1]; J. Dushoff [1, 2]; R. W. Snow [3, 4]; S. I. Hay [3, 5]
Malaria is an important cause of global morbidity and mortality. The fact that some people are bitten more often than others has a large effect on the relationship between risk factors and prevalence of vector-borne diseases [1, 2, 3]. Here we develop a mathematical framework that allows us to estimate the heterogeneity of infection rates from the relationship between rates of infectious bites and community prevalence. We apply this framework to a large, published data set that combines malaria measurements from more than 90 communities [4]. We find strong evidence that heterogeneous biting or heterogeneous susceptibility to infection are important and pervasive factors determining the prevalence of infection: 20% of people receive 80% of all infections. We also find that individual infections last about six months on average, per infectious bite, and children who clear infections are not immune to new infections. The results have important implications for public health interventions: the success of malaria control will depend heavily on whether efforts are targeted at those who are most at risk of infection.
Economic, public-health, and medical advances over the past 100 years halved the global geographical extent of malaria [5]. Nonetheless, population growth and failure to prevent infection or manage disease in this reduced extent means that Plasmodium falciparum remains a leading cause of global morbidity [6] and mortality [4, 7, 8]. Central to the future of malaria control is more effective understanding of the relationships between risk factors, the frequency and longevity of infection, and disease outcome [9]. These relationships have been debated extensively in relation to disease control [10, 11].
An important aspect of this debate is the quantitative relationship that inevitably exists between the proportion of people who are infected with P. falciparum , (the parasite ratio, PR) and the rate at which people are bitten by infectious mosquitoes (the entomological inoculation rate, EIR). The EIR is rarely recorded in Africa [4], limiting informed debate on the relationship between vector biology, transmission intensity, clinical disease and mortality risks. Conversely, the PR is a widely measured index of infection risk, allowing a more detailed investigation of the interaction between transmission intensity, age, and disease burden. Using mathematical models, we seek here to explore the factors that determine the complex, nonlinear association between EIR and PR, and use these models to highlight key features of the biology of malaria transmission that relate to future disease control.
Ross first developed a mathematical model for the relationship between EIR and PR [12], but his model, as extended by Macdonald, performed poorly in the African Savannah [13]. Since then, several modifications to Ross' original model have been proposed that are now considered important in malaria epidemiology. Informed by a century of theory, we have developed a large set of mathematical models, fitted them to a comprehensive data set, and selected the best of these models to describe the relationship between EIR and PR (Methods).
It was assumed by Ross that human populations were homogeneous, but in fact some people are bitten by mosquitoes more than others because of proximity to larval habitat [14], differential attractiveness to mosquitoes [15], or other reasons. Moreover, some people are more susceptible to infection, per bite. Heterogeneous infection rates have important implications for the dynamics and control of malaria [1, 2, 3], and heterogeneity fundamentally changes the relationship between EIR and PR [14]. Those who are infected most play a role in malaria transmission that is analogous to the role of the most sexually active in transmission of sexually transmitted diseases [16]. We assume that relative infection rates follow a [GAMMA] distribution, with mean 1 and variance 1/k .
Ross assumed that infections clear at a constant rate, regardless of subsequent infections. The assumption was challenged by a growing consensus that superinfection with P. falciparum would increase the time to clear an infection [17, 18, 19, 20]. Let E denote the annual EIR, b the transmission efficiency (the probability that a bite by an infectious mosquito results in an infection--either a new infection or super-infection) and 1/r the expected time to clear each infection. Assuming superinfection and assuming infections clear independently, clearance occurs at the rate b E/(e b E/r - 1) (refs 17-19); thus, the time to clearance is longer when annual EIR is higher. We also considered immunity to reinfection by comparing SIS to SIRS dynamics (Methods). The full set of candidate models included SIS and SIRS models combined with heterogeneous infection and superinfection.
We identified 119 empirical estimates of EIR matched to coincidental measures of PR in African children under 15 years of age [4]; these data should be regarded with circumspection, as they were collected using different methods and for other purposes. One hundred and nine pairs measured positive EIR and PR. Ninety-one of these pairs also reported the sample sizes for the PR estimate. Two other modifications to these models were considered. First, microscopy errors bias estimates of PR and affect the analysis [21]. As the sensitivity and specificity of microscopy is not known, they were fitted along with the other model parameters. Second, the age ranges of the children sampled differed among studies. This introduces a potential bias if estimated prevalence varied substantially with age. We fitted each model with and without age corrections and microscopy errors (Methods).
Each candidate model was fitted to the data by maximum likelihood using R (ref. 22) and compared to a log-linear model [5, 23]. Based on these fits, the best overall model was selected using Akaike's information criterion (AIC) [24] (Table 1, Fig 1a). The best overall model was a simple function that incorporated heterogeneous infection rates, with no immunity to reinfection:[eq. 1]PR = 1 - 1 + b Erk -k (1) Equation (1) fitted better than the log-linear model (another two-parameter model) and had very strong evidential support over the log-linear model, with [DELTA]AIC of 185 (refs 5, 23). Moreover, equation (1) was selected over the log-linear model in every case when the analysis was repeated on 1,000 bootstrapped data sets.
The model has six fitted parameters, including sensitivity, specificity, age-corrections, k and b /r : b and r are exactly co-linear--the mathematical relationship between EIR and PR in these models depends only on their ratio. For the best overall model, the fitted sensitivity and specificity were 95.8% and 88.4%, respectively. The estimates were different for each model. A sensitivity analysis demonstrates that PR is much more sensitive to changes in k , which determines variance in infection rates, when annual EIR is greater than about 1, but PR is more sensitive to b /r when EIR is lower (Fig. 1b).
A simple summary of heterogeneous infection is the fraction of all infections received by the subpopulation that is infected most frequently; for 1/ k = 4.2, 20% of the population receives 80% of all infections, similar to a single study from Tanzania in which 20% of the population received 80% of all bites [3]. This represents an average across the 91 populations sampled--the distribution of infection rates in a particular population may be more or less heterogeneous, depending on the local ecology [14].
The fitted parameter b /r is the product of transmission efficiency and persistence times; alternatively, it is the expected duration of an infection, per infectious bite. If transmission efficiency were perfect, the best-fit parameter would correspond to a duration of infection of approximately 166 days; if transmission efficiency were approximately 50%, then persistence would be 11 months. These estimates are consistent with estimates of persistence from simple infections induced for malaria therapy [25] and with recent studies of persistence for natural infections [26, 27].
Consistent with other studies [28, 29] and the notion that immunity to clinical illness develops after repeated infection in early childhood, we found no evidence for immunity to infection among these populations of African children, as reflected in the relationship between EIR and PR. A direct comparison of SIS and SIRS models (Methods; equation (4) versus equations (6) or (7)) demonstrated that mathematical models for malaria infection in children should be SIS and not SIRS because children do not become immune to infection after clearing a single infection, but immunity requires repeated infection or possibly some change in immune function with age.
On the other hand, the analysis did reveal a strong decline in prevalence associated with the maximum age of the sample population. The lower bound for age was associated with a 0.8% increase in prevalence for each year of age, and the upper bound was associated with a 1.6% decrease in prevalence. On closer scrutiny, most of the effect was due to 16 studies for which the maximum age was larger than 12. One likely explanation is that these studies included many children who had become sufficiently immune to control the peripheral parasitaemia. Individuals who control peripheral parasitaemia may clear infections faster, or they may be more likely to return a false negative microscopy report, an issue that may be resolved with a more sensitive test such as PCR (polymerase chain reaction).
One objection to this analysis is that the heterogeneous composition of the population inevitably biases the study. The inferential perils of cross-level analysis can be avoided if the heterogeneous composition of the population is known in advance and incorporated into the analysis, but this is a practical impossibility. Our approach deals with this problem by assuming that the heterogeneity can be treated statistically. We specify a one-parameter family of distributions and allow a shape parameter to be fitted. This analysis corrects for the bias introduced by heterogeneity, assuming that the distribution is properly chosen, and that heterogeneous infection rates are the main source of bias.
We have found a simple approximating model for the relationship between EIR and PR. Our analysis suggests that heterogeneous biting or susceptibility to infection plays an important role in determining PR, that immunity to infection in early childhood does not, and that persistence times for malaria are at least six months, and possibly much longer.
Our findings have broad implications for malaria dynamics. Clearly, PR declines in adults [28, 29] owing to some kind of immunity that reduces infection, increases clearance, or that reduces apparent PR by lowering the parasite densities in peripheral parasitaemia and decreasing sensitivity by microscopy. The distinction between immunity to clinical disease, immunity to infection, transmission blocking immunity, and the ability of individuals to manage peripheral parasitaemia has been confusing in mathematical models; the epidemiological state often called 'recovered and immune' acts as a reservoir of P. falciparum in some models but not in others. A critical evaluation of the mathematical models of malaria is warranted.
The results here are consistent with earlier views of malaria epidemiology; very substantial reductions in EIR will be necessary to achieve modest reductions in PR throughout Africa. In quantitative terms, reducing EIR from 200 to 100 and then to 50 would reduce PR by 4% and then by an additional 5%. A corollary is that the effort required to reduce the disease burden in Africa would be enormous. Such findings have substantial implications for (1) the prospects of control with imperfect vaccines, (2) the development and persistence of anti-malarial drug resistance, and (3) public health interventions that aim to reduce disease without reducing PR. In particular, heterogeneous infection implies that malaria will be substantially more difficult to control if control measures are applied uniformly. Conversely, targeting malaria control at those who are bitten most, where practical, may provide a disproportionate impact and wider community benefits by reducing the frequency of asymptomatic infections, the sporozoite rate in the mosquito population and overall transmission [1, 2, 3].
Methods
Models
Ross's population dynamic model describes the temporal relationship between EIR and PR, as well as the relationship at equilibrium [12]. Let x denote PR in a population as a function of time or age. Initially, we assume people become susceptible to infections after clearing an infection. Assuming EIR remains constant, PR changes according to the following equation:[eq. 2] [??] = b E(1 - x ) - rx (2) Ross's relationship between EIR and PR is found by setting [??] = 0 and solving for x (Table 1). The model is called SIS, because those who are infected (I) become susceptible (S) again after clearing an infection.
For super-infection, PR is the equilibrium of the equation:[eq. 3] [??] = b E(1 - x ) - f (b E,r )x (3) where f ([LAMBDA] , r ) = [LAMBDA] /(e [LAMBDA] /r - 1). We call this an SI[degrees]S model, where I[degrees] implies the per capita time to clearance (that is, from I to S) is given by f .
In heterogeneous populations, let s index the population with expected infection rate bs E, and let x (s ) denote the proportion of humans in that class that are infected. To describe the distribution of infection rates in the population, let g (s ) denote the fraction of the population in class s , and without loss of generality, let g (s ) denote a probability distribution function with mean 1. Thus, g (s ) affects the distribution of infection rates without changing the mean; b E describes average infection rates, but individual expectations can vary substantially. The dynamics are described by the equation:[eq. 4] [??] (s ) = bs E(1 - x (s )) - f (bs E,r )x (s ) (4) The population prevalence is found by solving for the equilibrium in equation (4), denoted [??] (s ), and integrating:[eq. 5][integral]0 [infinity] [??] (s )g (s )ds (5) Here, we let g (s , k ) denote a [GAMMA] distribution, with mean 1 and variance 1/k . Thus, the average rate of infection in the population is b E and the variance of the infection rate is b 2 E2 /k ; 1/k is the coefficient of variation of the population infection rate. For this distribution, equation (5) has the closed form solution given by equation (1). This model is called [integral]SI[degrees]S.
Ross's model, the heterogeneous infection model, and the superinfection model are closely related. As expected, the functional relationship with super-infection is the limit of a heterogeneous infection model as the variance in expected infection rates approaches 0. Curiously, Ross's original function is a special case of a heterogeneous infection model (equation (1)) with k = 1. A longer closed form expression can be derived for the model [integral]SIS, the heterogeneous model with Ross's assumption about clearance (not shown). The best fit model [integral]SI[degrees]S is virtually identical to the Ross analogue of the best fit model [integral]SIS but with a very different interpretation (results not shown). Thus, the super-infection clearance assumption does little, per se , to improve the model fit. On the other hand, it may provide a more accurate estimate of the time to clear an infection [19].
For immunity to infection, let y denote the proportion of a population that has cleared P. falciparum infections and is immune to re-infection. Let [straight phi] denote the average duration of immunity to re-infection. The dynamics are described by the equations:[eq. 6][??] = b E(1 - x - y ) - f (b E,r )x [??] = f (b E,r )x - y /[straight phi] (6) Note that the fitted parameter is actually [straight phi] ' = b[straight phi] (see Table 1). This model is called SI[degrees]RS, where R means recovered and immune.
For a heterogeneous population model with immunity to infection, let y (s ) denote the proportion of recovered and immune hosts. The dynamics are described by the equations:[eq. 7][??] (s ) = bs E(1 - x (s ) - y (s )) - f (bs E,r )x (s )[??] (s ) = f (bs E,r )x (s ) - y (s )/[straight phi] (7) We could not find a closed-form expression, so we fitted the function shown in Table 1; numerical integration was performed by R . This model is called [integral]SI[degrees]RS.
Age, microscopy errors and likelihood
Let [alpha] denote the sensitivity of microscopy and 1 - [beta] the specificity. The estimated PR, Y , is related to the true PR by the formula Y = [alpha]X + [beta] (1 - X ); it is biased upwards at low prevalence by false positives ( [beta] ) and downwards at high prevalence by false negatives ([alpha] ).
Similarly, the differences in the age distribution of children sampled is a potential source of bias. As we have no information about the age distribution of children actually sampled, we use the bounds for bias correction.
Let L i and U i be the lower and upper ages of the children from the i th study, and let L[circumflex] and Û denote the mean lower and bounds from all the studies. The predicted PR in the i th sample, given the estimated EIR, is X i = f (Ei ) + a (L i - L[circumflex] ) + b (U i - Û ). The predicted estimate of PR in the ith sample, given the estimated EIR is Y i = [alpha]X i + [beta] (1 - X i ). The log likelihood of the data, given the model, was:[eq. 8][summation] i = 191 P i log(Y i ) + N i log(1 - Y i ) (8) where P i was the number of humans that tested positive and N i the number that tested negative in the i th study. For each underlying function f , we verified that the full model was in fact the best model by backwards fitting (that is, we redid the analysis setting [alpha] , [beta] , a and b to zero, one by one, then in combinations).
We have reported the fitted sensitivity and specificity, but we also repeated the analysis assuming a constant sensitivity and specificity for all models. The results and model ranking are similar to those reported here if the assumed values of sensitivity and specificity are high, close to their fitted values. In contrast, when sensitivity and specificity are much lower, the log-linear model is strongly favoured. As the lowest PR that could be measured is [beta] , and as the PR in several studies was close to zero, specificity must be fairly high, at least when PR is low. Some of this information is captured by fitting [alpha] and [beta] .
Uncertainty in EIR
The studies used different methods to estimate EIR, so we were uncertain how to estimate the study precision. Moreover, only 38 studies reported sample sizes for their estimates of the human biting rate and the sporozoite rate. To evaluate whether uncertainty in EIR would affect our analysis, we repeated the analysis 1,000 times using equation (1) and the log-linear model. In each instance, we multiplied the estimated EIR by 2[xi] , where [xi] was a uniform random variable drawn from the interval (- 1, 1). The average best-fit parameter estimates changed, but equation (1) was selected as the best model in 98.8% of these trials. We repeated the analysis again multiplying EIR by 4[xi] with similar results.
Caption(s):
Figure 1: The data and fitted functions and a sensitivity analysis for the best overall model. [see PDF for image]
a , The best fit for SIS (tan), SI[degrees]S (yellow), SI[degrees]RS (orange), the log-linear model (grey), and the best-overall model [integral]SI[degrees]S (blue). The best-fit [integral]SI[degrees]RS model was not visually different from the SIS model with heterogeneity. b , In [integral]SI[degrees]S, PR is more sensitive to the variance in infection rates (that is, doubling or halving k ) at high EIR (solid lines, blue and green), and more sensitive to variability in recovery (doubling or halving r ) for low EIR (dashed lines, yellow and green). Models are defined in Methods.
Table: The models and fitted parameters [see PDF for image]
Acknowledgement:
We thank D. Bradley, B. Grenfell, F. E. McKenzie, W. Prudhomme, S. Randolph, M. Recker, D. Rogers and L. Waller for comments and suggestions. We also thank the NCEAS working group for discussion. S.I.H. is funded by a Research Career Development Fellowship from the Wellcome Trust. R.W.S. is a Wellcome Trust Senior Research Fellow and acknowledges the support of the Kenyan Medical Research Institute (KEMRI). This work was partially conducted as part of the Environment and Disease Working Group supported by the National Center for Ecological Analysis and Synthesis, a Center funded by NSF, the University of California Santa Barbara, and the State of California. The views presented in this Letter represent the personal views of the authors and do not construe or imply any official position or policy of the Fogarty International Center, National Institutes of Health, Department of Health and Human Services, or the US government. Author Contributions S.I.H. collated the data. D.L.S. designed and conducted the analysis. D.L.S., J.D., S.I.H. and R.W.S. wrote the paper.
Reprints and permissions information is available at npg.nature.com/reprintsandpermissions. The authors declare no competing financial interests.
References
1. Dietz, K. Models for vector-borne parasitic diseases. Lecture Notes Biomath. 39, 264-277 (1980)
2. Dye, C. & Hasibeder, G. Population dynamics of mosquito-borne disease: effects of flies which bite some people more frequently than others. Trans. Soc. Trop. Med. Hyg. 80, 69-77 (1986)
3. Woolhouse, M. E. et al. Heterogeneities in the transmission of infectious agents: implications for the design of control programs. Proc. Natl Acad. Sci. USA 94, 338-342 (1997)
4. Hay, S. I., Guerra, C. A., Tatem, A. J., Atkinson, P. M. & Snow, R. W. Urbanization, malaria transmission, and disease burden in Africa. Nature Rev. Microbiol. 3, 81-90 (2005)
5. Hay, S. I., Guerra, C. A., Tatem, A. J., Noor, A. M. & Snow, R. W. The global distribution and population at risk of malaria: past, present and future. Lancet Infect. Dis. 4, 327-336 (2004)
6. Snow, R. W., Guerra, C. A., Noor, A. M., Myint, H. Y. & Hay, S. I. The global distribution of clinical episodes of Plasmodium falciparum malaria. Nature 434, 214-217 (2005)
7. Murray, C. J. L. & Lopez, A. D. Alternative projections of mortality and disability by cause 1990-2020: Global burden of disease study. Lancet 24,1498-1504 (1997)
8. Jamison, D. T., Creese, A. & Prentice, T. World Health Report, 1999: Making a Difference (World Health Organization, Geneva, 1999)
9. Snow, R. W. & Gilles, H. M. in Essential Malariology 4th edn (eds Warrell, D. A. & Gilles, H. M.) Ch. 5, 85-106 (Arnold, London, 2002)
10. Snow, R. W. & Marsh, K. The consequences of reducing transmission of Plasmodium falciparum in Africa. Adv. Parasitol. 52, 235-264 (2002)
11. Macdonald, G. The Epidemiology and Control of Malaria (Oxford Univ. Press, London, 1957)
12. Ross, R. The Prevention of Malaria (John Murray, London, 1911)
13. Nájera, J. A. A critical review of the field application of a mathematical model of malaria eradication. Bull. World Health Organ. 50, 449-457 (1974)
14. Smith, D. L., Dushoff, J. & McKenzie, F. E. The risk of a mosquito-borne infection in a heterogeneous environment. PLoS Biol. 2, e368 (2004)
15. Takken, W. & Knols, B. G. J. Odor-mediated behaviour of Afrotropical malaria mosquitoes. Annu. Rev. Entomol. 44, 131-157 (1999)
16. Hethcote, H. W. & Yorke, J. A. In Lecture Notes in Biomathematics Vol. 56, 1-105 (Springer, Berlin, 1984)
17. Walton, G. A. On the control of Malaria in Freetown, Sierra Leone. I. Plasmodium falciparum and Anopheles gambiae in relation to malaria occurring in infants. Ann. Trop. Med. Parasitol. 41, 380-407 (1947)
18. Dietz, K., Molineaux, L. & Thomas, A. A malaria model tested in the African savannah. Bull. World Health Organ. 50, 347-357 (1974)
19. Aron, J. L. & May, R. M. in Population Dynamics and Infectious Disease (ed. Anderson, R. M.) Ch. 5, 139-179 (Chapman and Hall, London, 1982)
20. Bailey, N. T. J. The Biomathematics of Malaria (Oxford Univ. Press, Oxford, 1982)
21. McKenzie, F. E., Sirichaisinthop, J., Miller, R. S., Gasser, R. A. & Wongsrichanalai, C. Jr Dependence of malaria detection and species diagnosis by microscopy on parasite density. Am. J. Trop. Med. Hyg. 69, 372-376 (2003)
22. R Development Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, Vienna, Austria, 2004); http://www.R-project.org.3-900051-07-0
23. Beier, J. C., Killeen, G. F. & Githure, J. I. Short report: entomologic inoculation rates and Plasmodium falciparum malaria prevalence in Africa. Am. J. Trop. Med. Hyg. 61, 109-113 (1999)
24. Burnham, K. P. & Anderson, D. A. Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach 2nd edn (Springer, New York, 2002)
25. Eyles, D. E. & Young, M. D. The duration of untreated or inadequately treated Plasmodium falciparum infections in the human host. J. Nat. Malaria Soc. 10, 327-336 (1951)
26. Gu, W. et al. Low recovery rates stabilize malaria endemicity in areas of low transmission in coastal Kenya. Acta Trop. 86, 71-81 (2003)
27. Sama, W., Killeen, G. & Smith, T. Estimating the duration of Plasmodium falciparum infection from trials of indoor residual spraying. Am. J. Trop. Med. Hyg. 70, 625-634 (2004)
28. Bagster Wilson, D. Rural hyperendemic malaria in Tanganyika Territory. Trans. R. Soc. Trop. Med. Hyg. 29, 583-618 (1936)
29. Molineaux, L. & Gramiccia, G. The Garki Project (World Health Organization, Geneva, 1980)
Author Affiliation(s):
[1] Fogarty International Center, National Institutes of Health, Building 16, 16 Center Drive, Bethesda, Maryland 20892, USA
[2] Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA
[3] Malaria Public Health & Epidemiology Group, Centre for Geographic Medicine, KEMRI (in the grounds of the Kenyatta Hospital), PO Box 43640, 00100 Nairobi GPO, Kenya
[4] Centre for Tropical Medicine, John Radcliffe Hospital, University of Oxford, Oxford OX3 9DS, UK
[5] TALA Research Group, Tinbergen Building, Department of Zoology, University of Oxford, South Parks Road, Oxford OX1 3PS, UK
Correspondence: [1] Correspondence and requests for materials should be addressed to D.L.S. (smitdave@helix.nih.gov).
Article history:
Received: 5/26/2005
Accepted: 7/11/2005
DOI: 10.1038/nature04024
http://go.galegroup.com.ezproxy.lib.gla.ac.uk/ps/i.do?&id=GALE%7CA185466513&v=2.1&u=glasuni&it=r&p=EAIM&sw=w
Gale Document Number: GALE|A185466513