The genetics of infectious disease susceptibility: has the evidence for epistasis been overestimated?
© Hall and Ebert; licensee BioMed Central Ltd. 2013
Received: 11 September 2012
Accepted: 8 July 2013
Published: 15 July 2013
Interactions amongst genes, known as epistasis, are assumed to make a substantial contribution to the genetic variation in infectious disease susceptibility, but this claim is controversial. Here, we focus on the debate surrounding the evolutionary importance of interactions between resistance loci and argue that its role in explaining overall variance in disease outcomes may have been overestimated.
Differences amongst individuals in their susceptibility to infection seldom have a simple genetic basis and are often determined by a complex interplay of multiple loci. Characterizing the number, location and effect size of the quantitative trait loci (QTL) underlying this variation informs our understanding of not only the pathways that influence susceptibility, but also the potential coevolutionary dynamics of host and parasites. Of particular interest is the role that epistasis, defined broadly as interactions among loci in determining a phenotype, has in shaping the variation we see in infectious disease susceptibility. For most complex traits, quantitative genetic theory suggests that epistasis is unlikely to contribute substantially to genetic variation [1, 2]. However, models of host-parasite co-evolution typically feature some degree of epistasis between resistance loci [3, 4], and the results of empirical linkage and association mapping studies suggest that epistatic interactions can explain considerable variation in infectious disease characteristics within natural populations [5, 6].
In this article, we will discuss the current state of genetic studies of disease susceptibility, with a particular focus on the theoretical and empirical support for epistasis. We then ask if the genetic basis of infectious disease susceptibility is different from other traits, or if the evidence for epistatic interactions has been overestimated. To provide the necessary background, we first give a brief overview of the debate surrounding the contribution of epistasis to the genetic architecture of complex traits.
The genetic architecture of complex traits
At the heart of this debate is the assumption that the importance of epistasis at the mechanistic level is reflected in the patterns of phenotypic and genetic variation at the level of the population. Within an individual, interactions between genes can result from a wide range of molecular mechanisms and can have positive or negative effects on fitness, depending on whether the resulting phenotype is greater or less than the individual effects of the alleles [12, 13]. This functional impact of gene-gene interactions, however, is different from the statistical contribution of epistasis to complex trait variation within a population , because the latter depends on the distribution of allele frequencies in that population . If most alleles are at extreme frequencies, then the majority of genetic variation should still be additive, even if there is dominance or epistasis acting at individual loci . In this case, the effect of rare alleles that interact will be negligible as the likelihood that two rare alleles are present in the same individual is very low. Thus, from a quantitative genetic perspective, it seems unlikely that epistatic interactions will contribute substantially to phenotypic variance unless the alleles are of major effect, and the frequencies of alleles involved in epistatic interactions are intermediate.
Are complex traits resulting from host-parasite interactions different?
In support of such genetic models, studies of infectious disease susceptibility have commonly documented three key indicators of epistasis. Both linkage and association mapping studies have shown that variation in measures of resistance are often associated with multiple QTL of major effect, that interactions between loci contribute substantially to phenotypic variation, and that evidence for specific candidate loci is often difficult to replicate in other experiments or environments [6, 21]. In a meta-analysis of over 500 QTL mapping experiments, for example, Wilfert and Schmid-Hempel  found that epistatic interactions were identified in 48 of 62 studies involving genome-wide scans, with most epistatic loci not previously identified using single QTL analyses (123 of 170 loci). Conventional quantitative genetic studies have also characterized the contribution of non-additive genetic components to patterns of susceptibility and resistance. Using reciprocal crosses between four populations, for example, a study of resistance in the red flour beetle (Triboliumcastanaeum) found that epistasis explained significant variation in host survival only upon infection by a parasite, and not under the unexposed and uninfected control conditions .
Has the evidence for epistasis been overestimated?
At first glance, the high prevalence of epistasis in mapping studies of disease traits suggests that interactions amongst host resistance genes might indeed contribute substantially to variation in disease susceptibility, as is assumed by models of host-parasite coevolution. What needs to be taken into account, however, is the estimation bias inherent in conventional mapping studies. Minor effect variants, for example, are unlikely to be identified in linkage studies due to a combination of broadly spaced markers and limited sample sizes. Conversely in association mapping, rare alleles will be difficult to detect due to the reliance on linkage disequilibrium between common markers and common causative variants (see discussions in ). Gene frequencies are also altered as part of the design of traditional mapping panels, which typically involve some level of inbreeding, combined with crosses between a few individuals representing phenotypic or even population extremes. In an F2 inter-cross between high and low resistance genotypes, for example, allele frequencies are on average 0.5, even if variants within the mapping panel were at extreme frequencies in the original population. Thus, by concentrating or combining alleles of major effect within and between populations, conventional mapping studies bias allele frequencies towards intermediate values and therefore increase the chance of finding epistasis.
Without information on the effect size and frequency of alleles in natural populations, it is difficult to determine whether epistatic interactions contribute substantially to genetic variation in quantitative susceptibility, or if the contribution of such interactions to individual variation has been overestimated. The lack of success in identifying the same loci across different experiments, for example, could be due to epistasis between resistance loci, or the result of the strong sampling bias and small fraction of genetic variation that is captured using experimental crosses. Nonetheless, identified epistatic interactions can be functionally important. In a number of studies, gene-gene interactions have helped characterize the pathways underlying the mechanisms of resistance (for example, [24–27]). In the mouse, for example, epistatic interactions revealed a new mechanism for resistance to the mouse cytomegalovirus, which involves an interaction between a receptor for natural killer cells and a molecule of the major histocompatibility complex on virus-infected cells . Such studies highlight the functional utility of characterizing epistasis, even if the statistical contribution of each gene-gene interaction to variation in a complex trait remains unclear.
Reconciling quantitative genetic and host-parasite theory
While epistasis is an integral component of many models of disease resistance and antagonistic coevolution (but not all [4, 29]; Figure 2), the contribution of epistatic variance to susceptibility remains difficult to evaluate using conventional QTL mapping methods. With the advent of next generation sequencing approaches, however, new insights can be generated into the genetic architecture of susceptibility [23, 30]. GWAS, for example, allow for the total genetic variation within a population to be decomposed into the combined effect of all loci acting additively (for example, ). The remaining, unexplained genetic variation, therefore, gives an upper limit for how much epistasis could potentially contribute to variation in infectious disease . Observed allele frequencies and effect-size parameters can also be estimated for a range of susceptibility loci (sensu ), and then compared to the expected intermediate allele frequencies predicted by different models of host-parasite coevolution. Yet, higher marker densities and GWAS do not completely resolve the contribution of specific gene-gene interactions to trait variation. Pairwise epistatic interactions are difficult to evaluate using the hundreds or thousands of makers required for conventional QTL mapping studies, let alone using the millions of markers required for GWAS.
Even in model systems where next generation sequencing approaches have been used extensively, we are far from a general understanding of the underlying architecture of resistance and susceptibility. In Drosophila, for example, considerable progress has been made in identifying loci underlying resistance to sigma virus transmission and verifying the importance of the resistance alleles in natural populations [32–35]. Yet resistance genes, such as ref(2)P, are often strain specific and do not completely account for the genetic variance underlying resistance to multiple virus isolates . Similarly, despite the wide range of infectious diseases that have been studied in humans using high-density genetic maps [37, 38], debate is still ongoing as to the distribution of allelic variants, and whether the genetic basis of susceptibility is based on high frequency common variants or the cumulative effects of many rare mutations . As such, we suggest that two key aspects of host-parasite biology will need more consideration as we move forward in the genomics era: first, that our understanding of the genetics of susceptibility will depend on the number of parasite genotypes included in association studies; and second, that the expectation of epistasis may not be appropriate for all measures of resistance.
How we account for the natural genetic diversity of parasites will strongly influence our understanding of the genetic architecture of susceptibility. If the causal parasite is unknown or resistance is assessed using a mix of parasite genotypes, then mechanisms unrelated to resistance could be contributing to variation in infectious disease. Competition between multiple parasite genotypes within the host [39, 40] and variation in dose-dependent effects across isolates  are all processes that would bias infection estimates. Conversely, if only a single pathogen genotype is used in a mapping study, then the relevance of any candidate loci is difficult to extend beyond the response of the host to that specific genotype. Indeed, where multiple strains of a parasite have been utilized within a mapping or association study, the results suggest that only a subset of identified QTL will confer resistance to all genotypes . A study exploring the association between mosquito immune genes and infection by Plasmodium falciparum, for example, revealed that certain candidate loci explained patterns of resistance only for specific parasite isolates . These findings highlight the need to account for the contribution of parasite genetic variation to variation in host susceptibility, otherwise the genetic architecture of disease susceptibility will be misrepresented.
Careful consideration of the trait used to characterize resistance will also be important for mapping studies. Phenotypes of resistance range from infection rates and parasite loads, through to symptoms of disease such as morbidity and mortality. Underlying each of these measures will be a range of processes involving the ability of a pathogen to penetrate the host, the recognition of parasite proteins by the host, and the subsequent immune response facilitating pathogen replication [43, 44]. Thus, the type of trait used to estimate resistance and the timing of a phenotypic assay (early or late in the infection process) could significantly influence the characterization of phenotypic and genetic variance. Estimating resistance based on symptoms of disease, for example, may more closely match classical quantitative genetic theory, whereas the initial ability of a parasite to penetrate a cell or tissue is a better fit for models of host resistance where epistasis features strongly. Indeed, initial infectivity in plants is often highly specific to certain host-parasite combinations, suggesting that susceptibility/resistance may be under control of a few major genes . Although such insights are uncommon in animals, studies are beginning to reveal that initial resistance to certain pathogens may follow a similar pattern , with subsequent symptoms of disease being more quantitative .
In summary, the contribution of epistasis to phenotypic and genetic variation is a complex issue for studies of host-parasite interactions. Unlike other quantitative traits, where theory points to the largely additive contribution to genetic variation , epistasis is a key component of many models of host-parasite interactions. As such, host-parasite research has focused on characterizing epistasis between resistance loci, rather than debating and evaluating the relative contributions of additive and epistatic genetic effects to phenotypic variance. Nonetheless, as more studies characterize the allelic variants underlying quantitative susceptibility in natural populations, the opportunity to reassess the importance of epistasis will help redefine how empirical and theoretical research approaches the genetic architecture of host-parasite interactions.
This review was supported by an EU Marie Curie Incoming International Fellowship (PIIF-GA-2009-252417) to MDH and by the Swiss National Science Foundation. We thank three anonymous reviewers, Luc F Bussière, Matthew C Tinsley and members of the Ebert group for comments on the manuscript.
- Hill WG, Goddard ME, Visscher PM: Data and theory point to mainly additive genetic variance for complex traits. PLoS Genet. 2008, 4: e1000008-10.1371/journal.pgen.1000008.PubMed CentralView ArticlePubMed
- Crow JF: On epistasis: why it is unimportant in polygenic directional selection. Philos Trans R Soc B. 2010, 365: 1241-1244. 10.1098/rstb.2009.0275.View Article
- Peters AD, Lively CM: Epistasis and the maintenance of sex. Epistasis and the Evolutionary Process. Edited by: Wolf JB, Brodie EDIII, Wade MJ. 2000, Oxford, UK: Oxford University Press, 99-112.
- Otto SP, Nuismer SL: Species interactions and the evolution of sex. Science. 2004, 304: 1018-1020. 10.1126/science.1094072.View ArticlePubMed
- Carlborg O, Haley CS: Epistasis: too often neglected in complex trait studies?. Nat Rev Genet. 2004, 5: 618-625. 10.1038/nrg1407.View ArticlePubMed
- Wilfert L, Schmid-Hempel P: The genetic architecture of susceptibility to parasites. BMCE vol Biol. 2008, 8: 187-View Article
- Falconer DS, Mackay TFC: Introduction to Quantitative Genetics. 1996, Harlow, UK: Pearson Education, 4
- Visscher PM: Sizing up human height variation. Nat Genet. 2008, 40: 489-490. 10.1038/ng0508-489.View ArticlePubMed
- Yang J, Benyamin B, McEvoy BP, Gordon S, Henders AK, Nyholt DR, Madden PA, Heath AC, Martin NG, Montgomery GW, Goddard ME, Visscher PM: Common SNPs explain a large proportion of the heritability for human height. Nat Genet. 2010, 42: 565-569. 10.1038/ng.608.PubMed CentralView ArticlePubMed
- Maher B: Personal genomes: the case of the missing heritability. Nature. 2008, 456: 18-21.View ArticlePubMed
- Zuk O, Hechter E, Sunyaev SR, Lander ES: The mystery of missing heritability: genetic interactions create phantom heritability. Proc Natl Acad Sci USA. 2012, 109: 1193-1198. 10.1073/pnas.1119675109.PubMed CentralView ArticlePubMed
- Lehner B: Molecular mechanisms of epistasis within and between genes. Trends Genet. 2011, 27: 323-331. 10.1016/j.tig.2011.05.007.View ArticlePubMed
- Phillips PC: Epistasis - the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet. 2008, 9: 855-867. 10.1038/nrg2452.PubMed CentralView ArticlePubMed
- Hamilton WD: Sex versus non-sex versus parasite. Oikos. 1980, 35: 282-290. 10.2307/3544435.View Article
- Frank SA: Recognition and polymorphism in host-parasite genetics. Philos Trans R Soc B. 1994, 346: 283-293. 10.1098/rstb.1994.0145.View Article
- Flor HH: The complementary genic systems in Flax and Flax Rust. Adv Genet. 1956, 8: 29-54.View Article
- Sasaki A: Host-parasite coevolution in a multilocus gene-for-gene system. Proc R Soc B. 2000, 267: 2183-2188. 10.1098/rspb.2000.1267.PubMed CentralView ArticlePubMed
- Fenton A, Brockhurst MA: Epistatic interactions alter dynamics of multilocus gene-for-gene coevolution. PLoS One. 2007, 2: e1156-10.1371/journal.pone.0001156.PubMed CentralView ArticlePubMed
- Peters AD, Lively CM: The Red Queen and fluctuating epistasis: a population genetic analysis of antagonistic coevolution. Am Nat. 1999, 154: 393-405. 10.1086/303247.View ArticlePubMed
- Otto SP, Michalakis Y: The evolution of recombination in changing environments. Trends Ecol Evol. 1998, 13: 145-151. 10.1016/S0169-5347(97)01260-3.View ArticlePubMed
- Kover PX, Caicedo AL: The genetic architecture of disease resistance in plants and the maintenance of recombination by parasites. Mol Ecol. 2001, 10: 1-16. 10.1046/j.1365-294X.2001.01124.x.View ArticlePubMed
- Wegner KM, Berenos C, Schmid-Hempel P: Nonadditive genetic components in resistance of the red flour beetle Tribolium castanaeum against parasite infection. Evolution. 2008, 62: 2381-2392. 10.1111/j.1558-5646.2008.00444.x.View ArticlePubMed
- Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, Kruglyak L, Mardis E, Rotimi CN, Slatkin M, Valle D, Whittemore AS, Boehnke M, Clark AG, Eichler EE, Gibson G, Haines JL, Mackay TFC, McCarroll SA, Visscher PM: Finding the missing heritability of complex diseases. Nature. 2009, 461: 747-753. 10.1038/nature08494.PubMed CentralView ArticlePubMed
- Lazzaro BP, Sceurman BK, Clark AG: Genetic basis of natural variation in D. melanogaster antibacterial immunity. Science. 2004, 303: 1873-1876. 10.1126/science.1092447.View ArticlePubMed
- Williams TN, Mwangi TW, Wambua S, Peto TEA, Weatherall DJ, Gupta S, Recker M, Penman BS, Uyoga S, Macharia A, Mwacharo JK, Snow RW, Marsh K: Negative epistasis between the malaria-protective effects of alpha+−thalassemia and the sickle cell trait. Nat Genet. 2005, 37: 1253-1257. 10.1038/ng1660.PubMed CentralView ArticlePubMed
- Martin MP, Qi Y, Gao X, Yamada E, Martin JN, Pereyra F, Colombo S, Brown EE, Shupert WL, Phair J, Goedert JJ, Buchbinder S, Kirk GD, Telenti A, Connors M, O’Brien SJ, Walker BD, Parham P, Deeks SG, McVicar DW, Carrington M: Innate partnership of HLA-B and KIR3DL1 subtypes against HIV-1. Nat Genet. 2007, 39: 733-740. 10.1038/ng2035.PubMed CentralView ArticlePubMed
- Bomblies K, Lempe J, Epple P, Warthmann N, Lanz C, Dangl JL, Weigel D: Autoimmune response as a mechanism for a Dobzhansky-Muller-type incompatibility syndrome in plants. PLoS Biol. 2007, 5: e236-10.1371/journal.pbio.0050236.PubMed CentralView ArticlePubMed
- Desrosiers M-P, Kielczewska A, Loredo-Osti J-C, Adam SG, Makrigiannis AP, Lemieux S, Pham T, Lodoen MB, Morgan K, Lanier LL, Vidal SM: Epistasis between mouse Klra and major histocompatibility complex class I loci is associated with a new mechanism of natural killer cell-mediated innate resistance to cytomegalovirus infection. Nat Genet. 2005, 37: 593-599. 10.1038/ng1564.PubMed CentralView ArticlePubMed
- Kouyos RD, Salathé M, Otto SP, Bonhoeffer S: The role of epistasis on the evolution of recombination in host-parasite coevolution. Theor Popul Biol. 2009, 75: 1-13. 10.1016/j.tpb.2008.09.007.View ArticlePubMed
- Hill AVS: Evolution, revolution and heresy in the genetics of infectious disease susceptibility. Philos Trans R Soc B. 2012, 367: 840-849. 10.1098/rstb.2011.0275.View Article
- Park J-H, Gail MH, Weinberg CR, Carroll RJ, Chung CC, Wang Z, Chanock SJ, Fraumeni JF, Chatterjee N: Distribution of allele frequencies and effect sizes and their interrelationships for common genetic susceptibility variants. Proc Natl Acad Sci USA. 2011, 108: 18026-18031. 10.1073/pnas.1114759108.PubMed CentralView ArticlePubMed
- Magwire MM, Bayer F, Webster CL, Cao C, Jiggins FM: Successive increases in the resistance of Drosophila to viral infection through a transposon insertion followed by a duplication. PLoS Genet. 2011, 7: e1002337-10.1371/journal.pgen.1002337.PubMed CentralView ArticlePubMed
- Bangham J, Kim K-W, Webster CL, Jiggins FM: Genetic variation affecting host-parasite interactions: different genes affect different aspects of sigma virus replication and transmission in Drosophila melanogaster. Genetics. 2008, 178: 2191-2199. 10.1534/genetics.107.085449.PubMed CentralView ArticlePubMed
- Bangham J, Obbard DJ, Kim K-W, Haddrill PR, Jiggins FM: The age and evolution of an antiviral resistance mutation in Drosophila melanogaster. Proc R Soc B. 2007, 274: 2027-2034. 10.1098/rspb.2007.0611.PubMed CentralView ArticlePubMed
- Wilfert L, Jiggins FM: Disease association mapping in Drosophila can be replicated in the wild. Biol Lett. 2010, 6: 666-668. 10.1098/rsbl.2010.0329.PubMed CentralView ArticlePubMed
- Carpenter JA, Hadfield JD, Bangham J, Jiggins FM: Specific interactions between host and parasite genotypes do not act as a constraint on the evolution of antiviral resistance in Drosophila. Evolution. 2012, 66: 1114-1125. 10.1111/j.1558-5646.2011.01501.x.View ArticlePubMed
- Hill AVS: Aspects of genetic susceptibility to human infectious diseases. Annu Rev Genet. 2006, 40: 469-486. 10.1146/annurev.genet.40.110405.090546.View ArticlePubMed
- Khor CC, Hibberd ML: Host-pathogen interactions revealed by human genome-wide surveys. Trends Genet. 2012, 28: 233-243. 10.1016/j.tig.2012.02.001.View ArticlePubMed
- Wegner KM, Berenos C, Schmid-Hempel P: Host genetic architecture in single and multiple infections. J Evol Biol. 2009, 22: 396-404. 10.1111/j.1420-9101.2008.01657.x.View ArticlePubMed
- Ben-Ami F, Mouton L, Ebert D: The effects of multiple infections on the expression and evolution of virulence in a Daphnia-endoparasite system. Evolution. 2008, 62: 1700-1711. 10.1111/j.1558-5646.2008.00391.x.View ArticlePubMed
- Ben-Ami F, Ebert D, Regoes RR: Pathogen dose infectivity curves as a method to analyze the distribution of host susceptibility: a quantitative assessment of maternal effects after food stress and pathogen exposure. Am Nat. 2010, 175: 106-115. 10.1086/648672.View ArticlePubMed
- Harris C, Lambrechts L, Rousset F, Abate L, Nsango SE, Fontenille D, Morlais I, Cohuet A: Polymorphisms in Anopheles gambiae immune genes associated with natural resistance to Plasmodium falciparum. PLoS Pathog. 2010, 6: e1001112-10.1371/journal.ppat.1001112.PubMed CentralView ArticlePubMed
- Schmid-Hempel P: Parasite immune evasion: a momentous molecular war. Trends Ecol Evol. 2008, 23: 318-326. 10.1016/j.tree.2008.02.011.View ArticlePubMed
- Frank SA, Schmid-Hempel P: Mechanisms of pathogenesis and the evolution of parasite virulence. J Evol Biol. 2008, 21: 396-404. 10.1111/j.1420-9101.2007.01480.x.View ArticlePubMed
- Thompson JN, Burdon JJ: Gene-for-gene coevolution between plants and parasites. Nature. 1992, 360: 121-125. 10.1038/360121a0.View Article
- Duneau D, Luijckx P, Ben-Ami F, Laforsch C, Ebert D: Resolving the infection process reveals striking differences in the contribution of environment, genetics and phylogeny to host-parasite interactions. BMC Biol. 2011, 9: 11-10.1186/1741-7007-9-11.PubMed CentralView ArticlePubMed
- Hall MD, Ebert D: Disentangling the influence of parasite genotype, host genotype and maternal environment on different stages of bacterial infection in Daphnia magna. Proc R Soc B. 2012, 279: 3176-3183. 10.1098/rspb.2012.0509.PubMed CentralView ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.