Analyzing and understanding the relationship between genotypes and phenotypes is at the heart of genetics. Research on the nematode Caenorhabditis elegans has been instrumental for unraveling genotype-phenotype relations, and has important implications for understanding the biology of mammals, but almost all studies, including forward and reverse genetic screens, are limited by investigations in only one canonical genotype. This hampers the detection and functional analysis of allelic variants, which play a key role in controlling many complex traits. It is therefore essential to explore the full potential of the natural genetic variation and evolutionary context of the genotype-phenotype map in wild C. elegans populations.
We used multiple wild C. elegans populations freshly isolated from local sites to investigate gene sequence polymorphisms and a multitude of phenotypes including the transcriptome, fitness, and behavioral traits. The genotype, transcriptome, and a number of fitness traits showed a direct link with the original site of the strains. The separation between the isolation sites was prevalent on all chromosomes, but chromosome V was the largest contributor to this variation. These results were supported by a differential food preference of the wild isolates for naturally co-existing bacterial species. Comparing polymorphic genes between the populations with a set of genes extracted from 19 different studies on gene expression in C. elegans exposed to biotic and abiotic factors, such as bacteria, osmotic pressure, and temperature, revealed a significant enrichment for genes involved in gene-environment interactions and protein degradation.
We found that wild C. elegans populations are characterized by gene-environment signatures, and we have unlocked a wealth of genotype-phenotype relations for the first time. Studying natural isolates provides a treasure trove of evidence compared with that unearthed by the current research in C. elegans, which covers only a diminutive part of the myriad of genotype-phenotype relations that are present in the wild.
Gene-environment interactionsGenotype-phenotype relationsWild C. elegans strainsTranscriptomic diversity
The nematode Caenorhabditis elegans is a widely used model species in contemporary biological research, which covers a number of disciplines including developmental biology, genetics, and evolutionary biology. Many investigations have been of paramount importance for understanding the biology of mammals, but almost all studies in C. elegans, including forward (knocking out genes by mutation) and reverse (knocking down genes using RNA interference (RNAi)) genetic screens, have been conducted in only a few strains of this organism, of which the canonical strain Bristol N2 has been the most thoroughly studied. This severely constrains the detection and functional analysis of allelic variants, which play a key role in controlling many complex traits. It is therefore essential to explore the full potential of the natural genetic variation and evolutionary context of the genotype-phenotype map in wild C. elegans populations. Moreover, the widely used strains, such as N2 and CB4856, have often been kept under controlled laboratory conditions for decades, and the handling, storage, and maintenance of worms is standardized. Such artificial regimens very likely create multiple bottlenecks that can affect the genotype-phenotype relationship. For instance, a genetic analysis of wild C. elegans strains showed that the N2 alleles of npr-1 and glb-5 most likely originated as an adaptation to laboratory conditions . Genotype-phenotype relations have been studied in model organisms of many species, such as Arabidopsis[2, 3], Drosophila, and C. elegans[5, 6]. For the full appreciation and functional characterization of genes and the genotype-phenotype relations, it is essential to consider the natural context of the species, including analysis of natural isolates, and the interaction of the species with natural challenges. Previous studies on C. elegans have investigated the organism’s response to a wide range of different environmental factors, including exposure to different bacteria [7–9], pH , osmotic pressure [9, 10], chemicals [11, 12], and temperature [9, 12–16], among others. As yet, however, these responses have not been tested in natural populations.
C. elegans is an androdioecious species with a low outcrossing rate, leading to homozygous strains in natural isolates . These strains can therefore be treated as haplotypes. In this study, we investigated variation in genotype-phenotype relations for a total of 48 strains, of which 41 were freshly isolated from two different sites in France: 20 strains from a woodland area in Santeuil (S) from rotting hogweed stems, and 21 from an orchard in Orsay (O) from rotting apples. As an out-group, we used three strains freshly isolated from sites in the Netherlands and two strains previously isolated from France. Lastly, the genotypically most diverse laboratory-kept strain CB4856 and the canonical strain Bristol N2 were added (see Additional file 1, worksheet A) [9, 18–22]; these two strains have been used in many studies to uncover genotype-phenotype relations both by comparing strains or by using some type of quantitative trait loci approach [1, 18, 23–30].
In this study, we provide insight into the genotype-phenotype relations in natural C. elegans populations through analysis of its genomic and transcriptomic variation. We found that local genetic diversity reflects site-specific signatures of environmental sensing, protein regulation, and the immune defense system. Our results indicate that exploring natural isolates in C. elegans should lead to identification of key components of genotype-phenotype relations compared with studies that are limited to the canonical strain Bristol N2.
Results and discussion
Local C. elegans populations are genotypically separable
Previous investigations have studied population genetics and genomic diversity in C. elegans, focusing on global [31–35] or local [17, 36] populations. Cutter showed that there is a lack of geographic distribution of C. elegans genome sequences , and Andersen et al. reported that chromosome-scale selective sweeps have acted to reduce genetic variation, and have shaped the global C. elegans population structure in recent history . Barrière and Félix concluded that local diversity in this organism is high . In all of these papers, diversity in C. elegans was measured as genetic diversity. To date, very few papers have been published concerning phenotypic variation in wild isolates (by this we mean isolates that have not been maintained in the laboratory for a long time), and studies used only a small number of isolates [38–40]. To our knowledge, no studies have been reported on genotype-phenotype relations in wild populations.
As mentioned above, we used wild strains from Santeuil and Orsay and a number of out-group strains. All these were genotyped based on the hybridization of genomic C. elegans DNA to microarrays (see Methods section for details), resulting in the identification of 6,368 polymorphic genes with an absolute ratio of 0.5 for the mean hybridization intensity (Figure 1A). Most of these (around 66%) were found in only one to three strains (Figure 1B), showing that between-strain variation is more abundant than between-site variation. Major hotspots of polymorphic genes were found on chromosomes II and V and minor on the other chromosomes (Figure 1C; see Additional file 1 worksheet B; see Additional files 2, 3, 4). The hotspots of polymorphic genes co-localize with the c-type lectin, nuclear hormone receptor (nhr), and math genes, and chemoreceptor gene clusters . In addition, microsatellite loci were used to determine the population structure  of the Orsay and Santeuil strains (see Additional file 1, worksheet C; see Additional file 5, panel A).
The two isolation sites were genotypically separable. This was shown by analyses of the presence of gene polymorphisms using principal component analysis (PCA) (Figure 2A), a distance matrix visualized by an unrooted neighbor-joining (NJ) tree (Figure 2B), and a minimum spanning network of the microsatellite data (see Additional file 5, panel A). The minimum spanning network, PCA, and NJ analyses showed a clear distinction between the Santeuil and the Orsay strains, with one large genetic group and several smaller genetic groups being identified for both isolation sites. In PCA, the first two principal components capture around 75% of the variation in DNA hybridizations. As shown in Figure 2A, N2 is in the far right top corner, indicating its genetic difference from all other strains. Moreover, the NJ tree showed that the Santeuil strains (groups S1, to S3) and Orsay strains are different from both N2 and CB4856. In both the PCA and NJ analyses, the Orsay group (group O) was seen to be genetically less diverse than the Santeuil group. Within the main Santeuil group (group S1: all Santeuil strains except JU1924, JU1925, JU1926, JU1934, JU1935, and JU1936), diversity was slightly larger. Furthermore, in the NJ tree two small genotypic groups were separate from the main Orsay and Santeuil groups (group S2: JU1924, JU1925, and JU1926; group S3: JU1511, JU1934, JU1935, and JU1936). The strains within these separate groups were all from the Santeuil site, except for JU1511, which is from the Orsay site. The strains from Santeuil in group S2 were isolated from a single hogweed stem. Similarly, those from S3 were also isolated from their own single hogweed stem (see Additional file 1, worksheet A). Other strains were found on different hogweed stems. We found that strains isolated from an individual hogweed stem grouped close together, but were not found to form their own separate genotypic groups.
The Dutch strains were isolated from two isolation sites, with WN2001 isolated from one site and WN2002 and WN2003 from the other. The latter two strains grouped together in the NJ tree. One of the French out-group strains, JU396, grouped with the Santeuil strains, whereas the other, JU314, was different from the rest of the strains. N2 and CB4856 were as diverse from the other out-group members as the whole out-group was from the Orsay or Santeuil strains. By comparing the four genetic groups (O, S1, S2, and S3) with the out-group, the genes that were polymorphic were identified (see Additional file 5, panel B). Group S1 appeared to be the most divergent from the out-group, with 3,181 genes that differed significantly (false discovery rate (FDR) = 0.05).
The genetic separation between the Orsay and Santeuil populations was prevalent on all chromosomes (Figure 3). From the scale of the axes, it can be seen that most chromosomes contributed to the separation between the two isolation sites and the out-group, except for chromosome II; on this chromosome the Santeuil and Orsay lines formed one group that was separate only from the out-group. Chromosome V was the largest contributor to the variation between Orsay and Santeuil, most likely because of the generally higher level of variation among these strains (see Additional file 4). Of the approximately 2,500 genes that were different between S1, S2, and S3, around 1,050 are located on chromosome V. Of all the genes on chromosome V, around 210 are polymorphic between S1, S2, and S3. This is a significant enrichment (P<1 × 10-76) when compared with the other chromosomes, of which 8 to 10% of the genes are polymorphic. Chromosomes I and X are under-represented for polymorphic genes (both around 8% and P<1 × 10-12).
The detected genotypic diversity between sites is in line with genotyping results from 31 markers using amplified fragment length polymorphism analysis, and with microsatellite results from two loci in other local C. elegans populations . However, selective sequencing using restriction site-associated DNA tags did not reveal significant local diversity, possibly because of the limited number of different genotypes per location .
The four genetic groups (O, S1, S2, and S3) identified by analysis of the ratio intensities were used as input to search for all the genes linked to each isolation site. This allowed us to identify polymorphic genes by minor hybridization differences (absolute ratio (AR) <0.5), beyond those 6,386 found by major hybridization differences (AR >0.5). In this way, we identified 3,742 genes (FDR = 0.05) that were linked to each isolation site (see Additional file 6, panel A). Of these 3,742 genes, 2,403 were already identified as highly polymorphic in the initial analysis on major hybridization differences, and an additional 1,339 genes associated with isolation were found with only minor hybridization differences. Of the genes with major hybridization differences, around 62% could not be linked to an isolation site, again showing that between-strain variation is more abundant than between-site variation. Genes linked to isolation site were fond to be enriched for the gene classes fbox, math, bath, btb, C-type lectin (clec), serpentine chemoreceptor, and nhr.
To investigate whether specific types of genes are over-represented in the group of genes that could be linked to isolation site (n = 3,742) or in the polymorphic genes not linked to isolation site (n = 3,965), enrichment analyses were performed (Table 1; see Additional file 1, worksheets D and E). The linked and unlinked groups were analyzed using three types of annotations: gene class, anatomy terms and gene ontology (GO) terms, to investigate whether certain types of genes were enriched. For the gene classes, we found that fbox, math, bath, btb, clec, serpentine chemoreceptor, and nhr genes were enriched in the group of genes linked to isolation site (Table 1, see Additional file 1, worksheets F–H; see Additional file 6, panel B). Enrichment analyses were also performed for the genes that were significantly linked to the genetic groups O, S1, S2, and S3, revealing the same gene classes as above (see Additional file 1, worksheet I). The enrichment analysis of anatomy terms or GO terms did not identify a clear pattern linked to isolation site or genetic group (see Additional file 1, worksheets J and K).
Enrichment (based on DNA-array data) of gene classesa
Polymorphic by ratio
Math, bath, btb
Nuclear hormone receptor
Pharyngeal gland toxin-related
Scramblase (phospholipid scramblase)
aEnrichment (based on DNA-array data) of gene classes in the group of genes with variation linked to isolation site (3,742 genes) and the group of polymorphic genes that were not linked to isolation site (3,965 genes).
bSignificance in –log10(p).
cGene classes with group sizes ≥ 6, overlap ≥ 3 genes, and significance ≥ 2.5.
Local C. elegans populations are separable on the basis of their transcriptomes
Next, the influence of natural genetic variation on gene expression was studied by measuring the transcript levels of all genes of all strains, corrected for differential hybridization. An NJ tree was constructed based on the RNA hybridization data (Figure 2C). This tree showed that the genetic groups O, S1, S2, S3, and the out-group, were also separable based on gene expression level. Again, CB4856 and N2 differed from most of the other natural strains. Isolation site and genetic group influenced the variation in RNA levels of 6,930 and 7,996 genes, respectively (see Additional file 7). Most of these genes (77% and 78%) were not influenced by DNA polymorphisms (which were the cause of variation in 2,330 genes) or genotype (affecting 773 or 1,336 genes, depending on the genetic group or isolation site that was incorporated in the model).
Expressed genes linked to isolation site are enriched for the gene classes clec, fbxa, bath, and nhr
Enrichment analyses were performed for the genes whose RNA levels were influenced by isolation site or genetic group (see Additional file 1, worksheets L–N). The gene classes clec, fbxa, bath, and nhr were significantly enriched, thus yielding similar results to our DNA-level enrichment analyses. In addition, several nematode-specific peptide families were also enriched.
Together, these results show that at the genomic level, variation between local populations is enriched for the gene classes fbox, math, bath, btb, clec, serpentine chemoreceptor, and nhr, many of which are involved in gene-environment interactions [42–45]. Interestingly, we found that the gene classes clec, fbxa, bath, and nhr were also enriched, with variation linked to isolation site on the transcriptional level, even though the strains that originated from different sites were cultured under the same conditions. It has been shown that many of these groups of genes are differentially expressed after pathogen exposure, and thus could be involved in the immune response. For instance, C-type lectin domain-containing proteins (CTLD proteins, gene class clec) have been repeatedly proposed to contribute to nematode immunity . The immune function of these genes is supported by their specific upregulation in infected C. elegans[43, 44, 47–50] and also by reduced immune phenotypes after RNAi knockdown of clec-70, clec-17, clec-60, or clec-86[7, 51]. Furthermore, F-box proteins (gene class fbxa) are part of the protein degradation pathway . In this pathway, substrates for degradation are ubiquinated to be recognizable by the 26S proteasome. Taken together, these results show that local genetic diversity reflects site-specific signatures of immune response and protein degradation pathways in C. elegans. We also found that, in addition to genotypes, transcript profiles can be used to distinguish between local C. elegans populations, and may indicate the functional importance of the identified genes or gene classes in different environments .
Polymorphic genes are enriched for genes involved in gene-environment interactions
Polymorphic genes between the populations were compared using a set of genes extracted from 19 different studies on gene expression in C. elegans exposed to biotic and abiotic factors (see Additional file 1, worksheet S). In the wild, C. elegans is exposed to many different bacteria. In studies on the effect on gene expression of various bacteria, such as Lactobacillus rhamnosus, Microbacterium. nematophilum, Drechmeria coniospora, Serratia marcescens, Xenorhabdus nematophila, and Pseudomonas aeruginosa, c-type lectins were always found to be differentially expressed, as were in most cases the F-box protein genes. Receptors that are used to sense the environment, such as nhrs and serpentine receptors are also frequently differentially expressed when C. elegans is exposed to different bacteria. In response to abiotic factors such as temperature , osmotic stress  or ions [57, 58], the c-type lectins and F-box protein genes are also always differentially expressed. Furthermore, the c-type lectins, F-box protein genes, and receptor genes are differentially expressed in the presence of various other substances that can be encountered by wild C. elegans strains, including tryptophan , β-naphthoflavone , H2S , fluoranthene , hormones , sediment , humic substances , and pesticides [12, 66, 67]. The other gene classes (bath, math, and btb) that are importaty for the variation seen between the locations at which the wild C. elegans strains were isolated were also found to be differentially expressed in several of the aforementioned environmental studies. Altogether, the differential expression of genes in environmental studies indicates that the genes that are important for the variation between local populations of C. elegans are indeed of significance for interactions with the environment.
Local populations are separable for some fitness traits
The next question was whether the genetic polymorphisms between strains could influence fitness trait variation. C. elegans strains varied significantly in all traits except population size on Escherichia coli OP50 (Table 2). As all tests were performed, under standardized laboratory conditions and the variation between strains could be attributed to the genotype, showing that most phenotypic variation has a genetic basis. A genetic determinant has been found for some of these traits [5, 13]. We found a significant influence of the genetic groups on the population size of C. elegans on Bacillus thuringiensis NRRL B-18BT247 and on the length/width ratio (see Additional file 1, worksheet O). We additionally reconstructed an NJ tree using phenotypic trait variation; however, phenotypic variation did not separate the two isolation sites or any of the four genetic groups. Nevertheless, some phenotypes were specific to an isolation site or to certain genetic groups. Even though the two strains with the largest length/width ratio were from Santeuil, most worms from Santeuil were significantly shorter, had a significantly smaller length/width ratio, and so were stouter than worms from Orsay (Table 2, see Additional file 8). In addition, the generation time of worms from Santeuil was significantly shorter (Table 2) (more details can be found in Additional file 1, worksheet O).
Analysis of phenotypic variation between strains (ANOVA) and between sites
Populations per genotype. Na
3.136 ± 695
3278 ± 670
2988 ± 707
3.385 ± 750
4.83 × 10-4b
3369 ± 702
3402 ± 815
44 ± 39
<2.2 × 10-16b
53 ± 49
34 ± 21
Development time, days
2 to 5
1.79 ± 0.07
4.16 × 10-4b
1.79 ± 0.07
1.79 ± 0.07
Generation time, days
2 to 5
1.98 ± 0.08
3.13 × 10-6b
1.98 ± 0.077
1.97 ± 0.77
2 to 5
4.35 ± 0.70
4.85 ± 0.97
2 to 6
1.089 ± 58
4.83 × 10-5b
1107 ± 33
1070 ± 72
2 to 6
43.96 ± 2.63
1.33 × 10-9b
44.45 ± 1.99
43.46 ± 3.13
2 to 6
1.67 ± 0.24
1.12 × 10-6b
1.73 ± 0.19
1.61 ± 0.28
2 to 6
24.79 ± 0.82
1.19 × 10-6b
24.94 ± 0.79
24.64 ± 0.83
1.40 × 10-5b
Abbreviations: BT247 nematocidal Bacillus thuringiensis NRRL B-18247, DSM non-nematocidal Bacillus thuringiensis DSM-350, NA not applicable.
aDescribes the number of replicate populations of worms (each population more than 100 worms) which were tested per genotype, thus the number of such replicate populations (N) per isolation site is therefore 20 times higher than the replicate populations per genotype (as we have 20 genotypes per isolation site).
cWhen all observations per genotype were used instead of the mean per genotype, the populations from the different isolations sites were significantly different (P<0.0014).
Local populations are separable with regard to food preference
We then investigated if the wild strains differed from each other in their food preference behavior for naturally co-existing bacteria and for E. coli. Under the laboratory conditions we used, the worms preferred E. coli OP50 over all other bacteria, followed by Erwinia rhapontici, Sphingobacterium sp., Rhodococcus erythropolis, and Lactococcus lactis (Figure 4; see Additional file 1, worksheet P). Worms from Santeuil preferred E. rhapontici (isolated from Santeuil) equally to E. coli., whereas worms from Orsay preferred E. coli over E. rhapontici. This suggests that Santeuil worms could have a slight preference for the bacterium species with which they are more likely to be familiar for an overview of the average preference of all strains, see Additional file 9B; for the complete dataset, see Additional file 1, worksheet Q (Wormcount and Choice Index) and see Additional file 1, worksheet R (Significances). The most significant differences found between the Orsay and Santeuil strains were for the bacterial combinations E. coli OP50/E. rhapontici, E. coli OP50/R. erythropolis. and E. rhapontici/Sphingobacterium sp. (Figure 4).
The canonical strains Bristol N2 and CB4856 are genetic outliers
We finally investigated how far the canonical wild types Bristol N2 and CB4856 relate to the recently isolated natural strains. Both genotypic and transcriptomic analyses identified Bristol N2 and CB4856 as clearly differing from the standing genetic variation of the wild isolated strains. This also applied when we compared N2 and CB4856 with some of the other out-group strains. It is assumed that many alleles in CB4856 and N2 are laboratory-derived because both N2 and CB4856 went through multiple phenotypic and genetic bottlenecks over the past decades of laboratory maintenance [1, 68]. Together with our results, this shows that the genotype-phenotypes relations in both N2 and CB4856 are likely to be obscured by a number of laboratory-derived alleles with large effect. This might impede the detection and functional analysis of many other genes and alleles that by themselves have small effects on phenotypes, but together might have a large effect.
We measured a large variety of phenotypes, including the transcriptome, for multiple wild genotypes in C. elegans collected from different locally separated sites, under the same and constant laboratory conditions. The wild genotypes could be classified according to their site, based on genotypic and transcriptome analyses. These differences were also reflected in several fitness traits; however, because of the limited number of populations sampled, we were not able to associate fitness traits to the different sites. Nevertheless our data provide the basis for uncovering site-specific genotypic and phenotypic signatures. Future work should aim to provide insight into genetic drift or adaptation as the major attribute shaping C. elegans local evolution. Most likely, both processes play a role, depending on the gene or genetic element in question. However, for some gene classes, such as the chemoreceptors, it is tempting to think they are polymorphic as a result of adaptations to specific habitats.
In summary, we have unlocked a wealth of genotype-phenotype relations, indicating that the canonical wild type is a genetic outlier and that its genotype-phenotype characteristics represent a diminutive part of the myriad of interactions present in the wild.
Nematode and bacterial strains
The main set of strains of C. elegans comprised 41 new strains that were isolated (by M-A Félix) from two different locations in France (Orsay and Santeuil). The out-group comprised three new strains isolated in the Netherlands, two strains previously isolated in France, and the most diverse canonical strains N2 (Bristol) and CB4856 (Hawaii) [16, 18–21, 23–29, 67] (see Additional file 1, worksheet A for details). All strains were routinely maintained on nematode growth medium (NGM) with E. coli OP50 as a food source . E. coli OP50 was used in all experiments, except for the population growth experiment, in which B. thuringiensis NRRL B-18247 and B. thuringiensis DSM-350 were used next to E. coli. In the food preference experiment, in addition to E. coli OP50, E. rhapontici and R.s erythropolis (both isolated from and unique for Santeuil), and L. lactis and Sphingobacterium sp. (both isolated from and unique for Orsay) were used (all bacteria were isolated and identified by M-A. Félix and B. Samuel).
Genomic DNA analysis: worm culturing, DNA isolation, DNA microarrays, and statistical analysis
Gene expression microarrays (C. elegans (V2) Gene Expression Microarray 4X44K slides; Agilent Technologies, Santa Clara, CA, USA) were used to co-hybridize N2 versus wild-type DNA, allowing for analysis of population differences based on gene polymorphisms. Fresh populations of mixed stages were cultured for 96 hours at 20°C before sampling. All procedures were performed as recommended by the manufacturer (Agilent; Oligonucleotide Array-Based CGH for Genomic DNA Analysis; Enzymatic Labeling for Blood, Cells or Tissues (with a High Throughput Option) protocol, version 6.3). Genomic DNA isolation was performed with a commercial kit (NucleoSpin Tissue Kit; Machery-Nagel, Düren, Germany). For processing the data of the DNA microarrays, the limma package for the R software environment was used . Background correction was performed using the Substract method. LOESS (locally weighted regression scatter plot smoothing) within-array normalization and scale between-array normalization were used to process the raw intensity values . Genotypes were compared by calculating, per spot, the ratio of the intensities of each strain with the mean intensity over all strains. Genes with a ratio of greater than 0.5 or less than −0.5 were considered polymorphic. PCA was performed using the polymorphic genes from all strains. The unrooted NJ tree was compiled from a distance matrix made from the ratios of the polymorphic genes with the R package phangorn . Linear models were used to calculate the significance of the variation in DNA hybridization intensities linked to the isolation sites and the identified genetic groups. The model used to determine linkage to isolation site was:
where I is the DNA hybridization intensity of genotype x, L is the isolation site (out-group site, Orsay, or Santeuil) and E is the error. For linkage to Santeuil we took –log10(p) of 2.3 as threshold, while for linkage to Orsay we took –log10(p) of 2.7 as threshold. For linkage to genetic group the model used to determine linkage to isolation site was:
where I is the DNA hybridization intensity of genotype x, G is the genetic group (out-group, S1, S2, or S3) and E is the error. The thresholds used were –log10(p) 2.5, 2.3, 3.2, and 3.3 for O, S1, S2, and S3 respectively (for the number of genes per genetic group and overlapping genes (see Additional file 5B). The significance thresholds, adjusted for multiple testing, were determined by permutation, for which the same model was used, with the spot intensities randomly distributed over the genotypes (a P value that gave a ratio of false positives/true positives of <0.05 was used).
mRNA analysis: culturing, isolation, RNA-microarrays, and statistical analysis
For the mRNA microarrays, any males were discarded and only hermaphrodites grown on E. coli OP50 were used. Two independent replicates of each strain (synchronized late L4 larvae) were analyzed. For mRNA isolation, a commercial kit (RNEasy Micro Kit; Qiagen, Hilden, Germany) was used, following the manufacturer’s protocol (Purification of Total RNA from Animal and Human Tissues) provided with the kit, with modified lysing procedure (see Additional file 10). The microarrays used were as described above (C. elegans (V2) Gene Expression Microarray 4X44K slides; Agilent) following the manufacturer’s instructions. For processing of the RNA microarray data, the limma package for the R software environment was used. No background correction was performed, as recommended previously . For within-array normalization, the LOESS method was used and for between-array normalization, the quantile method was used. Expression variation was determined by linear models. The variation in intensities could be explained by batch, DNA hybridization, genetic group, and genotype (see also the paragraph on statistics in the Genomic DNA Analysis section). Significance thresholds, adjusted for multiple testing, were determined by permutations of all spots on the array. In the permutations, the RNA hybridization intensities were randomly distributed over the genotypes and batches (the P-value that gave a ratio of false positives/true positives of <0.05 was used).
All enrichment analyses were performed using a hypergeometric test. The number of genes selected by a criterion in this paper (for example, linked to a genetic group) were compared with the genes with a specific annotation (for example, c-type lectin). The chance that a number of genes will be overlapping depends on the total group size, the number of genes selected, and the number of genes with a specific annotation. This chance, together with the number of overlapping genes, can be used in a hypergeometric test. Annotation groups were considered enriched when the overlap was more than three genes and the significance –log10(p) was greater than 2.5.
Polymorphic genes between populations were compared with a set of differentially expressed genes extracted from a diverse set of gene-environment interaction studies in C. elegans. All enrichment analyses were performed using a hypergeometric test.
Development time and generation time
L1 juveniles fed with E. coli OP50 were incubated at 24°C and inspected at regular time intervals. Development time was defined as the period between worm inoculation and the moment at which the first worms with open vulva were seen. Generation time was the period between inoculation and the first appearance of eggs.
Length and width
Analysis of length and width of young gravid worms was performed with a particle analyzer (RapidVue; Beckman Coulter Inc., Miami, FL, USA). In total, 2000 worms per strain were measured.
To measure population growth, 10 single L4 worms were placed onto a bacterial lawn, and cultured at 20°C. After 96 hours, the number of worms on the plate was counted.
Food preference assay
To test the food preference of the worms, 5 μl drops of two different bacteria were placed on NGM in each well of a 12-well plate (see Additional file 1, worksheet P). A drop with juvenile nematodes up to he stage of L2 was then added to each well, and the plate was incubated overnight at 20°C. The worms on each bacterium were then counted and the Choice Index was calculated .
We used ANOVA to calculate the influence of strain/genotype on the phenotypic variation, by regressing the individual measurements over the strains/genotypes. We used a two-sided t-test, assuming unequal variance to determine if phenotypes were significantly different between isolation sites. ANOVA was used to determine if phenotypes were significantly different between genetic groups.
Population genetic differentiation was assessed using six microsatellite loci (see Additional file 1, worksheet C), which we previously identified to be highly variable in both natural and experimental C. elegans populations ( and see Additional file 10 for details).
Microarray data (both RNA and DNA) can be found at .
We thank Marie-Anne Félix (Paris, France) for providing the nematode strains from France; Buck Samuel (Boston, USA) for providing the wild bacteria; Joost Riksen (Wageningen, the Netherlands) and Janina Brakel (Kiel, Germany) for technical assistance; and K. Joeri van der Velde and Morris A. Swertz (Groningen Bioinformatics Centre, Groningen, the Netherlands) for assistance with WormQTL. We thank the compilers of WormBase for making it a versatile and important resource for C. elegans research. RJMV was funded by the NWO-ALW (project 855.01.151). RC and BPB were funded by ESF-EEFG (09-EuroEEFG-FP-002/G.0998.10N); JEK and LBS were funded by the ERASysbio-plus ZonMW project GRAPPLE (project 90201066); MGS was supported by Graduate School Production Ecology and Resource Conservation; and WC, WY, and HS were funded by NEMADAPT (DFG grant SCHU 1415/11-1). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Laboratory of Nematology, Wageningen University
Biology Department, Ghent University
Department of Evolutionary Ecology and Genetics, Zoological Institute, Christian Albrechts-Universitaet zu Kiel
McGrath PT, Rockman MV, Zimmer M, Jang H, Macosko EZ, Kruglyak L, Bargmann CI: Quantitative mapping of a digenic behavioral trait implicates globin variation inC. eleganssensory behaviors.Neuron 2009, 61:692–699.PubMedView Article
Snoek LB, Terpstra IR, Dekter R, Van den Ackerveken G, Peeters AJ: Genetical Genomics reveals large scale genotype-by-environment interactions inArabidopsis thaliana.Front Gen 2012, 3:317.
Keurentjes JJ, Fu J, Terpstra IR, Garcia JM, van den Ackerveken G, Snoek LB, Peeters AJ, Vreugdenhil D, Koornneef M, Jansen RC: Regulatory network construction inArabidopsisby using genome-wide gene expression quantitative trait loci.Proc Natl Acad Sci U S A 2007, 104:1708–1713.PubMedView Article
Mackay TFC, Richards S, Stone EA, Barbadilla A, Ayroles JF, Zhu DH, Casillas S, Han Y, Magwire MM, Cridland JM, et al.: TheDrosophila melanogastergenetic reference panel.Nature 2012, 482:173–178.PubMedView Article
Gutteling EW, Doroszuk A, Riksen JA, Prokop Z, Reszka J, Kammenga JE: Environmental influence on the genetic correlations between life-history traits inCaenorhabditis elegans.Heredity (Edinb) 2007, 98:206–213.View Article
Palopoli MF, Rockman MV, TinMaung A, Ramsay C, Curwen S, Aduna A, Laurita J, Kruglyak L: Molecular basis of the copulatory plug polymorphism inCaenorhabditis elegans.Nature 2008, 454:1019-U1066.PubMedView Article
O’Rourke D, Baban D, Demidova M, Mott R, Hodgkin J: Genomic clusters, putative pathogen recognition molecules, and antimicrobial genes are induced by infection ofC.eleganswithM.nematophilum.Genome Res 2006, 16:1005–1016.PubMedView Article
Sinha A, Rae R, Iatsenko I, Sommer RJ: System wide analysis of the evolution of innate immunity in the nematode model speciesCaenorhabditis elegansandPristionchus pacificus.PloS one 2012, 7:e44255.PubMedView Article
Grishkevich V, Ben-Elazar S, Hashimshony T, Hunter CP, Yanai I: A genomic bias for genotype-environment interactions inC. elegans.Mol Syst Biol 2012, 8:587.PubMedView Article
Rohlfing AK, Miteva Y, Hannenhalli S, Lamitina T: Genetic and physiological activation of osmosensitive gene expression mimics transcriptional signatures of pathogen infection inC. elegans.PloS one 2010, 5:e9010.PubMedView Article
Jonker MJ, Sweijen RAJC, Kammenga JE: Toxicity of simple mixtures to the nematodeCaenorhabditis elegansin relation to soil sorption.Environ Toxicol Chem 2004, 23:480–488.PubMedView Article
Viñuela A, Snoek LB, Riksen JA, Kammenga JE: Gene expression modifications by temperature-toxicants interactions inCaenorhabditis elegans.PloS one 2011, 6:e24676.PubMedView Article
Li Y, Alvarez OA, Gutteling EW, Tijsterman M, Fu J, Riksen JA, Hazendonk E, Prins P, Plasterk RH, Jansen RC, et al.: Mapping determinants of gene expression plasticity by genetical genomics inC. elegans.PLoS genetics 2006, 2:e222.PubMedView Article
Harvey SC, Viney ME: Thermal variation reveals natural variation between isolates ofCaenorhabditis elegans.J Exp Zool B Mol Dev Evol 2007, 308:409–416.PubMedView Article
Gaertner BE, Parmenter MD, Rockman MV, Kruglyak L, Phillips PC: More than the sum of its parts: a complex epistatic network underlies natural variation in thermal preference behavior inCaenorhabditis elegans.Genetics 2012, 192:1533–1542.PubMedView Article
Barriere A, Félix MA: High local genetic diversity and low outcrossing rate inCaenorhabditis elegansnatural populations.Curr Biol 2005, 15:1176–1184.PubMedView Article
Viñuela A, Snoek LB, Riksen JA, Kammenga JE: Aging uncouples heritability and expression-QTL inCaenorhabditis elegans.G3 2012, 2:597–605.PubMedView Article
Capra EJ, Skrovanek SM, Kruglyak L: Comparative developmental expression profiling of twoC. elegansisolates.PloS one 2008, 3:e4055.PubMedView Article
Seidel HS, Ailion M, Li J, van Oudenaarden A, Rockman MV, Kruglyak L: A novel sperm-delivered toxin causes late-stage embryo lethality and transmission ratio distortion inC. elegans.PLoS biology 2011, 9:e1001115.PubMedView Article
Rockman MV, Kruglyak L: Recombinational landscape and population genomics ofCaenorhabditis elegans.PLoS Genet 2009, 5:e1000419.PubMedView Article
Kammenga JE, Doroszuk A, Riksen JAG, Hazendonk E, Spiridon L, Petrescu AJ, Tijsterman M, Plasterk RHA, Bakker J: ACaenorhabditis eleganswild type defies the temperature-size rule owing to a single nucleotide polymorphism intra-3.Plos Genetics 2007, 3:358–366.View Article
Snoek LB, Van der Velde KJ, Arends D, Li Y, Beyer A, Elvin M, Fisher J, Hajnal A, Hengartner MO, Poulin GB, et al.: WormQTL–public archive and analysis web portal for natural variation data inCaenorhabditis spp.Nucleic Acids Res 2013, 41:D738-D743. http://www.WormQTL.orgPubMedView Article
Green JW, Snoek LB: Kammenga JE. Harvey SC: Genetic mapping of variation in dauer larvae development in growing populations of Caenorhabditis elegans. Heredity; 2013. doi:10.1038/hdy.2013.50
Rodriguez M, Snoek LB, Riksen JA, Bevers RP, Kammenga JE: Genetic variation for stress-response hormesis inC.eleganslifespan.Exp Gerontol 2012, 47:581–587.PubMedView Article
Elvin M, Snoek LB, Frejno M, Klemstein U, Kammenga JE, Poulin GB: A fitness assay for comparing RNAi effects across multipleC. elegansgenotypes.BMC genomics 2011, 12:510.PubMedView Article
Li Y, Breitling R, Snoek LB, van der Velde KJ, Swertz MA, Riksen J, Jansen RC, Kammenga JE: Global genetic robustness of the alternative splicing machinery inCaenorhabditis elegans.Genetics 2010, 186:405–410.PubMedView Article
Doroszuk A, Snoek LB, Fradin E, Riksen J, Kammenga J: A genome-wide library of CB4856/N2 introgression lines ofCaenorhabditis elegans.Nucleic Acids Res 2009, 37:e110.PubMedView Article
Rockman MV, Skrovanek SS, Kruglyak L: Selection at linked sites shapes heritable phenotypic variation inC. elegans.Science 2010, 330:372–376.PubMedView Article
Kammenga JE, Phillips PC, De Bono M, Doroszuk A: Beyond induced mutants: using worms to study natural variation in genetic pathways.Trends Gen 2008, 24:178–185.View Article
Cutter AD: Nucleotide polymorphism and linkage disequilibrium in wild populations of the partial selferCaenorhabditis elegans.Genetics 2006, 172:171–184.PubMedView Article
Koch R, van Luenen HG, van der Horst M, Thijssen KL, Plasterk RH: Single nucleotide polymorphisms in wild isolates ofCaenorhabditis elegans.Genome Res 2000, 10:1690–1696.PubMedView Article
Maydan JS, Lorch A, Edgley ML, Flibotte S, Moerman DG: Copy number variation in the genomes of twelve natural isolates ofCaenorhabditis elegans.BMC Genomics 2010, 11:62.PubMedView Article
Schulenburg H, Muller S: Natural variation in the response ofCaenorhabditis eleganstowardsBacillus thuringiensis.Parasitology 2004, 128:433–443.PubMedView Article
Sivasundar A, Hey J: Population genetics ofCaenorhabditis elegans: the paradox of low polymorphism in a widespread species.Genetics 2003, 163:147–157.PubMed
Haber M, Schungel M, Putz A, Muller S, Hasert B, Schulenburg H: Evolutionary history ofCaenorhabditis elegansinferred from microsatellites: evidence for spatial and temporal genetic differentiation and the occurrence of outbreeding.Mol Biol Evol 2005, 22:160–173.PubMedView Article
Sivamaruthi BS, Ganguli A, Kumar M, Bhaviya S, Pandian SK, Balamurugan K: Caenorhabditis elegansas a model for studyingCronobacter sakazakiiATCC BAA-894 pathogenesis.J Basic Microbiol 2011, 51:540–549.PubMedView Article
Taubert S, Ward JD, Yamamoto KR: Nuclear hormone receptors in nematodes: evolution and function.Mol Cell Endocrinol 2011, 334:49–55.PubMedView Article
Schulenburg H, Hoeppner MP, Weiner J 3rd, Bornberg-Bauer E: Specificity of the innate immune system and diversity of C-type lectin domain (CTLD) proteins in the nematodeCaenorhabditis elegans.Immunobiol 2008, 213:237–250.View Article
Bogaerts A, Temmerman L, Boerjan B, Husson SJ, Schoofs L, Verleyen P: A differential proteomics study ofCaenorhabditis elegansinfected withAeromonas hydrophila.Developmental and comparative immunology 2010, 34:690–698.PubMedView Article
Sahu SN, Lewis J, Patel I, Bozdag S, Lee JH, LeClerc JE, Cinar HN: Genomic analysis of immune response againstVibrio choleraehemolysin inCaenorhabditis elegans.PloS one 2012, 7:e38200.PubMedView Article
Irazoqui JE, Troemel ER, Feinbaum RL, Luhachack LG, Cezairliyan BO, Ausubel FM: Distinct pathogenesis and host responses during infection ofC. elegansbyP. aeruginosaandS. aureus.PLoS pathogens 2010, 6:e1000982.PubMedView Article
Ho MS, Tsai PI, Chien CT: F-box proteins: the key to protein degradation.J Biomed Sci 2006, 13:181–191.PubMedView Article
Coolon JD, Jones KL, Todd TC, Carr BC, Herman MA: Caenorhabditis elegansgenomic response to soil bacteria predicts environment-specific genetic effects on life history traits.PLoS Genet 2009, 5:e1000503.PubMedView Article
Grompone G, Martorell P, Llopis S, Gonzalez N, Genoves S, Mulet AP, Fernandez-Calero T, Tiscornia I, Bollati-Fogolin M, Chambaud I, et al.: Anti-inflammatoryLactobacillus rhamnosusCNCM I-3690 strain protects against oxidative stress and increases lifespan inCaenorhabditis elegans.PloS one 2012, 7:e52493.PubMedView Article
Pujol N, Zugasti O, Wong D, Couillault C, Kurz CL, Schulenburg H, Ewbank JJ: Anti-fungal innate immunity inC. elegansis enhanced by evolutionary diversification of antimicrobial peptides.PLoS pathogens 2008, 4:e1000105.PubMedView Article
Troemel ER, Chu SW, Reinke V, Lee SS, Ausubel FM, Kim DH: p38 MAPK regulates expression of immune response genes and contributes to longevity inC. elegans.PLoS Genet 2006, 2:e183.PubMedView Article
Cui Y, McBride SJ, Boyd WA, Alper S, Freedman JH: Toxicogenomic analysis ofCaenorhabditis elegansreveals novel genes and pathways involved in the resistance to cadmium toxicity.Genome biology 2007, 8:R122.PubMedView Article
Hattori A, Mizuno T, Akamatsu M, Hisamoto N, Matsumoto K: TheCaenorhabditis elegansJNK signaling pathway activates expression of stress response genes by derepressing the Fos/HDAC repressor complex.PLoS Genet 2013, 9:e1003315.PubMedView Article
Gracida X, Eckmann CR: Fertility and germline stem cell maintenance under different diets requires nhr-114/HNF4 inC.elegans.Curr Biol 2013, 23:607–613.PubMedView Article
Miller DL, Budde MW, Roth MB: HIF-1 and SKN-1 coordinate the transcriptional response to hydrogen sulfide inCaenorhabditis elegans.PloS one 2011, 6:e25476.PubMedView Article
Swain S, Wren JF, Sturzenbaum SR, Kille P, Morgan AJ, Jager T, Jonker MJ, Hankard PK, Svendsen C, Owen J, et al.: Linking toxicant physiological mode of action with induced gene expression changes inCaenorhabditis elegans.BMC Syst Biol 2010, 4:32.PubMedView Article
Custodia N, Won SJ, Novillo A, Wieland M, Li C, Callard IP: Caenorhabditis elegansas an environmental monitor using DNA microarray analysis.Ann N Y Acad Sci 2001, 948:32–42.PubMedView Article
Menzel R, Swain SC, Hoess S, Claus E, Menzel S, Steinberg CE, Reifferscheid G, Sturzenbaum SR: Gene expression profiling to characterize sediment toxicity–a pilot study usingCaenorhabditis eleganswhole genome microarrays.BMC Genomics 2009, 10:160.PubMedView Article
Menzel R, Menzel S, Swain SC, Pietsch K, Tiedt S, Witczak J, Sturzenbaum SR, Steinberg CE: The nematodeCaenorhabditis elegans, stress and aging: identifying the complex interplay of genetic pathways following the treatment with humic substances.Front Gen 2012, 3:50.
Lewis JA, Szilagyi M, Gehman E, Dennis WE, Jackson DA: Distinct patterns of gene and protein expression elicited by organophosphorus pesticides inCaenorhabditis elegans.BMC Genomics 2009, 10:202.PubMedView Article
Viñuela A, Snoek LB, Riksen JA, Kammenga JE: Genome-wide gene expression regulation as a function of genotype and age inC.elegans.Genome Res 2010, 20:929–937.PubMedView Article
Weber KP, De S, Kozarewa I, Turner DJ, Babu MM, de Bono M: Whole genome sequencing highlights genetic changes associated with laboratory domestication ofC. elegans.PloS one 2010, 5:e13922.PubMedView Article
Sulston JE, Hodgkin J: Methods. In The nematode Caenorhabditis elegans. Edited by: Wood WB. New York: Cold Spring Harbor Laboratory; 1988:587–606.
Boehnisch C, Wong D, Habig M, Isermann K, Michiels NK, Roeder T, May RC, Schulenburg H: Protist-type lysozymes of the nematodeCaenorhabditis eleganscontribute to resistance against pathogenicBacillus thuringiensis.PloS one 2011, 6:e24619.PubMedView Article
Smyth GK: Limma: linear models for microarray data. In Bioinformatics and computational biology solutions using R and Bioconductor. Edited by: Gentleman R, Carey VJ, Huber W, Irizarry RA, Dudoit S. New York: Springer Science + Business Media; 2005:397–420.View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.