Ontogeny and phylogeny: molecular signatures of selection, constraint, and temporal pleiotropy in the development of Drosophila
© Artieri et al; licensee BioMed Central Ltd. 2009
Received: 20 November 2008
Accepted: 21 July 2009
Published: 21 July 2009
Karl Ernst Von Baer noted that species tend to show greater morphological divergence in later stages of development when compared to earlier stages. Darwin originally interpreted these observations via a selectionist framework, suggesting that divergence should be greatest during ontogenic stages in which organisms experienced varying 'conditions of existence' and opportunity for differential selection. Modern hypotheses have focused on the notion that genes and structures involved in early development will be under stronger purifying selection due to the deleterious pleiotropic effects of mutations propagating over the course of ontogeny, also known as the developmental constraint hypothesis.
Using developmental stage-specific expressed sequence tag (EST) libraries, we tested the 2 hypotheses by comparing the rates of evolution of 7,180 genes obtained from 6 species of the Drosophila melanogaster group with respect to ontogeny, and sex and reproduction-related functions in gonadal tissues. Supporting morphological observations, we found evidence of a pattern of increasing mean evolutionary rate in genes that are expressed in subsequent stages of development. Furthermore, supporting expectations that early expressed genes are constrained in divergence, we found that embryo stage genes are involved in a higher mean number of interactions as compared to later stages. We noted that the accelerated divergence of genes in the adult stage is explained by those expressed specifically in the male gonads, whose divergence is driven by positive selection. In addition, accelerated gonadal gene divergence occurs only in the adult stage, suggesting that the effects of selection are observed primarily at the stages during which they are expected occur. Finally, we also found a significant correlation between temporal specificity of gene expression and evolutionary rate, supporting expectations that genes with ubiquitous expression are under stronger constraint.
Taken together, these results support both the developmental constraint hypothesis limiting the divergence of early expressed developmentally important genes, leading to a gradient of divergence rates over ontogeny (embryonic < larval/pupal < adult), as well as Darwin's 'selection opportunity' hypothesis leading to increased divergence in adults, particularly in the case of reproductive tissues. We suggest that a constraint early/opportunity late model best explains divergence over ontogeny.
For over a century, developmental biologists have noted an ontogenic pattern among evolutionary relationships: earlier developmental stages are morphologically more similar across species than later stages; this is also known as Von Baer's third law [1–4]. While more recent studies in vertebrates have determined that the very earliest stages of ontogeny (for example, gastrulation) may be subject to substantial variation even among closely related species, upon reaching the tailbud stage, embryos begin to share more similarity in appearance that gradually declines with subsequent development . This 'hourglass' model of developmental similarity among vertebrates suggests that, while certain stages of development undergo substantial change over evolutionary time, there exists a significant conservation of the mechanisms underlying development across vertebrates [6–9]. Darwin originally interpreted Von Baer's observations via a selectionist framework [10, 11]. He suggested that divergence should be greatest during ontogenic stages in which organisms experienced the most varying 'conditions of existence' and, as a result, occasioned opportunity for differential selection . Embryos of varied species are therefore more similar than adults due to exposure to very similar fetal environments. Furthermore, he noted that derived features rarely appeared in an organism before the stage when they were used, indicating that the effect of selection was also specific to the stage where selection pressure actually occurred. This observation was important to his overall hypothesis, as selection pressures occurring during one stage that selected for traits expressed in other stages would be inconsistent with Von Baer's observations. Using secondary sexual traits as a primary example, Darwin compiled a large number of observations indicating that male-specific structures known to be highly divergent even among closely related species rarely developed until reproductive maturity was reached [10, 12].
Modern interpretations of Von Baer's third law have focused on another, non-mutually exclusive mechanism: genes implicated in early aspects of development are more likely to regulate a large number of downstream effectors via hierarchical regulatory cascades, and are thus more evolutionarily constrained due to the large deleterious pleiotropic effects of mutations. This is known as the developmental constraint hypothesis [3, 13, 14]. The complex hierarchical nature of gene regulatory networks has become a focus of major interest in the field of organismal development [15, 16] with special attention being paid in particular to those network modules critical to early development and conserved over broad evolutionary distances . For instance, the well known homeotic genes involved in establishing the anterior/posterior axis in the early development of most metazoans provide a striking example of highly conserved genes whose mutations are known to have extensive pleiotropic consequences [18–20]. These transcription factors are also known to act as master regulatory switches in cascades involved in regulating the proper expression of many downstream, developmentally important effectors . Another example is the gene regulatory feedback loop required for endoderm specification in echinoderms, which encodes several transcription factors whose inactivation has catastrophic effects on the entire body plan [17, 22]. These instances highlight the strength of purifying selection acting on specific genes known to be involved in complex developmental regulatory networks; however a more recent interest has concerned the broader evolutionary patterns of the genome with respect to ontogeny.
The evolutionary dynamics of genes expressed over the course of development have recently been examined at the genomic level in the case of flies and nematodes, using microarray-based information about the developmental timing of gene expression [23–25]. Castillo-Davis and Hartl  used previously published, developmental stage-specific microarray data  in order to compare the rates of coding sequence divergence of a relatively small number of genes (224) between Caenorhabditis elegans and C. briggsae (20 million to 120 million years diverged (MYD)). Genes in their dataset were classified either as 'non-modulated' genes (that is, invariant in expression level over development), early-expressed genes (that is, embryonic), or late-expressed genes (that is, larval and adult) based on the developmental stage at which their peak level of expression occurred. The authors found no significant difference in the rates of protein evolution among the three categories, though the early-expressed genes showed a higher rate of synonymous substitution as well as a lower codon usage bias (CUB) than late-expressed genes. The analysis of the same 2 species was subsequently refined by Cutter and Ward  using a larger dataset of 7,281 genes and a larger source of developmental expression data [27, 28]. Their results support some theoretical predictions of both the developmental constraint as well as Darwin's 'selection opportunity' hypothesis: when genes were classified based on the stage at which their peak expression level occurred, adult genes were found to be evolving more rapidly than those in the earlier, larval stage. Expression level in the larval stage, relative to the adult, was also found to be negatively correlated with sequence divergence, while the opposite was observed for expression in adults. However, the authors noted no unidirectional trend in evolutionary rates in genes expressed over the course of embryogenesis, as would be predicted by the developmental constraint hypothesis, leading them to suggest that constraint may not explain the evolutionary rates of proteins expressed during embryonic development in these species. Furthermore, when examining the tissue specificity of genes expressed in adult nematodes, the authors found that the majority, though not all, of the acceleration in evolutionary rate observed in this stage was explained by genes expressed primarily in the male gametes, providing evidence of a significant effect of sexual selection, presumably acting through sperm competition between males and hermaphrodites or antagonistic coevolution between genes expressed in sperm and oocytes .
Davis et al.  used the results of a microarray study of the expression levels of 4,028 genes over the course of Drosophila melanogaster ontogeny  and examined their rates of sequence divergence between D. melanogaster and D. pseudoobscura (25 to 55 MYD). They noted that gene expression level in the late embryo relative to later stages was negatively correlated with sequence divergence, while the opposite was observed in the case of adult males. However, the authors noted no significant correlation between expression levels and sequence divergence for the many of the sampled developmental stages. Unfortunately the species pairs used in both of these studies were quite distantly diverged and thus interpretation of these data is limited due to the saturation of synonymous site divergence (d S), which largely prevents investigation of questions regarding evidence of selection [30, 31]. Furthermore, comparisons at such evolutionary distances allow the possibility that expression patterns (for example, time of expression, sex bias, and so on) have diverged between species, questioning whether similar selective pressures are acting along both lineages at the level of individual genes .
Holometabolic insects such as Drosophila provide an excellent model for studying gene evolution over ontogeny as they pass through four separate, unambiguous developmental stages (embryo, larva, pupa, and adult). A large body of information about the evolutionary dynamics of the genomes of drosophilids has accumulated, aided significantly by the recent release and analysis of the complete genomes of 12 Drosophila species . However, the relationship between development and genomic evolution remains largely unexplored. Here, we analyze a larger dataset than was previously available, using information generated from publicly available developmental stage-specific expressed sequence tag (EST) libraries to assign genes to specific developmental stages and determine their evolutionary patterns within the D. melanogaster group, allowing more reliable estimates of divergence parameters as well as reducing the caveats associated with comparing distantly related species . We report a gradient of increasing mean evolutionary rate in genes expressed in subsequent stages of fly development, culminating in exaggerated gene sequence divergence specifically in adult males. When comparing genes expressed specifically in the gonads of embryos to adults, we found that the increased rate of divergence observed in adults is explained entirely by those genes expressed in the testis. No such pattern of accelerated gene divergence is observed in the embryonic gonads, supporting Darwin's expectations that selection pressures should act predominantly in the stage where the opportunity for selection occurs . Finally, when classifying genes into specific developmental stages using a series of increasing stage-specificity thresholds, we found a significant correlation between specificity of temporal stage of expression and evolutionary rate. We also reanalyzed the dataset used by Davis et al.  using our methods in order to refine their estimates of divergence and test the generality of their results (Additional files 1, 2, and 3). Taken together, our results support both developmental constraint acting to limit the divergence of early expressed, developmentally important genes [5, 8], as well as the notion that accelerated divergence in adults is primarily due to increased selection pressures occurring during this stage.
Analysis of the EST library-based developmental profile
Number of genes classified into each category according to the proportion of representation specificity thresholds used to classify the expressed sequence tag (EST) data
Stage/tissue – gonads combined:
Stage/tissue – gonads separated:
As a test of our assumption that a gene's highest stage of expression is also the stage during which the majority of its functions occur, we performed pairwise comparisons of the lists of genes classified at each stage for each specificity threshold using FatiGO [36, 37] (Additional file 5). We found that certain 'biological process' gene ontology (GO) terms associated with temporal-specific functions were consistently over-represented among genes classified into the stage(s) during which such functions were expected to occur. For instance, in the embryogenic versus adult comparison, terms associated with development and regulation (for example, 'regulation of biological process' (GO:0050789), and 'multicellular organismal development' (GO:0007275)) were consistently over-represented among genes classified as embryonic, while terms associated with detection and response to external stimuli were over-represented among genes classified as adult (for example, 'detection of stimulus' (GO:0051606), and 'response to abiotic stimulus' (GO:0009628)). Similar trends were observed in the comparison between the combined larval and pupal stages versus the adult stage, where for example, the term 'post-embryonic development' (GO:0009791) was over-represented among larval/pupal genes, as expected. In the comparison between the embryonic versus larval/pupal stages terms associated with regulation (for example, 'regulation of biological process' (GO:0050789)) tend to be over-represented among embryonic genes while those associated with energy metabolism (for example, 'generation of precursor metabolites and energy' (GO:0006091) and 'carbohydrate metabolic process' (GO:0005975)) tend to be over-represented in the larval/pupal stage, as may be expected given the large amount of organismal growth occurring during the larval stage . Curiously, the term 'sexual reproduction' (GO:0019953) is consistently over-represented among genes classified as being specific to the embryonic and larval/pupal stages as compared to the adult stage (Additional file 5). These genes may be associated with organogenesis of sexual organs, which occurs prior to adulthood, or with spermatogenesis, which begins in the third instar larval stage . However, in general, terms were over-represented in pairwise comparisons in the expected direction, providing support to our assumption of an association between expression level and temporal function.
The selection opportunity hypothesis  predicts not only that the average rate of change must increase over developmental stages, but also that the proportion of genes showing evidence of positive selection should increase with subsequent developmental stages . We tested this prediction by performing pairwise comparisons of the proportion of genes showing significant evidence of positive selection using the comparison between models 7 and 8 in phylogenetic analysis by maximum likelihood (PAML)  according to the Drosophila 12 Genomes Consortium data , at each stage and for each specificity threshold (Additional file 4). After applying a Bonferroni correction for multiple tests, we found no significant differences in the proportion of genes showing evidence of positive selection in any pairwise comparisons between stages (Additional file 9).
Stage specificity of selection pressure
A key postulate of the selection opportunity hypothesis  is that the effects of late-stage acting selective pressures primarily affect features specific to the stage at which they occur. As a test of this hypothesis, we sought to compare the effect of expression of genes within gonads relative to those expressed in the rest of the body at the two stages in which we had tissue-specific EST library representation information: embryo and adult. Genes were separately classified into either four different stage/tissue categories (embryonic general, embryonic gonads, adult general, and adult gonads) or five tissue categories (wherein the adult gonad library is separated into adult ovary or adult testis) (see Methods, Table 1 and Additional file 4). It should be noted that the 'embryonic general' class was generated from whole-body tissue (including the gonads), and also that in the generation of the embryonic gonad EST libraries individuals were not sexed, and thus the ESTs reflect undifferentiated gonads pooled from both sexes .
As in the case of genes classified into specific stages, we performed pairwise comparisons of the proportion of genes showing evidence of positive selection for each tissue/stage and for each specificity threshold. Again, no comparisons were statistically significant after Bonferroni correction, with the sole exception that genes classified as unique to the adult testis have a significantly higher proportion of genes showing evidence of positive selection than genes classified as unique to the adult general category (χ2 value = 8.76, df = 1, P = 0.0308) (Additional file 9).
Gene interaction profiles during development
Average number of interactions (95% confidence intervals (CIs)) per stage and per gonadal or non-gonadal categories in the embryonic and adult stages
Greater than twofold
7.884 to 9.107
8.097 to 9.729
6.566 to 8.240
5.611 to 7.915
6.291 to 7.619
5.713 to 7.201
Stage/tissue (adult gonads combined):
8.038 to 9.483
8.219 to 10.339
7.176 to 10.039
6.039 to 8.476
5.947 to 7.453
5.134 to 7.018
6.128 to 7.603
5.639 to 7.453
Stage/tissue (adult gonads separated):
8.136 to 9.784
8.264 to 10.749
7.086 to 10.346
6.060 to 9.077
5.698 to 7.278
5.104 to 7.073
6.683 to 8.419
6.298 to 8.722
6.024 to 7.590
5.758 to 7.769
When comparing the average number of interactions per gene between gonadal and non-gonadal tissues in the adult and embryonic stages, we observed significantly fewer interactions in both the adult non-gonad and adult gonad categories as compared to the embryonic general category at no specificity threshold and a greater than twofold proportion of representation threshold (P < 0.05). The embryonic gonad category showed a significantly higher mean number of interactions than both adult general and adult gonad categories only when no specificity threshold was used in classification (P < 0.05 after Bonferroni correction). No other pairwise comparisons of mean number of interactions per gene were statistically significantly different, including both within-stage comparisons of gonadal to non-gonadal tissue. When adult gonads were separated into either ovary or testis-specific genes, only genes classified as testis specific had significantly fewer mean interactions (P < 0.05 at no specificity threshold and a greater than twofold proportion of representation threshold). We then reanalyzed the data using only direct protein-protein interactions and again, results were qualitatively similar, though no pairwise comparison was statistically significant after Bonferroni correction when adult ovaries and testes were classified separately (with the sole exception of the embryo general category which shows a significantly higher mean number of interactions than the adult general category using no specificity threshold, P = 0.0180). Also similarly, limiting our analysis to 'high-confidence' interactions resulted in most of the significant comparisons to becoming non-significant, likely owing to the smaller total number of interactions as compared to the total dataset (Additional file 13).
Previous studies have demonstrated a significant negative correlation between the total number of interactions in which genes were involved and their rate of evolution [49, 50]. Given our observation that increased stage specificity was positively correlated with evolutionary rate, we tested for a significant correlation between the number of stages in which a gene was represented and its number of interactions. We found a significant positive correlation between the number of stages in which genes are represented and the number of interactions in which they are involved (Kendall rank sum correlation test τ = 0.0848, P = 4.501 × 10-12).
Our study provides molecular confirmation of two different but non-mutually exclusive hypotheses seeking to explain Von Baer's 'Third Law', noting that morphological similarity among organisms tends to decrease over ontogeny . Our findings consist of (1) evidence for stronger purifying selection during embryonic development as predicted by the modern developmental constraint hypothesis [3, 5], (2) evidence for selection-driven accelerated divergence of genes in the adult stage, exemplified by those expressed in males, as predicted by Darwin , and (3) the existence of a temporal pleiotropy restricting the divergence of genes that are broadly expressed over the course of development.
Expression patterns across the Drosophila phylogeny
All developmental and spatial representation of gene expression information in our study is based on data collected in D. melanogaster, therefore an underlying assumption is made that developmental and spatial expression patterns, or more specifically that the stage/tissue of highest expression level, do not vary significantly among species of the D. melanogaster subgroup. While several studies have shown considerable variation in expression levels between species at the adult stage [51, 52], to our knowledge, there are no studies that have directly compared expression levels between species over development on a large scale. A study conducted by Rifkin et al.  found that approximately 17% of genes surveyed (2,193/12,866) had significant differences between species in the degree to which genes in expression pattern changed during the onset of metamorphosis in D. melanogaster, D. simulans, and D. yakuba. However, it is unclear if such changes imply that the stage of highest level of expression changes between species. Regardless, if patterns of expression varied considerably between the species used in our study, we would expect this to add noise to the evolutionary signals we observed rather than produce systematic biases in our dataset.
Divergence patterns over development
The results of our analysis indicate that sequence follows the pattern observed in morphology over the course of development: we observed a positive gradient in the rates of divergence (d N and d N/d S) in subsequent stages of ontogeny (Figure 1, Additional file 1). However, in the case of the synonymous rate of substitution, d S is highest in adults and lowest in the larval/pupal stage (that is, larval/pupal < embryonic < adult) (Additional file 8). These observations are consistent with either (a) systematic variation in the level of codon usage bias between developmental stages, or (b) a systematic difference in the rate of mutation between stages of development. A recent study performed by Vicario et al.  confirmed that CUB varies significantly among developmental stages when estimated in both D. melanogaster and D. pseudoobscura. Furthermore, the pattern of variation in CUB that they observed (adult < embryonic < larval) mirrors the rate of synonymous substitution measured at each stage in our study, consistent with CUB being responsible for the patterns of variation in d S that we observe (that is, high CUB reduces d S by selecting against substitutions generating non-optimal codons) . A similar analysis of the Codon Adaptation Index  using codonW  on our dataset agreed with Vicario et al.'s results (data not shown) . While it is not possible to rule out the hypothesis of different mutation rates affecting genes expressed in different stages of ontogeny, the non-concordance between the patterns observed in the synonymous and non-synonymous rates of substitution, d S and d N, indicates that differential mutation rate alone is insufficient to explain the positive gradient of divergence in d N and d N/d S observed over ontogeny. However, a gradient in these divergence rates over development is predicted by both the developmental constraint and selection opportunity hypotheses and thus evidence supporting either or both will be considered below.
Embryonic developmental constraint
Supporting the developmental constraint hypothesis, we observed an increased mean number of interactions per gene among genes showing their highest level of expression in the embryonic stage when compared to those specific to other stages (Table 2). This is consonant with the notion that the products of genes expressed in this stage are involved in a greater number of highly connected regulatory networks, and are thus constrained in their divergence due to the cascading effects of deleterious mutations [15, 16]. We observed that genes classified as specific to the embryonic gonadal category were involved in significantly more interactions than those specific to the adult gonads, suggesting that lack of pleiotropy-mediated constraint may play some role in explaining the tolerance for evolutionary divergence of adult gonad specific genes when compared to those of other tissues and stages. This is particularly so in the case of the testis (Additional file 11).
A potential caveat to such analysis could occur if the majority of interaction studies in Drosophila were performed with the intention of identifying interactions in the embryo, thus biasing the data in favor of a greater number of embryo-specific gene interactors. However, when we limited our analysis to interactions derived from yeast two-hybrid experiments using gene predictions from the whole Drosophila melanogaster genome [48, 57], our results remained qualitatively unchanged, suggesting that our dataset is not significantly biased towards any specific stage. It should be noted that the yeast two-hybrid technique is known to generate a large number of false positive predictions of protein-protein interactions (reviewed in ). However, in order for such false positives to have a significant effect in biasing our data, it would require that the whole genome yeast two-hybrid studies from which the interaction data are derived [48, 57] preferentially produce false positives among genes expressed at their highest level in the embryonic stage. A large number of interactions in BioGRID's database are not derived from yeast two-hybrid studies, and limiting our analysis to these studies supports the results observed from the analysis of the entire dataset (data not shown). However, it is likely that interactions derived from these genetic studies are biased towards experiments conducted during embryogenesis, and thus such observations should be interpreted with caution.
Noting that very early ontogenic processes such as gastrulation can show considerable divergence among closely related species, Raff  suggested that developmental constraint may imperfectly reflect the sequence of organismal ontogeny, but rather that the constraining effects of pleiotropy should be highest during those developmental stages showing the least amount of modularity, or disassociation, between regulatory pathways. It is possible that, given the large scale morphogenesis that occurs during both embryogenesis and metamorphosis in Drosophila, more genes expressed during the embryonic and pupal stages occur in highly interconnected regulatory networks and thus are constrained by greater pleiotropy than those specific to the larval and adult stages. However, our analysis of the mean number of interactions of genes classified into the pooled larval and pupal stages found no significant difference when compared to genes classified into the adult stage (Table 2, Additional file 13). While this may be an effect of larval stage genes obscuring the signal of a greater number of interactions in the metamorphosis stage, this seems unlikely as under the strict predictions of the developmental constraint hypothesis, larval genes should be, on average, more conserved than those of the subsequent metamorphosis stage and therefore possibly involved in more interactions. Unfortunately, separate larval-derived and pupal-derived EST libraries will be required to answer such concerns. It should be noted that Arbeitman et al.  observed that the transcriptomes of the embryonic and pupal stages are more similar to one another than either is to the larval or adult, suggesting that many genes classified as embryonic specific may have important functions in metamorphosis.
Selection opportunity and adult divergence
Unlike the developmental constraint hypothesis, which predicts that the gradient in divergence rates observed over ontogeny is produced by relaxed selective constraint occurring on genes expressed in later stages, Darwin's selection opportunity hypothesis argues that this gradient is driven by positive selection . Unfortunately, an increase in d N and d N/d S over development, as we observed, is consistent with both positive selection and relaxed selective constraint. However, as part of the predictions of the selection opportunity hypothesis, we should also observe an increase in the proportion of positively selected genes in later stages of development . When examining the proportion of genes showing evidence of positive selection among our three developmental stages, the differences between stages were not statistically significant (Additional file 9). It should be noted however, that the number of genes in our dataset showing significant evidence of positive selection was quite small (359 out of 7,180 genes classified under no specificity threshold) and may represent too limited a dataset from which to draw statistically meaningful conclusions. While this may suggest that our results do not support Darwin's hypothesis, it is interesting that our study of both EST and microarray-based datasets noted that the accelerated rate of evolution observed in the adult stage is explained by the rapid evolution of male-biased genes and, more specifically, those expressed in the testis (Figure 2, Additional files 1 and 2). This result is consistent with previous morphological studies conducted within the D. melanogaster species complex that found that sexual traits (for example, genital arch area, testes length and area) show consistent, statistically significant differences between species, whereas non-sexual traits (for example, wing length and width, tibia and femur length, and malpighian tubules length and area) do not . Numerous studies have found that genes involved in sex and reproduction diverge rapidly under the effect of positive selection [60–64] and, more specifically, that genes with sex-biased expression show greater evidence of positive selection than non-sex biased genes [65, 66]. Thus there appears to be evidence that the accelerated evolution observed in later stages of development is driven by unique selective pressures such as sexual selection (but see also [67, 68] for examples of theory and empirical evidence suggesting relaxed selective constraint has a large effect in explaining the rapid evolution of genes with sex-limited expression).
Darwin's hypothesis that selection opportunity increases over the course of ontogeny also requires that the effects of selective pressure should only be observed at the stage in which the pressure occurs, and for which he presented secondary sexual traits as an example . While few studies have analyzed the rate of evolution of embryonic genes [23–25], numerous analyses have shown that adult traits and genes involved in reproduction, particularly in male reproductive organs, often evolve at accelerated evolutionary rates when compared to most other tissues [12, 36, 41, 60–65]. As expected, we observed that genes expressed in the pooled gonads of the adult fly are evolving more rapidly than non-gonadal adult tissue (Figure 2a, Additional file 11). In the case of the pooled embryonic gonads, under all specificity thresholds where the differences were statistically significant, genes classified as embryonic gonad specific are evolving less rapidly than whole embryonic tissue. Thus the situation of accelerated evolution of gonad specific genes in the adult is reversed in the embryo, suggesting that the selective forces occurring in the adult reproductive stage are acting primarily on genes expressed at that stage; or at least are not affecting the embryonic stage.
Temporal pleiotropy and protein evolution
A negative correlation between breadth of gene expression and protein divergence has been observed in taxa as distant as primates and flies [40–42] suggesting the existence of a broadly applicable mechanism constraining the divergence of genes expressed in multiple tissues. The most plausible of such mechanisms is negative selection against the deleterious pleiotropic effects engendered from mutations occurring in highly connected genes [49, 50, 69]. Our data suggest that such a model should be extended to include temporal pleitotropy to the well supported spatial pleiotropy observed in previous studies. We observed a clear pattern of increasing evolutionary divergence (in both d N and d N/d S) with increasing stage specificity of representation (Figure 1, Additional file 7), suggesting that genes expressed ubiquitously over the course of development are subject to similar, pleiotropy-mediated evolutionary constraints as genes that are ubiquitously expressed across different tissue types [40–42]. Furthermore, our observation of a significant positive correlation between the number of stages at which genes were represented and the average number of interactions in which these genes are involved strongly suggests that temporally ubiquitously expressed genes are involved in a greater number of cellular and organismal functions than their stage specific counterparts, and could thus be under more restricted evolutionary divergence due to the large effect of deleterious mutations at these loci.
Gene evolutionary rate estimates
All estimates of gene evolutionary rates were obtained from the Drosophila 12 Genomes Consortium Sequencing/Annotation Project  according to their PAML estimates  performed on six species of the D. melanogaster group: D. melanogaster, D. simulans, D. sechellia, D. yakuba, D. erecta, and D. ananassae . d N, d S, and d N/d S (ω in PAML) as calculated under model 0 were used in this analysis. For the EST library-based developmental profile (see below) the number of genes showing evidence of positive selection at each stage and for each stage/tissue category were obtained from the FDR corrected non-branch specific comparisons of models 7 and 8 in the Drosophila 12 Genomes Consortium dataset .
EST library-based developmental profile
We obtained information about the representation of all 7,180 genes in the Drosophila 12 Genomes Consortium  dataset that were found in all stage-specific D. melanogaster EST libraries in the National Center for Biotechnology Information (NCBI) UniGene database (release version 53) [35, 70]. EST libraries separately representing the larval and pupal stages were unavailable, therefore libraries were pooled into one of three developmental stage categories based on the stage from which they were generated: embryonic, larval/pupal, and adult. Genes were then classified into developmental stages based on the stage in which they showed their highest proportion of representation among sequenced ESTs (that is, the number of sequenced ESTs from each gene divided by the total number of ESTs sequenced in that stage's pooled libraries). Genes were reclassified into developmental stages using a series of arbitrarily chosen specificity thresholds, such that in order for a gene to be classified as specific to a stage, its highest proportion of representation had to occur at that stage and also exceed the proportion of representation at any other stage by a threshold of more than twofold, fourfold, or eightfold. Genes were also classified into a 'unique' category if they were represented only in libraries generated from a single stage, therefore producing a series of five separate sets of genes assigned to specific developmental stages (Table 1, Additional file 4).
EST libraries from the embryonic and adult stages were separated into those derived specifically from the gonads and those derived from whole embryos (including the gonads) in the case of the embryo, and from all other tissues (not including the gonads) in the case of the adult. Genes were then classified into embryonic general, embryonic gonads, adult general, and adult gonads as indicated above, using the same specificity thresholds. In the case of the adult stage, testis-derived and ovary-derived libraries were either classified together as 'adult gonads' or separated into 'adult testis' and 'adult ovary' categories. For the purposes of this comparison, all genes classified as larval/pupal-specific were ignored. The number of genes classified into each category and proportion of representation threshold from the EST analysis is shown in Table 1. In the comparison of adult and embryonic gonads and non-gonadal tissue, it is important to note that the numbers of genes classified into each category varies based on whether the adult gonads are combined or separated, especially at lower specificity thresholds, owing to the change in proportional representation introduced when the testis and ovary libraries are pooled.
FatiGO validation of EST-based classification
We obtained NCBI 'CG' numbers for all stage classified genes for which they were available (7,027 genes) using the 'symbol: symbol synonyms' tag in Flybase's  batch download feature. In the case where a Flybase gene (FBgn) was associated with multiple CG numbers, the CG number presented under the 'annotation symbol' heading of that FBgn's 'gene report' page was used. The few duplicate CG numbers occurring due to multiple FBgns linking to the same CG number were not removed. These duplicates most likely result from the splitting of what was originally a single gene into two when genome projects are reannotated. The list of CG numbers classified as specific to each stage were compared to one another using FatiGO[36, 37, 72], searching for over-representation of GO-biological processes in Drosophila melanogaster using a two-tailed Fisher exact test without duplicate filtering. Only significantly over-represented terms at GO levels 3 and 4 were collected.
Developmental profile of interactions
We collected protein and gene interaction data for the 4,422 genes from the EST dataset (Additional file 4) that were represented in the BioGRID database (release 2.0.36) [47, 73]. The total number of interactions, irrespective of the experimental methodology used to obtain them, that each gene was involved in was compiled and used in the analysis. We also compiled a dataset limited only to those interactions derived from yeast two-hybrid experiments for the purpose of ascertaining potential artifacts generated by biased stage sampling of genetic interactions (see Results) (Additional file 12). Finally, we also analyzed the dataset using only 'high-confidence' yeast two-hybrid interactions as defined by Giot et al.  (that is, those interactions with a confidence score greater than 0.5).
All statistical analyses were performed using the R statistical package . Permuted Kruskal-Wallis rank sum tests and 95% confidence intervals were computed using 10,000 permutations of the data using the 'coin' and 'boot' packages, respectively. Pairwise comparisons of the proportion of genes under positive selection were performed using χ2 tests. A Bonferroni correction for the effect of multiple tests was applied to all pairwise comparisons.
codon usage bias
- d N :
number of non-synonymous substitutions per non-synonymous site
- d S :
number of synonymous substitutions per synonymous site
expressed sequence tag
million years diverged
National Center for Biotechnology Information
Natural Sciences and Engineering Research Council of Canada
phylogenetic analysis by maximum likelihood.
We are grateful to Ben Evans, Alberto Civetta, Rob Kulathinal, and the three anonymous reviewers for their insightful comments on early versions of this manuscript. This work was funded by a Natural Sciences and Engineering Research Council of Canada (NSERC) postgraduate doctoral scholarship to CGA and an NSERC grant to RSS.
- Von Baer KE: Entwicklungsgeschichte der Tiere: Beobachtung und Relexion. 1828, Königsberg, Germany: BornträgerGoogle Scholar
- Gould SJ: Ontogeny and phylogeny. 1977, Cambridge, MA, USA: Harvard University PressGoogle Scholar
- Reidl R: Order in living organisms. 1978, New York, USA: John Wiley & SonsGoogle Scholar
- Richardson MK, Hanken J, Selwood L, Wright GM, Richards RJ, Pieau C, Raynaud A: Haeckel, embryos, and evolution. Science. 1998, 280: 983-986. 10.1126/science.280.5366.983c.View ArticlePubMedGoogle Scholar
- Raff RA: The shape of life: genes development, and the evolution of animal form. 1996, Chicago, IL, USA: The University of Chicago PressGoogle Scholar
- Seidle F: Körpergrundgestalt und Keimstruktur eine Erörterung über die Gundlagen der vergleichenden und experimentellen Embryologie un deren Gültigkeit bei phylogenetischen Übelegungen. Zool Anz. 1960, 164: 245-305.Google Scholar
- Sander K: The evolution of patterning mechanisms: gleaning from insect embryogenesis and spermatogenesis. Development and evolution. Edited by: Goodwin BC, Holder N, Wylie CC. 1983, Cambridge, MA, USA: Cambridge University Press, 137-159.Google Scholar
- Galis F, Metz JA: Testing the vulnerability of the phylotypic stage: on modularity and evolutionary conservation. J Exp Zoo. 2001, 291: 195-204. 10.1002/jez.1069.View ArticleGoogle Scholar
- Hall BK: Phylotypic stage or phantom: is there a highly conserved embryonic stage in vertebrates?. Trends Ecol Evol. 1997, 12: 461-463. 10.1016/S0169-5347(97)01222-6.View ArticlePubMedGoogle Scholar
- Darwin C: The descent of man, and selection in relation to sex. 1871, Princeton, NJ. USA: Princeton University PressView ArticleGoogle Scholar
- Darwin C: On the origin of species by means of natural selection, or, the preservation of favored races in the struggle for life. 1872, New York, USA: The Modern Library, 6Google Scholar
- Eberhard WG: Sexual selection and animal genitalia. 1985, Cambridge, MA, USA: Harvard University PressView ArticleGoogle Scholar
- Arthur W: A theory of the evolution of development. 1988, New York, USA: John Wiley & SonsGoogle Scholar
- Cutter AD, Ward S: Sexual and temporal dynamics of molecular evolution in C. elegans development. Mol Biol Evol. 2005, 22: 178-188. 10.1093/molbev/msh267.View ArticlePubMedGoogle Scholar
- Davidson EH, McClay DR, Hood L: Regulatory gene networks and the properties of the developmental process. Proc Natl Acad Sci USA. 2003, 100: 1475-1480. 10.1073/pnas.0437746100.PubMed CentralView ArticlePubMedGoogle Scholar
- Wittkopp PJ: Variable gene expression in eukaryotes: a network perspective. J Exp Biol. 2007, 210: 1567-1575. 10.1242/jeb.002592.View ArticlePubMedGoogle Scholar
- Davidson EH, Erwin DH: Gene regulatory networks and the evolution of animal body plans. Science. 2006, 311: 796-800. 10.1126/science.1113832.View ArticlePubMedGoogle Scholar
- Lewis EB: A gene complex controlling segmentation in Drosophila. Nature. 1978, 276: 565-570. 10.1038/276565a0.View ArticlePubMedGoogle Scholar
- Lutz B, Lu HC, Eichele G, Miller D, Kaufman TC: Rescue of Drosophila labial null mutant by the chicken ortholog Hoxb-1 demonstrates that the function of Hox genes is phylogenetically conserved. Genes Dev. 1996, 10: 176-184. 10.1101/gad.10.2.176.View ArticlePubMedGoogle Scholar
- Lemons D, McGinnis W: Genomic evolution of Hox gene clusters. Science. 2006, 313: 1918-1922. 10.1126/science.1132040.View ArticlePubMedGoogle Scholar
- Carroll SB: Homeotic genes and the evolution of arthropods and chordates. Nature. 1995, 376: 479-485. 10.1038/376479a0.View ArticlePubMedGoogle Scholar
- Hinman VF, Nguyen AT, Cameron RA, Davidson EH: Developmental gene regulatory network architecture across 500 million years of echinoderm evolution. Proc Natl Acad Sci USA. 2003, 100: 13356-13361. 10.1073/pnas.2235868100.PubMed CentralView ArticlePubMedGoogle Scholar
- Castillo-Davis CI, Hartl DL: Genome evolution and developmental constraint in Caenorhabditis elegans. Mol Biol Evol. 2002, 19: 728-735.View ArticlePubMedGoogle Scholar
- Cutter AD, Ward S: Sexual and temporal dynamics of molecular evolution in C. elegans development. Mol Biol Evol. 2005, 22: 178-188. 10.1093/molbev/msh267.View ArticlePubMedGoogle Scholar
- Davis JC, Brandman O, Petrov DA: Protein evolution in the context of Drosophila development. J Mol Evol. 2005, 60: 774-785. 10.1007/s00239-004-0241-2.View ArticlePubMedGoogle Scholar
- Hill AA, Hunter CP, Tsung BT, Tucker-Kellogg G, Brown EL: Genomic analysis of gene expression in C. elegans. Science. 2000, 290: 809-812. 10.1126/science.290.5492.809.View ArticlePubMedGoogle Scholar
- Baugh LR, Hill AA, Slonim DK, Brown EL: Composition and dynamics of the Caenorhabditis elegans early embryonic transcriptome. Development. 2003, 130: 889-900. 10.1242/dev.00302.View ArticlePubMedGoogle Scholar
- Reinke V, Gil IS, Ward S, Kazmer N: Genome-wide germline-enriched and sex-biased expression profiles in C. elegans. Mol Cell. 2004, 6: 605-616. 10.1016/S1097-2765(00)00059-9.View ArticleGoogle Scholar
- Arbeitman MN, Furlong EE, Imam F, Johnson E, Null BH, Baker BS, Krasnow MA, Scott MP, Davis RW, White KP: Gene expression during the life cycle of Drosophila melanogaster. Science. 2002, 297: 2270-2275. 10.1126/science.1072152.View ArticlePubMedGoogle Scholar
- Musters H, Huntley MA, Singh RS: A genomic comparison of faster-sex, faster-X, and faster-male evolution between Drosophila melanogaster and Drosophila pseudoobscura. J Mol Evol. 2006, 62: 693-700. 10.1007/s00239-005-0165-5.View ArticlePubMedGoogle Scholar
- Graur D, Li WH: Fundamentals of molecular evolution. 2002, Sunderland, CT, USA: SinauerGoogle Scholar
- Zhang Y, Sturgill D, Parisi M, Kumar S, Oliver B: Constraint and turnover in sex-biased gene expression in the genus Drosophila. Nature. 2007, 450: 233-237. 10.1038/nature06323.PubMed CentralView ArticlePubMedGoogle Scholar
- Drosophila 12 Genomes Consortium: Evolution of genes and genomes on the Drosophila phylogeny. Nature. 2007, 450: 203-218. 10.1038/nature06341.View ArticleGoogle Scholar
- Lachaise D, Cariou ML, David JR, Lemeunier F, Tsacas L, Ashburner M: Historical biogeography of the Drosophila melanogaster species subgroup. Evol Biol. 1988, 22: 159-226.Google Scholar
- Pontius JU, Wagner L, Schuler GD: UniGene: a unified view of the transcriptome. The NCBI handbook. 2003, Bethesda, MD, USA: National Center for Biotechnology InformationGoogle Scholar
- Al-Shahrour F, Minguez P, Tárraga J, Medina I, Alloza E, Montaner D, Dopazo J: FatiGO +: a functional profiling tool for genomic data. Integration of functional annotation, regulatory motifs and interaction data with microarray experiments. Nucleic Acids Res. 2007, 35: W91-6. 10.1093/nar/gkm260.PubMed CentralView ArticlePubMedGoogle Scholar
- Al-Shahrour F, Díaz-Uriarte R, Dopazo J: FatiGO: a web tool for finding significant associations of gene ontology terms with groups of genes. Bioinformatics. 2004, 20: 578-580. 10.1093/bioinformatics/btg455.View ArticlePubMedGoogle Scholar
- Vicario S, Mason CE, White KP, Powell JR: Developmental stage and level of codon usage bias in Drosophila. Mol Biol Evol. 2008, 25: 2269-2277. 10.1093/molbev/msn189.PubMed CentralView ArticlePubMedGoogle Scholar
- Hartenstein V: The atlas of drosophila development. 1993, Cold Spring Harbor, NY, USA: Cold Spring Harbor Laboratory PressGoogle Scholar
- Khaitovich P, Hellmann I, Enard W, Nowick K, Leinweber M, Franz H, Weiss G, Lachmann M, Pääbo S: Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science. 2005, 309: 1850-1854. 10.1126/science.1108296.View ArticlePubMedGoogle Scholar
- Haerty W, Jagadeeshan S, Kulathinal RJ, Wong A, Ravi Ram K, Sirot LK, Levesque L, Artieri CG, Wolfner MF, Civetta A, et al: Evolution in the fast lane: rapidly evolving sex-related genes in Drosophila. Genetics. 2007, 177: 1321-1335. 10.1534/genetics.107.078865.PubMed CentralView ArticlePubMedGoogle Scholar
- Larracuente AM, Sackton TB, Greenberg AJ, Wong A, Singh ND, Sturgill D, Zhang Y, Oliver B, Clark AG: Evolution of protein-coding genes in Drosophila. Trends Genet. 2008, 24: 114-123. 10.1016/j.tig.2007.12.001.View ArticlePubMedGoogle Scholar
- Good JM, Nachman MW: Rates of protein evolution are positively correlated with developmental timing of expression during mouse spermatogenesis. Mol Biol Evol. 2005, 22: 1044-1052. 10.1093/molbev/msi087.View ArticlePubMedGoogle Scholar
- Yang Z, Nielsen R: Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol Biol Evol. 2000, 17: 32-43.View ArticlePubMedGoogle Scholar
- Shigenobu S, Kitadate Y, Noda C, Kobayashi S: Molecular characterization of embryonic gonads by gene expression profiling in Drosophila melanogaster. Proc Natl Acad Sci USA. 2006, 12: 13728-13733. 10.1073/pnas.0603767103.View ArticleGoogle Scholar
- Wright TRF, (ed): Genetic regulatory hierarchies in development. 1990, Toronto, Canada: Academic PressGoogle Scholar
- Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M: BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006, 34: D535-9. 10.1093/nar/gkj109.PubMed CentralView ArticlePubMedGoogle Scholar
- Giot L, Bader JS, Brouwer C, Chaudhuri A, Kuang B, Li Y, Hao YL, Ooi CE, Godwin B, Vitols E, et al: A protein interaction map of Drosophila melanogaster. Science. 2003, 302: 1727-1736. 10.1126/science.1090289.View ArticlePubMedGoogle Scholar
- Fraser HB: Modularity and evolutionary constraint on proteins. Nat Genet. 2005, 37: 351-352. 10.1038/ng1530.View ArticlePubMedGoogle Scholar
- Lemos B, Bettencourt BR, Meiklejohn CD, Hartl DL: Evolution of proteins and gene expression levels are coupled in Drosophila and are independently associated with mRNA abundance, protein length, and number of protein-protein interactions. Mol Biol Evol. 2005, 22: 1345-1354. 10.1093/molbev/msi122.View ArticlePubMedGoogle Scholar
- Ranz JM, Castillo-Davis CI, Meiklejohn CD, Hartl DL: Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science. 2003, 300: 1742-1745. 10.1126/science.1085881.View ArticlePubMedGoogle Scholar
- Meiklejohn CD, Parsch J, Ranz JM, Hartl DL: Rapid evolution of male-biased gene expression in Drosophila. Proc Natl Acad Sci USA. 2003, 100: 9894-9899. 10.1073/pnas.1630690100.PubMed CentralView ArticlePubMedGoogle Scholar
- Rifkin SA, Kim J, White KP: Evolution of gene expression in the Drosophila melanogaster subgroup. Nat Genet. 2003, 33: 138-144. 10.1038/ng1086.View ArticlePubMedGoogle Scholar
- Akashi H: Gene expression and molecular evolution. Curr Opin Genet Dev. 2001, 11: 660-666. 10.1016/S0959-437X(00)00250-1.View ArticlePubMedGoogle Scholar
- Sharp PM, Li WH: The codon adaptation index – a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987, 15: 1281-1295. 10.1093/nar/15.3.1281.PubMed CentralView ArticlePubMedGoogle Scholar
- CodonW. [http://codonw.sourceforge.net]
- Stanyon CA, Liu G, Mangiola BA, Patel N, Giot L, Kuang B, Zhang H, Zhong J, Finley RL: A Drosophila protein-interaction map centered on cell-cycle regulators. Genome Biol. 2004, 5: R96-10.1186/gb-2004-5-12-r96.PubMed CentralView ArticlePubMedGoogle Scholar
- Hart GT, Ramani AK, Marcotte EM: How complete are current yeast and human protein-interaction networks?. Genome Biol. 2006, 7: 120-10.1186/gb-2006-7-11-120.PubMed CentralView ArticlePubMedGoogle Scholar
- Civetta A, Singh RS: Sex and speciation: genetic architecture and evolutionary potential of sexual versus nonsexual traits in the sibling species of the Drosophila melanogaster complex. Evolution. 1998, 52: 1080-1092. 10.2307/2411238.View ArticleGoogle Scholar
- Civetta A, Singh RS: High divergence of reproductive tract proteins and their association with postzygotic reproductive isolation in Drosophila melanogaster and Drosophila virilis group species. J Mol Evol. 1995, 41: 1085-1095. 10.1007/BF00173190.View ArticlePubMedGoogle Scholar
- Swanson WJ, Vacquier VD: The rapid evolution of reproductive proteins. Nat Rev Genet. 2002, 3: 137-144. 10.1038/nrg733.View ArticlePubMedGoogle Scholar
- Singh RS, Kulathinal RJ: Sex gene pool evolution and speciation – a new paradigm. Genes Genetic Sys. 2000, 75: 119-130. 10.1266/ggs.75.119.View ArticleGoogle Scholar
- Singh RS, Kulathinal RJ: Male sex drive and the masculinization of the genome. Bioessays. 2005, 27: 518-525. 10.1002/bies.20212.View ArticlePubMedGoogle Scholar
- Artieri CG, Haerty W, Gupta BP, Singh RS: Sexual selection and maintenance of sex: evidence from comparisons of rates of genomic accumulation of mutations and divergence of sex-related genes in sexual and hermaphroditic species of Caenorhabditis. Mol Biol Evol. 2008, 25: 972-979. 10.1093/molbev/msn046.View ArticlePubMedGoogle Scholar
- Pröschel M, Zhang Z, Parsch J: Widespread adaptive evolution of Drosophila genes with sex-biased expression. Genetics. 2006, 174: 893-900. 10.1534/genetics.106.058008.PubMed CentralView ArticlePubMedGoogle Scholar
- Baines JF, Sawyer SA, Hartl DL, Parsch J: Effects of x-linkage and sex-biased gene expression on the rate of adaptive protein evolution in Drosophila. Mol Biol Evol. 2008, 25: 1639-1650. 10.1093/molbev/msn111.PubMed CentralView ArticlePubMedGoogle Scholar
- Wade MJ: The evolutionary genetics of maternal effects. Maternal effects as adaptations. Edited by: Mousseau T, Fox C. 1998, Oxford, UK: Oxford University Press, 5-21.Google Scholar
- Cruickshank T, Wade MJ: Microevolutionary support for a developmental hourglass: gene expression patterns shape sequence variation and divergence in Drosophila. Evol Dev. 2008, 10: 583-590. 10.1111/j.1525-142X.2008.00273.x.View ArticlePubMedGoogle Scholar
- He X, Zhang J: Toward a molecular understanding of pleiotropy. Genetics. 2006, 173: 1885-1891. 10.1534/genetics.106.060269.PubMed CentralView ArticlePubMedGoogle Scholar
- UniGene: an organized view of the transcriptome. [http://www.ncbi.nlm.nih.gov/sites/entrez?db=unigene]
- Flybase: a database of Drosophila genes & genomes. [http://flybase.org/]
- Babelomics v3.1. [http://babelomics.bioinfo.cipf.es/]
- The BioGRID: general repository for interaction datasets. [http://www.thebiogrid.org]
- R Development Core Team: R: a language and environment for statistical computing. 2004, Vienna, Austria: R Foundation for Statistical ComputingGoogle Scholar
- Stark A, Lin MF, Kheradpour P, Pedersen JS, Parts L, Carlson JW, Crosby MA, Rasmussen MD, Roy S, Deoras AN, et al: Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures. Nature. 2007, 450: 219-232. 10.1038/nature06340.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.