Proliferation of Ty3/gypsy-like retrotransposons in hybrid sunflower taxa inferred from phylogenetic data
BMC Biology volume 7, Article number: 40 (2009)
Long terminal repeat (LTR) retrotransposons are a class of mobile genetic element capable of autonomous transposition via an RNA intermediate. Their large size and proliferative ability make them important contributors to genome size evolution, especially in plants, where they can reach exceptionally high copy numbers and contribute substantially to variation in genome size even among closely related taxa. Using a phylogenetic approach, we characterize dynamics of proliferation events of Ty3/gypsy-like LTR retrotransposons that led to massive genomic expansion in three Helianthus (sunflower) species of ancient hybrid origin. The three hybrid species are independently derived from the same two parental species, offering a unique opportunity to explore patterns of retrotransposon proliferation in light of reticulate evolutionary events in this species group.
We demonstrate that Ty3/gypsy-like retrotransposons exist as multiple well supported sublineages in both the parental and hybrid derivative species and that the same element sublineage served as the source lineage of proliferation in each hybrid species' genome. This inference is based on patterns of species-specific element numerical abundance within different phylogenetic sublineages as well as on signals of proliferation events present in the distributions of element divergence values. Using methods for dating paralogous sequences within a genome, we estimate that proliferation events in the hybrid species' genomes occurred approximately 0.5 to 1 million years ago.
Proliferation of the same retrotransposon major sublineage in each hybrid species indicates that similar dynamics of element derepression and amplification likely occurred in each hybrid taxon during their formation. Temporal estimates of these proliferation events suggest an earlier origin for these hybrid species than previously supposed.
The genomes of flowering plants are remarkably variable in nuclear DNA content, with >1,000-fold differences among some taxa [1, 2]. While differences in ploidy and large-scale segmental duplication account for some of this variability, differential accumulation (and loss) of mobile genetic elements, especially the class I transposable elements known as long terminal repeat (LTR) retrotransposons, represents an additional and important process through which genome size can vary between individual plant species [3, 4]. Plant LTR retrotransposons represent ancient lineages that are ubiquitous in plant genomes [5, 6] and can account for >70% of the nuclear DNA of some plant species . Transposition of these elements is via an RNA intermediate, which enables new copies to be synthesized, reverse transcribed and subsequently integrated into host chromosomal DNA. This mode of transposition can result in large-scale genome expansion because each intact and functional element can potentially give rise to numerous daughter copies.
Recent years have witnessed considerable advances in our understanding of LTR retrotransposons. For example, we now know that these elements are tremendously diverse at the sequence level (in plants) with many subgroup lineages existing within the superfamily types Ty1/copia-like (Pseudoviridae) and Ty3/gypsy-like (Metaviridae) [7–12], that proliferation of these elements can rapidly restructure host genomes [13–16], and that their distribution along chromosome arms can be either dispersed or localized [9, 17–19]. Recent work also suggests that LTR retrotransposons (and other major classes of transposable elements) may not exclusively represent selfish or junk DNA as previously supposed [20, 21] but that, on occasion, transposable elements may have indeed played a more substantial role in generating evolutionary novelty [22–33].
Despite recent advances, we still know surprisingly little regarding how and when these elements become active and proliferate in natural populations; the vast majority of elements remain transcriptionally and transpositionally quiescent during normal growth and development. Various forms of environmental and/or genomic stress have been hypothesized to influence activation. For example, hybridization between genetically differentiated populations and/or species is one means through which these elements are thought to become active [34–38], although activation and proliferation following hybridization is not observed universally [39, 40]. Exposure of plants to biotic and abiotic stresses such as bacterial and viral pathogens, phytopathogenic fungal extracts, wounding, protoplast isolation, and cell culture also has been shown to activate some LTR retrotransposons [41, 42]. While biotic and abiotic stressors may represent more universal agents of activation, much of the data supporting this conclusion comes from experiments conducted under unnatural laboratory conditions; the extent to which these same stresses (especially those that occur naturally) have led to activation and proliferation in natural populations remains unknown.
An especially fascinating case of LTR retrotransposon proliferation in plants involves three annual sunflower species of ancient hybrid origin. These species (Helianthus anomalus, Helianthus deserticola, and Helianthus paradoxus) have arisen independently via ancient hybridization events between the same two parental taxa (Helianthus annuus and Helianthus petiolaris) (Figure 1) [43–45]. Whereas both parental taxa have extensive natural ranges in North America, the three hybrid species are restricted to western and southwestern regions of the United States where they are locally adapted to abiotically extreme environments. The genomes of all three hybrid taxa have experienced spectacular proliferations of Ty3/gypsy-like LTR retrotransposons [15, 46], resulting in large-scale increases in nuclear DNA content . The evolutionary history of the hybrid species is especially noteworthy given that both hybridization and abiotic stress have been hypothesized to facilitate the activation and proliferation of LTR retrotransposons.
In the current report, we demonstrate that Ty3/gypsy-like LTR retrotransposons in sunflower are considerably heterogeneous at the sequence level, yet the same element sublineage has proliferated independently in each hybrid sunflower species. We demonstrate further that the ages of these proliferation events (and thus a lower bound on the hybrid species' origins) can be estimated by examining particular signatures of proliferation found in the hybrid species. Estimates by this method suggest that the hybrid species may be older than previously suggested.
Sequence variability of Ty3/gypsy-like retrotransposons in Helianthus hybrid and parental species
The diploid hybrid species possess composite genomes as a result of their hybrid origins [44, 48]. The elements that proliferated in the hybrid species' genomes are therefore derived from retrotransposon lineages originally present in the genomes of the parental species H. annuus and/or H. petiolaris. We surveyed sequence diversity in the two parental and three hybrid Helianthus species by amplifying a 520-bp region of the Ty3/gypsy-like rt domain-encoding region with degenerate primers followed by cloning and sequencing 92 to 108 amplification products per species. Analysis of these sequences revealed considerable diversity in each of the five Helianthus species, with pairwise sequence divergences ranging from 0% to 48.7% (H. annuus), 0% to 40.5% (H. petiolaris), 0% to 39.6% (H. anomalus), 0% to 59.9% (H. deserticola), and 0% to 36.4% (H. paradoxus). Proper reading frames were determined and all sequences translated to assess the frequency of potentially functional copies. Between approximately 20% (H. petiolaris) and 42% (H. deserticola) of sequences were found to possess indels and/or premature stop codons (Table 1), indicating that a sizable fraction of these elements are no longer likely to be capable of autonomous transposition. These percentages are likely to be underestimates given that we have sequenced only a fraction of the total interior coding region.
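The screen for potentially functional copies described above can be illustrated with a minimal Python sketch. It is not the procedure used in the study: the function names, the use of the standard genetic code, and the rule that a length difference from a reference that is not a multiple of three signals a frameshifting indel are all assumptions made for illustration.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, frame=0):
    """Translate a nucleotide sequence in the given frame; unknown codons become 'X'."""
    seq = seq.upper().replace("-", "")
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def classify(seq, ref_len, frame=0):
    """Flag in-frame stop codons and putative frameshifting indels.

    ref_len is the ungapped length of a hypothetical intact reference copy;
    a length difference that is not a multiple of three is treated as a frameshift.
    """
    ungapped = seq.upper().replace("-", "")
    has_stop = "*" in translate(ungapped, frame)
    frameshift = (len(ungapped) - ref_len) % 3 != 0
    return {"stop": has_stop, "frameshift": frameshift,
            "disrupted": has_stop or frameshift}


if __name__ == "__main__":
    # Toy sequences, not data from the study.
    print(classify("ATGGCCGGA", ref_len=9))  # intact
    print(classify("ATGTAAGGA", ref_len=9))  # premature stop
    print(classify("ATGGCGGA", ref_len=9))   # one-base deletion
```

A per-species percentage of disrupted copies would then be the fraction of clones for which `disrupted` is true.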
Phylogenetic analyses and sublineage-specific element numerical abundance
A phylogenetic analysis of elements derived from both parental species (H. annuus and H. petiolaris) identified multiple, well supported lineages, with sequences from both H. annuus and H. petiolaris present in each major lineage (Figure 2). The presence of elements from both parental species in each major lineage indicates that the origins of these Ty3/gypsy-like lineages predate the origins of the major clades in which H. annuus and H. petiolaris reside (see Figure 1). We propose the name Surge (for 'sunflower repetitive gypsy-like elements') for these Ty3/gypsy-like retrotransposons in Helianthus. In accordance with criteria put forth in  addressing family identification and naming of transposable elements, we assign the names Surge1 to Surge5 for lineages A to E in Figure 2, respectively.
Phylogenetic analyses on data sets including sequences from both parental species and a single hybrid species (that is, H. annuus + H. petiolaris + H. anomalus; H. annuus + H. petiolaris + H. deserticola; and H. annuus + H. petiolaris + H. paradoxus; Figure 3a–c, respectively) yielded similar results with respect to the distribution of sequences across major identified sublineages and additionally revealed that a single sublineage (shaded gray in Figure 3a–c; lineage E' in Figure 4; Table 1) consistently harbored a higher abundance of sequences derived from the hybrid species' genomes than from the parental species' genomes. This sublineage lies within a larger, well supported major lineage (designated as lineage E, or Surge5). This pattern of consistent differential abundance between parental and hybrid species of sequences in lineage E' presumably emerges because elements that are more common (that is to say, have proliferated) in the hybrid species' genomes are more frequently amplified by degenerate polymerase chain reaction (PCR) and have a higher likelihood of being cloned and sequenced. In phylogenetic analyses, these sequences group most closely with related sequences in the parental species' genomes from which they are likely derived. This pattern of consistent differential abundance between hybrid and parental taxa in sublineage E' was not observed for any other Ty3/gypsy-like lineage (Figure 4).
Proliferation events inferred from frequency distributions of divergence values
Signatures of transposable element proliferation in species' genomes also can be characterized through analysis of the distribution of divergence values between pairwise combinations of element sequences [50, 51]. This form of analysis relies on the fact that all daughter copies of transpositionally active elements are identical at the time of insertion but subsequently accumulate mutations independently. Peaks in the distribution of divergence values correspond to episodes of transposable element proliferation, with peaks associated with greater divergence representing more ancient proliferation events and peaks associated with lesser divergence representing more recent events.
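The divergence-distribution approach described above can be sketched in Python as follows. This is an illustrative from-scratch implementation of the Kimura two-parameter distance for pre-aligned sequences, not the software used in the study; the function names and the simple binning helper are assumptions.

```python
import math
from collections import Counter
from itertools import combinations

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(a, b):
    """Kimura two-parameter distance between two aligned sequences.

    Sites with gaps or ambiguous bases in either sequence are skipped.
    Raises ValueError if no comparable sites exist or the pair is saturated.
    """
    transitions = transversions = sites = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        sites += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1
        else:
            transversions += 1
    if sites == 0:
        raise ValueError("no comparable sites")
    p = transitions / sites    # proportion of transitional differences
    q = transversions / sites  # proportion of transversional differences
    return -0.5 * math.log((1 - 2 * p - q) * math.sqrt(1 - 2 * q))


def pairwise_divergences(seqs):
    """All pairwise K2P distances among a list of aligned sequences."""
    return [k2p_distance(a, b) for a, b in combinations(seqs, 2)]


def histogram(values, width):
    """Count values per bin; keys are bin indices (bin i covers [i*width, (i+1)*width))."""
    return dict(Counter(int(v / width) for v in values))
```

Applied to the sequences of one sublineage in one species, the histogram of `pairwise_divergences` is the kind of distribution in which peaks are interpreted as proliferation episodes.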
Phylogenetic analyses implicate a single Ty3/gypsy-like sublineage (sublineage E') as a candidate proliferative source lineage of Ty3/gypsy-like retrotransposon amplification in the hybrid species. Distributions of pairwise divergence values for sequences from within this sublineage are depicted in Figure 5a–e for the two parental and three hybrid Helianthus species. The program siZer  was employed to evaluate these distributions for evidence of significant features (peaks) (see Methods). Analyses of these distributions indicate strong support for a single large peak in the range 0.1 to 0.13 for H. annuus, H. petiolaris, H. anomalus, and H. deserticola and at approximately 0.07 for H. paradoxus. Strong support for smaller secondary peaks at lower divergence values (0.02 to 0.03) was additionally found for H. anomalus and H. deserticola, and strong support for two such additional secondary peaks (at 0.02 and 0.04) was detected for H. paradoxus. There was some (albeit much weaker) support for secondary features in the distribution of H. petiolaris values, though these were detected only over a very narrow range of binwidths. There was no support for secondary peaks in H. annuus. Peaks at lower divergence values suggest recent retrotransposon proliferation events and support our assertion (based on phylogenetic data) that sublineage E' is indeed a proliferative source lineage.
Timeframes for proliferation events indicated by these secondary peaks can be explored given that genome-level mutation rates for Helianthus have been estimated. In wild sunflowers, a silent site mutation rate has been estimated at 1.0 × 10⁻⁸ substitutions/site/year based on sequence comparisons in a large EST database coupled with fossil calibrations (M. Barker and L. Rieseberg, University of British Columbia, personal communication). It has been suggested, however, that mutation rates for LTR retrotransposons may be approximately twofold higher than silent site mutation rates for protein coding genes . Thus, utilizing a mutation rate of 2.0 × 10⁻⁸ to account for elevated sequence evolution of Ty3/gypsy-like retrotransposons, proliferation events indicated by peaks at 0.02 to 0.04 divergence are roughly estimated to have occurred some 0.5 to 1 million years ago. Timeframes for proliferation events indicated by more prominent primary peaks were not estimated because these features were found in both the hybrid and parental species. It is thus inferred that proliferations associated with these features predate the origins of the hybrid taxa.
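The arithmetic behind these age estimates can be checked with a short Python sketch. The text does not spell out the formula; the sketch assumes the conventional relation T = K / (2r), in which divergence K between two copies accumulates along both copies at rate r per site per year. Under that assumption the reported range is reproduced. The function name is hypothetical.

```python
def insertion_age_years(divergence, rate_per_site_per_year):
    """Age of a proliferation event, assuming T = K / (2r).

    The factor of two reflects the assumption that both copies in a pair
    have accumulated substitutions independently since insertion.
    """
    return divergence / (2.0 * rate_per_site_per_year)


if __name__ == "__main__":
    silent_site_rate = 1.0e-8        # substitutions/site/year, from the text
    ltr_rate = 2.0 * silent_site_rate  # twofold elevation assumed for LTR elements
    for k in (0.02, 0.04):
        age = insertion_age_years(k, ltr_rate)
        print(f"divergence {k:.2f} -> {age / 1e6:.2f} million years")
    # divergence 0.02 -> 0.50 million years
    # divergence 0.04 -> 1.00 million years
```

Using the unadjusted silent site rate instead would double both estimates, so the inferred ages are sensitive to the assumed rate elevation.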
Phylogenetic relationship of Surge1 to Surge5 elements to other plant Ty3/gypsy-like retrotransposons
Evolutionary relationships of Surge1 to Surge5 elements to other plant Ty3/gypsy-like retrotransposons were evaluated by phylogenetic analysis of aligned amino acid sequences of the rt domain. A single, full length sequence was randomly selected from each major lineage identified in Figure 2 and included in a phylogenetic analysis with Ty3/gypsy-like LTR retrotransposons isolated from the genomes of other plants. Figure 6 depicts one of three most parsimonious trees that differ only in the placement of the RIRE1/Athila clade relative to two other well supported clades (the first designated as 'class B' and the second consisting of Gorge2, RetroSor1, Cinful, and Wallabi). The Surge elements form a well supported monophyletic group within the class B Ty3/gypsy-like retrotransposons and were most closely related to elements isolated from Arabidopsis thaliana.
Discussion and conclusion
Ty3/gypsy-like retrotransposon proliferation in Helianthus hybrid taxa
Despite the ubiquity and abundance of LTR retrotransposons in plant genomes, our understanding of the dynamics of their proliferation and the consequences of proliferation events on host species evolution is surprisingly limited. Sunflower species in the genus Helianthus provide an excellent group for investigating the possible causes and potential consequences of LTR retrotransposon proliferation in an ecological and evolutionary context. In a previous report , we demonstrated that three ancient hybrid sunflower species have independently experienced massive proliferation of Ty3/gypsy-like LTR retrotransposons following their origins. The current study examines the dynamics of these proliferation events in light of the known relationships among the sunflower species investigated and the requisite condition that proliferative elements in the hybrid species are necessarily derived from lineages present in one or both parental species.
As is commonly observed in plant genomes, we found Ty3/gypsy-like retrotransposons to be considerably diverse at the sequence level, with multiple well supported phylogenetic lineages identified. Particular elements that undergo proliferation, however, are expected to be more abundant in a species' genome, and thus more frequently amplified, cloned, and sequenced via the degenerate PCR methodology employed in this study. Consistent with this expectation, element sequences in the same single sublineage were consistently more abundant numerically in each of the three hybrid species' genomes relative to the genomes of the parental species. This pattern is unlikely to have emerged stochastically via PCR drift given that the same pattern was observed for all hybrid taxa. Moreover, the cloning of degenerate PCR amplification products was conducted on pools of five independent PCR reactions per species, further reducing the likelihood of observing this pattern by chance. This pattern also cannot be attributed to variation in primer sequence specificity across the sunflower species because the degenerate primers used in this study were based on aligned amino acid sequences of several plant species (see Methods), with H. annuus (a parental species) as the sunflower representative. Additionally, our interpretation of this phylogenetic signal is corroborated through independent analyses of the frequency spectra of pairwise sequence divergences (Figure 5a–e).
That proliferation of the same sublineage of Ty3/gypsy-like retrotransposon has occurred independently in each of the three hybrid sunflower species is of considerable interest, and future work will examine this lineage in greater detail to determine whether transcriptional and/or transpositional activation can be detected in natural and/or greenhouse synthesized hybrids between the parental species H. annuus and H. petiolaris. It is noteworthy that elements within this sublineage also are fairly abundant in the parental species, indicating past amplification in the parental species as well. Based on limited sampling, however, these elements do not appear to be currently active transcriptionally in either the parental or hybrid taxa (RT-PCR data not shown), a result that lies in contrast to another study examining diversity and abundance of Ty3/gypsy-like elements in wild Iris species and their early generation hybrids .
Another potentially relevant factor in these proliferation events may be the demographic history of these sunflower hybrid species. Recent work  has demonstrated that several categories of transposable elements display differential patterns of distributional abundance and presumed activity among natural populations of Arabidopsis lyrata that have and have not experienced historical bottlenecks during postglacial recolonization into new geographical regions. Following arguments put forth previously [55, 56] the authors invoke weaker selection against transposable element activity in bottlenecked populations resulting from reductions in effective population sizes and the accompanying increased strength of genetic drift. It is conceivable that similar demographic forces may have acted in the Helianthus hybrid species given differences in habitat preferences between the hybrid and parental sunflower species and the founder event-like population structures that may have been associated with the hybrid species' origins.
While this study indicates clear patterns of retrotransposon proliferation events in the genomes of these sunflower hybrid species, some caveats need mention. First, it is unlikely that we have sampled the total Ty3/gypsy-like diversity in these Helianthus genomes. The sequence variability reported here is limited by the degeneracy of the primers employed. More comprehensive methods for uncovering the full range of retrotransposon subfamily diversity would require genome-level sequencing efforts. For example, by analyzing whole genome shotgun (WGS) libraries, Hawkins et al. identified three major subfamilies of Ty3/gypsy-like elements in Gossypium species, naming them Gorge1, 2, and 3. Similarly, utilizing available large sequence datasets for Oryza australiensis, Piegu et al. also characterized three major subfamilies of Ty3/gypsy-like elements. The sunflower Ty3/gypsy-like elements described in the current study are most closely related to the Gorge3 and Kangourou subfamilies of elements identified by Hawkins et al. and Piegu et al., respectively (Figure 6); this suggests that additional Ty3/gypsy diversity in Helianthus remains uncharacterized. A second caveat of this study, and related to the first, is that we have surveyed sequence variability of the more conserved reverse transcriptase domain-encoding region in the current study whereas our earlier report of proliferation in the hybrid species  was based on comparisons among parental and hybrid species of relative abundance (Southern blot) and absolute abundance (quantitative PCR) of the integrase domain-encoding region. Thus, while we assume we have documented proliferation of the same Ty3/gypsy-like subfamily in the current and earlier report, we cannot rule out the possibility that we have identified different subfamilies in these two studies and we currently lack the resolution to detect this possibility. 
This matter can be resolved through additional surveys of sequence variability and by isolating and sequencing the entire protein-coding interior regions for a diversity of these elements. A third caveat is that although the parental taxa that gave rise to the hybrid species are still extant, certain retrotransposon lineages could have been lost from the genomes of one or both parental species over evolutionary time; thus, the genomic composition of modern day H. annuus and H. petiolaris may differ in some fashion from that of the H. annuus and H. petiolaris individuals/populations that originally gave rise to the hybrid species.
Distributions of element divergence values and temporal estimates of proliferation events in the Helianthus diploid hybrid species
Distributions of divergence values for sequences within the candidate proliferative source lineage E' revealed strong evidence of secondary features (peaks) associated with lower values of divergence (Figure 5a–e) in the hybrid species, with no evidence of such peaks in H. annuus and only limited evidence of such peaks in H. petiolaris. This pattern is exactly that predicted under a scenario of element derepression and proliferation in the diploid hybrid taxa following or associated with their origins. The specific sets of sequences that give rise to these secondary features appear to differ among the three hybrid taxa (data not shown), providing further evidence of independent proliferation events in the three hybrid species. Evidence of peaks associated with lower divergence values in H. petiolaris was weak and observed only under a very narrow range of binwidths in analyses with the program siZer . Nonetheless, we cannot rule out recent activity of a lesser scale in this parental species.
Ty3/gypsy-like proliferation events in the hybrid species' genomes offer a unique opportunity to explore the temporal origins of these species given that proliferation events occurring in the hybrid taxa place a lower bound on their birth. Using methods for dating the ages of paralogous sequences within genomes [50, 51, 57], these proliferation events in the hybrid species are estimated to have occurred approximately 0.5 to 1 million years ago. These estimates suggest an earlier origin for the hybrid taxa than has been previously suggested based on microsatellite divergence data [58–60], but are largely consistent with more recent revised estimates based on EST sequence divergence data (L. Rieseberg, University of British Columbia, personal communication).
Retrotransposon proliferation and species evolution
An outstanding question remains how, if at all, transposable element proliferation may have contributed to evolutionary events that took place in this group of sunflowers. Might this represent an example of new species arising via hybridization and concomitant 'reassortments of repetitious DNAs', as envisioned originally by McClintock ? Earlier views by other prominent researchers suggested that transposable elements likely represent 'selfish' or 'parasitic' DNA sequences only, and that a major role in evolutionary processes of the host need not be invoked in order to explain their existence [20, 21]. More recently, however, several researchers have argued that transposable elements may indeed play a more prominent role in generating evolutionary novelty with potential effects on adaptive evolutionary change [22, 23, 27, 28, 30, 62–65]. The vast amounts of sequence data now available for several species appear to be supportive of this more recent view, especially with regard to the LTR retrotransposons. For example, genomic sequence data for several model species including Mus musculus, Caenorhabditis elegans, and Drosophila melanogaster indicate that LTR retrotransposon sequences (and fragments thereof) show a higher than expected prevalence of association with certain categories of genes, and that novel gene configurations can arise via new exons or spliced additions to existing exons [24–26]. In addition to generating evolutionary novelty via new (or modified) protein sequences, recent studies also suggest that LTR retrotransposons as well as other types of transposable elements may influence regulatory aspects of host genes [31–33].
Notwithstanding any contribution to evolutionary novelty in this group, it is also intriguing to ponder how these sunflower genomes have accommodated massive genomic expansions given the highly mutagenic nature of such large-scale proliferation events . Interestingly, in phylogenetic analyses with other plant Ty3/gypsy-like retrotransposons, Surge elements group within the 'class B' elements (as described by Marin and Llorens). Elements in this group possess an additional domain (a chromodomain) in their interior coding region that is involved in site-directed integration of elements into heterochromatin . The spatial scale of proliferation in the hybrid species conforms with expectations of site-directed insertion, as fluorescence in situ hybridization (FISH) experiments reveal that the vast majority of retrotransposon proliferation in the sunflower hybrid taxa has occurred in pericentromeric regions of chromosomes . The amenable nature of this group of sunflowers to experimental study and the excellent genomic resources now emerging for Helianthus should greatly facilitate future work on the likely causes and evolutionary consequences of retrotransposon proliferation in these species.
Seeds of all species under investigation were obtained from the United States Department of Agriculture (USDA) National Plant Germplasm System (Table 1). Seeds were germinated in the dark on moist filter paper in Petri dishes and germinated seedlings were then transferred to 10 cm pots and grown in the Kansas State University greenhouses until suitable size for harvesting of plant tissue. DNA was extracted using a DNeasy Plant Mini Kit (Qiagen, Valencia, CA, USA) following the manufacturer's instructions.
Degenerate primers (forward, 5'-GGACCTGCTGGACAAGGGNTWYATHMG-3' and reverse, 5'-CAGGAAGCCCACCTCCCKNWRCCARAA-3') were developed and used to amplify a 520 bp fragment of the Ty3/gypsy-like rt domain-encoding region from a single plant of each species. Degenerate primers were developed with the web-based program CODEHOP  and were based on aligned Ty3/gypsy-like reverse transcriptase (rt) amino acid sequences from sunflower  and the following additional elements: Dea1, Ananas; Del1, Lilium; and Legolas, Arabidopsis. PCR was performed on an MJ Research (Watertown, MA, USA) PTC-100 thermal cycler under the following conditions: 3 min at 94°C, followed by 31 cycles of 30 s at 94°C, 30 s at 45°C, and 60 s at 72°C, and a final extension of 3 min at 72°C. Individual PCR reactions each contained 5 ng DNA, 50 pmol of each primer, 1 unit of Taq polymerase, and a final concentration of 30 mM tricine, 50 mM KCl, 2 mM MgCl2, and 100 μM of each deoxynucleotide triphosphate (dNTP). For each sunflower species, five individual 25 μl PCR reactions were performed and pooled for further processing in order to reduce potential effects of PCR drift . Pooled products were gel purified using a QIAquick Gel Extraction Kit (Qiagen, Valencia, CA, USA) and cloned using the pGEM-T Vector System I (Promega, Madison, WI, USA). For each species, between 96 and 109 positive clones were sequenced using the M13 universal sequencing primer on an ABI 3730xl Genetic Analyzer. In a small number of instances two or more sequences obtained from the same species demonstrated 100% identity. Because a single amplicon variant cloned two or more times cannot be distinguished from independent amplification of identical elements with different chromosomal insertion points, only a single representative of identical sequences was retained so as not to bias interpretation in subsequent analyses (see Results).
Thus, for each species, between 92 and 108 unique sequences were retained for further analysis (Table 1). Sequence alignments were conducted with ClustalW  with subsequent manual adjustments. Phylogenetic analyses were conducted in PAUP* v.4.0b10  using the Kimura two-parameter model of sequence evolution. Sequences used in this study have been deposited in GenBank under accession numbers GQ366796–GQ367295.
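To make the degeneracy of the two primers given above concrete, the following Python sketch counts and enumerates the distinct oligonucleotides each degenerate primer represents, using the standard IUPAC nucleotide ambiguity codes. The function names are illustrative and the sketch is unrelated to how CODEHOP designs primers.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

FORWARD = "GGACCTGCTGGACAAGGGNTWYATHMG"  # from the Methods text
REVERSE = "CAGGAAGCCCACCTCCCKNWRCCARAA"  # from the Methods text


def degeneracy(primer):
    """Number of distinct sequences encoded by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(IUPAC[base])
    return count


def expand(primer):
    """List every non-degenerate sequence encoded by a degenerate primer."""
    return ["".join(p) for p in product(*(IUPAC[b] for b in primer.upper()))]


if __name__ == "__main__":
    print(degeneracy(FORWARD))  # 96
    print(degeneracy(REVERSE))  # 64
```

In both primers the degenerate positions are confined to the 3' end, with a fixed 5' portion.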
Evidence of recent retrotransposon proliferation events in the hybrid species was evaluated by examining distributions of pairwise divergence values for sequences derived from a candidate proliferative source lineage as suggested by our phylogenetic analyses. Peaks in the frequency distribution are interpreted as episodic events of transposition, with peaks associated with lower values of divergence representing more recent proliferation events. Pairwise divergence values among sequences were determined using MEGA version 4  under the Kimura two-parameter model of sequence evolution. In order to identify significant features (peaks) in these distributions, we utilized the program siZer . This program evaluates data over a wide range of binwidths and determines statistical support for features (peaks) in a distribution based upon regions of significant slope increase and decrease.
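The idea of examining a distribution over a range of smoothing scales can be illustrated with the Python sketch below. It is explicitly not siZer and performs no significance testing: it only applies a Gaussian kernel density estimate at several bandwidths and reports where local maxima appear, to show how apparent peaks depend on the degree of smoothing. The function names, grid limits, and the synthetic example values are assumptions for illustration and are not data from the study.

```python
import math


def gaussian_kde(values, grid, bandwidth):
    """Gaussian kernel density estimate of `values`, evaluated at each grid point."""
    norm = 1.0 / (len(values) * bandwidth * math.sqrt(2.0 * math.pi))
    return [
        norm * sum(math.exp(-0.5 * ((x - v) / bandwidth) ** 2) for v in values)
        for x in grid
    ]


def local_maxima(grid, density):
    """Grid positions where the density is higher than at its neighbours."""
    return [
        grid[i]
        for i in range(1, len(grid) - 1)
        if density[i - 1] < density[i] >= density[i + 1]
    ]


def modes_by_bandwidth(values, bandwidths, lo=0.0, hi=0.2, steps=201):
    """Map each bandwidth to the list of apparent modes found at that scale."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    return {h: local_maxima(grid, gaussian_kde(values, grid, h)) for h in bandwidths}


if __name__ == "__main__":
    # Synthetic divergence values (made up): a small cluster near 0.03
    # and a larger one near 0.12.
    synthetic = [0.03] * 5 + [0.12] * 20
    for h, modes in modes_by_bandwidth(synthetic, [0.01, 0.1]).items():
        print(h, [round(m, 3) for m in modes])
```

With little smoothing both clusters appear as separate modes, whereas heavy smoothing merges them into one. This is why a feature seen only over a narrow range of binwidths is treated as weakly supported.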
Bennett MD, Bhandol P, Leitch IJ: Nuclear DNA amounts in angiosperms and their modern uses – 807 new estimates. Ann Bot. 2000, 86: 859-909. 10.1006/anbo.2000.1253.
Bennett MD, Leitch IJ: Nuclear-DNA Amounts in Angiosperms. Ann Bot. 1995, 76: 113-176. 10.1006/anbo.1995.1085.
Kumar A, Bennetzen JL: Plant retrotransposons. Annu Rev Genet. 1999, 33: 479-532. 10.1146/annurev.genet.33.1.479.
Feschotte C, Jiang N, Wessler SR: Plant transposable elements: where genetics meets genomics. Nat Rev Genet. 2002, 3: 329-341. 10.1038/nrg793.
Voytas DF, Cummings MP, Koniczny A, Ausubel FM, Rodermel SR: copia-like retrotransposons are ubiquitous among plants. Proc Natl Acad Sci USA. 1992, 89: 7124-7128. 10.1073/pnas.89.15.7124.
Suoniemi A, Tanskanen J, Schulman AH: Gypsy-like retrotransposons are widespread in the plant kingdom. Plant J. 1998, 13: 699-705. 10.1046/j.1365-313X.1998.00071.x.
Flavell AJ, Smith DB, Kumar A: Extreme heterogeneity of Ty1-copia group retrotransposons in plants. Mol Gen Genet. 1992, 231: 233-242.
Friesen N, Brandes A, Heslop-Harrison JS: Diversity, origin, and distribution of retrotransposons (gypsy and copia) in conifers. Mol Biol Evol. 2001, 18: 1176-1188.
Pearce SR, Harrison G, Li D, Heslop-Harrison J, Kumar A, Flavell AJ: The Ty1-copia group retrotransposons in Vicia species: copy number, sequence heterogeneity and chromosomal localisation. Mol Gen Genet. 1996, 250: 305-315.
Konieczny A, Voytas DF, Cummings MP, Ausubel FM: A superfamily of Arabidopsis thaliana retrotransposons. Genetics. 1991, 127: 801-809.
Zhang X, Wessler SR: Genome-wide comparative analysis of the transposable elements in the related species Arabidopsis thaliana and Brassica oleracea. Proc Natl Acad Sci USA. 2004, 101: 5589-5594. 10.1073/pnas.0401243101.
Marin I, Llorens C: Ty3/Gypsy retrotransposons: Description of new Arabidopsis thaliana elements and evolutionary perspectives derived from comparative genomic data. Molecular Biology and Evolution. 2000, 17: 1040-1049.
SanMiguel P, Tikhonov A, Jin YK, Motchoulskaia N, Zakharov D, Melake-Berhan A, Springer PS, Edwards KJ, Lee M, Avramova Z, Bennetzen JL: Nested retrotransposons in the intergenic regions of the maize genome. Science. 1996, 274: 765-768. 10.1126/science.274.5288.765.
SanMiguel P, Gaut BS, Tikhonov A, Nakajima Y, Bennetzen JL: The paleontology of intergene retrotransposons of maize. Nat Genet. 1998, 20: 43-45. 10.1038/1695.
Ungerer MC, Strakosh SC, Zhen Y: Genome expansion in three hybrid sunflower species is associated with retrotransposon proliferation. Curr Biol. 2006, 16: R872-873. 10.1016/j.cub.2006.09.020.
Hawkins JS, Kim H, Nason JD, Wing RA, Wendel JF: Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. Genome Res. 2006, 16: 1252-1261. 10.1101/gr.5282906.
Heslop-Harrison JS, Brandes A, Taketa S, Schmidt T, Vershinin AV, Alkhimova EG, Kamm A, Doudrick RL, Schwarzacher T, Katsiotis A, et al: The chromosomal distributions of Ty1-copia group retrotransposable elements in higher plants and their implications for genome evolution. Genetica. 1997, 100: 197-204. 10.1023/A:1018337831039.
Presting GG, Malysheva L, Fuchs J, Schubert I: A Ty3/gypsy retrotransposon-like sequence localizes to the centromeric regions of cereal chromosomes. Plant J. 1998, 16: 721-728. 10.1046/j.1365-313x.1998.00341.x.
Santini S, Cavallini A, Natali L, Minelli S, Maggini F, Cionini PG: Ty1/copia- and Ty3/gypsy-like DNA sequences in Helianthus species. Chromosoma. 2002, 111: 192-200. 10.1007/s00412-002-0196-2.
Orgel LE, Crick FH: Selfish DNA: the ultimate parasite. Nature. 1980, 284: 604-607. 10.1038/284604a0.
Doolittle WF, Sapienza C: Selfish genes, the phenotype paradigm and genome evolution. Nature. 1980, 284: 601-603. 10.1038/284601a0.
Kidwell MG, Lisch DR: Transposable elements and host genome evolution. Trends in Ecology and Evolution. 2000, 15: 95-99. 10.1016/S0169-5347(99)01817-0.
Kidwell MG, Lisch D: Transposable elements as sources of variation in animals and plants. Proc Natl Acad Sci USA. 1997, 94: 7704-7711. 10.1073/pnas.94.15.7704.
Ganko EW, Greene CS, Lewis JA, Bhattacharjee V, McDonald JF: LTR retrotransposon-gene associations in Drosophila melanogaster. J Mol Evol. 2006, 62: 111-120. 10.1007/s00239-004-0312-4.
DeBarry JD, Ganko EW, McCarthy EM, McDonald JF: The contribution of LTR retrotransposon sequences to gene evolution in Mus musculus. Mol Biol Evol. 2006, 23: 479-481. 10.1093/molbev/msj076.
Ganko EW, Bhattacharjee V, Schliekelman P, McDonald JF: Evidence for the contribution of LTR retrotransposons to C. elegans gene evolution. Mol Biol Evol. 2003, 20: 1925-1931. 10.1093/molbev/msg200.
Miller WJ, McDonald JF, Nouaud D, Anxolabehere D: Molecular domestication – more than a sporadic episode in evolution. Genetica. 1999, 107: 197-207. 10.1023/A:1004070603792.
Miller WJ, McDonald JF, Pinsker W: Molecular domestication of mobile elements. Genetica. 1997, 100: 261-270. 10.1023/A:1018306317836.
McDonald JF, Matyunina LV, Wilson S, Jordan IK, Bowen NJ, Miller WJ: LTR retrotransposons and the evolution of eukaryotic enhancers. Genetica. 1997, 100: 3-13. 10.1023/A:1018392117410.
McDonald JF: Transposable elements: possible catalysts of organismic evolution. Trends Ecol Evol. 1995, 10: 123-126. 10.1016/S0169-5347(00)89012-6.
van de Lagemaat LN, Landry JR, Mager DL, Medstrand P: Transposable elements in mammals promote regulatory variation and diversification of genes with specialized functions. Trends Genet. 2003, 19: 530-536. 10.1016/j.tig.2003.08.004.
Jordan IK, Rogozin IB, Glazko GV, Koonin EV: Origin of a substantial fraction of human regulatory sequences from transposable elements. Trends Genet. 2003, 19: 68-72. 10.1016/S0168-9525(02)00006-9.
Feschotte C: Opinion – Transposable elements and the evolution of regulatory networks. Nature Reviews Genetics. 2008, 9: 397-405. 10.1038/nrg2337.
Waugh O'Neill RJ, O'Neill MJ, Marshall Graves JA: Undermethylation associated with retroelement activation and chromosomal remodelling in an interspecific mammalian hybrid. Nature. 1998, 393: 68-72. 10.1038/29985.
Labrador M, Farre M, Utzet F, Fontdevila A: Interspecific hybridization increases transposition rates of Osvaldo. Molecular Biology and Evolution. 1999, 16: 931-937.
Liu B, Wendel JF: Retrotransposon activation followed by rapid repression in introgressed rice plants. Genome. 2000, 43: 874-880. 10.1139/gen-43-5-874.
Kashkush K, Feldman M, Levy AA: Transcriptional activation of retrotransposons alters the expression of adjacent genes in wheat. Nat Genet. 2003, 33: 102-106. 10.1038/ng1063.
Fontdevila A: Genetic instability and rapid speciation: are they coupled?. Transposable elements and evolution. Edited by: McDonald JF. 1993, Dordrecht: Kluwer Academic Publishers, 242-253.
Kentner EK, Arnold ML, Wessler SR: Characterization of high-copy-number retrotransposons from the large genomes of the Louisiana iris species and their use as molecular markers. Genetics. 2003, 164: 685-697.
Robinson TJ, Wittekindt O, Pasantes JJ, Modi WS, Schempp W, Morris-Rosendahl DJ: Stable methylation patterns in interspecific antelope hybrids and the characterization and localization of a satellite fraction in the Alcelaphini and Hippotragini. Chromosome Research. 2000, 8: 635-643. 10.1023/A:1009294226213.
Wessler SR: Turned on by stress. Plant retrotransposons. Curr Biol. 1996, 6: 959-961. 10.1016/S0960-9822(02)00638-3.
Grandbastien MA: Activation of plant retrotransposons under stress conditions. Trends in Plant Science. 1998, 3: 181-187. 10.1016/S1360-1385(98)01232-1.
Rieseberg LH: Homoploid Reticulate Evolution in Helianthus (Asteraceae) – Evidence from Ribosomal Genes. American Journal of Botany. 1991, 78: 1218-1237. 10.2307/2444926.
Rieseberg LH: Hybrid origins of plant species. Annual Review of Ecology and Systematics. 1997, 28: 359-389. 10.1146/annurev.ecolsys.28.1.359.
Rieseberg LH, Raymond O, Rosenthal DM, Lai Z, Livingstone K, Nakazato T, Durphy JL, Schwarzbach AE, Donovan LA, Lexer C: Major ecological transitions in wild sunflowers facilitated by hybridization. Science. 2003, 301: 1211-1216. 10.1126/science.1086949.
Noor MA, Chang AS: Evolutionary genetics: jumping into a new species. Curr Biol. 2006, 16: R890-892. 10.1016/j.cub.2006.09.022.
Baack EJ, Whitney KD, Rieseberg LH: Hybridization and genome size evolution: timing and magnitude of nuclear DNA content increases in Helianthus homoploid hybrid species. New Phytologist. 2005, 167: 623-630. 10.1111/j.1469-8137.2005.01433.x.
Lai Z, Nakazato T, Salmaso M, Burke JM, Tang S, Knapp SJ, Rieseberg LH: Extensive chromosomal repatterning and the evolution of sterility barriers in hybrid sunflower species. Genetics. 2005, 171: 291-303. 10.1534/genetics.105.042242.
Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, Chalhoub B, Flavell A, Leroy P, Morgante M, Panaud O, et al: A unified classification system for eukaryotic transposable elements. Nature Reviews Genetics. 2007, 8: 973-982. 10.1038/nrg2165.
Piegu B, Guyot R, Picault N, Roulin A, Saniyal A, Kim H, Collura K, Brar DS, Jackson S, Wing RA, Panaud O: Doubling genome size without polyploidization: Dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. Genome Research. 2006, 16: 1262-1269. 10.1101/gr.5290206.
Hawkins JS, Hu GJ, Rapp RA, Grafenberg JL, Wendel JF: Phylogenetic determination of the pace of transposable element proliferation in plants: copia and LINE-like elements in Gossypium. Genome. 2008, 51: 11-18. 10.1139/G07-099.
Chaudhuri P, Marron JS: SiZer for exploration of structures in curves. Journal of the American Statistical Association. 1999, 94: 807-823. 10.2307/2669996.
Ma J, Bennetzen JL: Rapid recent growth and divergence of rice nuclear genomes. Proc Natl Acad Sci USA. 2004, 101: 12404-12410. 10.1073/pnas.0403715101.
Lockton S, Ross-Ibarra J, Gaut BS: Demography and weak selection drive patterns of transposable element diversity in natural populations of Arabidopsis lyrata. Proc Natl Acad Sci USA. 2008, 105: 13965-13970. 10.1073/pnas.0804671105.
Lynch M, Conery JS: The origins of genome complexity. Science. 2003, 302: 1401-1404. 10.1126/science.1089370.
Lynch M: The Origins of Genome Architecture. 2007, Sunderland, Massachusetts: Sinauer Associates, Inc.
Lynch M, Conery JS: The evolutionary demography of duplicate genes. Journal of Structural and Functional Genomics. 2003, 3: 35-44. 10.1023/A:1022696612931.
Gross BL, Schwarzbach AE, Rieseberg LH: Origin(s) of the diploid hybrid species Helianthus deserticola (Asteraceae). American Journal of Botany. 2003, 90: 1708-1719. 10.3732/ajb.90.12.1708.
Welch ME, Rieseberg LH: Patterns of genetic variation suggest a single, ancient origin for the diploid hybrid species Helianthus paradoxus. Evolution Int J Org Evolution. 2002, 56: 2126-2137.
Schwarzbach AE, Rieseberg LH: Likely multiple origins of a diploid hybrid sunflower species. Mol Ecol. 2002, 11: 1703-1715. 10.1046/j.1365-294X.2002.01557.x.
McClintock B: The significance of responses of the genome to challenge. Science. 1984, 226: 792-801. 10.1126/science.15739260.
McDonald JF: Evolution and consequences of transposable elements. Curr Opin Genet Dev. 1993, 3: 855-864. 10.1016/0959-437X(93)90005-A.
Lin R, Ding L, Casola C, Ripoll DR, Feschotte C, Wang H: Transposase-derived transcription factors regulate light signaling in Arabidopsis. Science. 2007, 318: 1302-1305. 10.1126/science.1146281.
Xiao H, Jiang N, Schaffner E, Stockinger EJ, van der Knaap E: A retrotransposon-mediated gene duplication underlies morphological variation of tomato fruit. Science. 2008, 319: 1527-1530. 10.1126/science.1153040.
Biemont C, Vieira C: Genetics: junk DNA as an evolutionary force. Nature. 2006, 443: 521-524. 10.1038/443521a.
Gao X, Hou Y, Ebina H, Levin HL, Voytas DF: Chromodomains direct integration of retrotransposons to heterochromatin. Genome Research. 2008, 18: 359-369. 10.1101/gr.7146408.
Stanton E, Ungerer M, Moore R: The genomic organization of Ty3/gypsy-like retrotransposons in Helianthus (Asteraceae) homoploid hybrid species. American Journal of Botany. 2009.
Rose TM, Schultz ER, Henikoff JG, Pietrokovski S, McCallum CM, Henikoff S: Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences. Nucleic Acids Res. 1998, 26: 1628-1635. 10.1093/nar/26.7.1628.
Wagner A, Blackstone N, Cartwright P, Dick M, Misof B, Snow P, Wagner GP, Bartels J, Murtha M, Pendleton J: Surveys of Gene Families Using Polymerase Chain-Reaction – PCR Selection and PCR Drift. Systematic Biology. 1994, 43: 250-261.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
Swofford DL: PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4. 2002, Sinauer Associates, Sunderland, Massachusetts
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.
Gross BL, Rieseberg LH: The ecological genetics of homoploid hybrid speciation. J Hered. 2005, 96: 241-252. 10.1093/jhered/esi026.
Saitou N, Nei M: The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987, 4: 406-425.
Xiong Y, Eickbush TH: Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 1990, 9: 3353-3362.
We thank Carolyn Ferguson, Takeshi Kawakami, Ying Zhen, Susan Rolfsmeier, Ziyi Wang, Madhav Nepal, and Loren Rieseberg for comments on an earlier draft. This work was supported by Kansas State University and National Science Foundation grant DEB-0742993 to MCU.
MCU conceived and designed the study. SCS and KMS generated the data. SCS and MCU analyzed the data. MCU wrote the manuscript.