- Research article
- Open Access
Systematics and plastid genome evolution of the cryptically photosynthetic parasitic plant genus Cuscuta(Convolvulaceae)
BMC Biologyvolume 5, Article number: 55 (2007)
The genus Cuscuta L. (Convolvulaceae), commonly known as dodders, are epiphytic vines that invade the stems of their host with haustorial feeding structures at the points of contact. Although they lack expanded leaves, some species are noticeably chlorophyllous, especially as seedlings and in maturing fruits. Some species are reported as crop pests of worldwide distribution, whereas others are extremely rare and have local distributions and apparent niche specificity. A strong phylogenetic framework for this large genus is essential to understand the interesting ecological, morphological and molecular phenomena that occur within these parasites in an evolutionary context.
Here we present a well-supported phylogeny of Cuscuta using sequences of the nuclear ribosomal internal transcribed spacer and plastid rps2, rbcL and matK from representatives across most of the taxonomic diversity of the genus. We use the phylogeny to interpret morphological and plastid genome evolution within the genus. At least three currently recognized taxonomic sections are not monophyletic and subgenus Cuscuta is unequivocally paraphyletic. Plastid genes are extremely variable with regards to evolutionary constraint, with rbcL exhibiting even higher levels of purifying selection in Cuscuta than photosynthetic relatives. Nuclear genome size is highly variable within Cuscuta, particularly within subgenus Grammica, and in some cases may indicate the existence of cryptic species in this large clade of morphologically similar species.
Some morphological characters traditionally used to define major taxonomic splits within Cuscuta are homoplastic and are of limited use in defining true evolutionary groups. Chloroplast genome evolution seems to have evolved in a punctuated fashion, with episodes of loss involving suites of genes or tRNAs followed by stabilization of gene content in major clades. Nearly all species of Cuscuta retain some photosynthetic ability, most likely for nutrient apportionment to their seeds, while complete loss of photosynthesis and possible loss of the entire chloroplast genome is limited to a single small clade of outcrossing species found primarily in western South America.
Between 150 and 200 species of Cuscuta have been described, and they are distributed widely on every continent except Antarctica . These parasites have no roots at maturity and their leaves are reduced to minute scales. As such, few morphological characters exist to distinguish and classify species outside of the flower and fruit. Style and stigma morphology, capsule dehiscence and corolla and calyx shape and size form the basis of existing monographical studies [1–3]. Engelmann  separated Cuscuta into three subgenera on the basis of style fusion and stigma shape. Members of subgenus Monogyna have the two styles fused for most or all of their length, and consist of thick-stemmed species that commonly parasitize trees and shrubs; subgenera Cuscuta and Grammica have free styles, with stigmas being globose in subgenus Grammica and elongate in subgenus Cuscuta (Figure 1). The last full monograph of the genus completed by Yuncker  recognized nine species in Monogyna, distributed primarily in Eurasia and Africa with one species, Cuscuta exaltata Engelmann, having a disjunctive distribution in the southern United States in the scrub habitat of Florida and Texas. The 28 species in subgenus Cuscuta recognized by Yuncker have native ranges restricted to, but widely distributed in, the Old World. Subgenus Grammica, with 121 species recognized by Yuncker, is almost completely limited to the New World, with a handful of exceptions in Asia, Africa and the Pacific islands, including Tasmania and Australia.
Engelmann  further divided each of the subgenera into sections based on stigma morphology and capsule dehiscence. Monogyna consists of two sections; the first, Callianche, contains only Cuscuta reflexa Roxburgh, defined by its elongated stigmas atop the fused styles. All other members of subgenus Monogyna are relegated to section Monogynella, which have shorter, stouter stigmas. All members of subgenus Monogyna possess a circumscissile capsule as the fruit. Subgenus Cuscuta is subdivided into four sections. Section Cleistococca has only one species, Cuscuta capitata Roxburgh, which is distinguished from all other members of subgenus Cuscuta by having an indehiscent capsule as its fruit. Fruits of sections Pachystigma and Epistigma are only irregularly circumscissile, and fruits of section Eucuscuta are always cleanly dehiscent. Section Pachystigma is distinguished from section Epistigma by the presence of long, slender styles topped by wider stigmas, whereas members of section Epistigma possess only short to undetectable styles topped by the elongated stigmas. The six species of Pachystigma are restricted to Southern Africa, while the four species of Epistigma and Cuscuta capitata are restricted to central Asia. Section Eucuscuta has a wider distribution, with the largest number of species found close to the Mediterranean Sea. Subgenus Grammica is divided into two sections based on capsule dehiscence, with section Eugrammica possessing complete to partially dehiscent capsules and section Cleistogrammica producing indehiscent capsules. Species of subgenus Grammica are relatively evenly divided between the two subgenera, with 53 species in section Cleistogrammica and 68 species in Eugrammica .
Cuscuta is a readily recognizable genus, with the only species in the completely unrelated but strikingly similar parasitic vine genus Cassytha L. (Lauraceae) ever likely to cause any confusion  ; however, small flowers and a paucity of usable morphological characteristics often make the identification of Cuscuta to the species level a challenge. Although no comprehensive taxonomic study of the entire genus has been completed since Yuncker's monograph, Cuscuta remains one of the most widely studied parasitic plant lineages, with numerous publications on its anatomy [5–7], nutritional physiology  , plastid evolution [9–21] and even foraging behavior [22–25]. Phylogenies of Convolvulaceae with a small sampling of Cuscuta species showed it is confidently nested within that family . Although its exact placement could not be strongly inferred with more in-depth analysis  , the most confident placement was sister to a the 'Convolvuloideae' clade . Taxa from subgenus Monogyna appeared basal to subgenus Cuscuta and subgenus Grammica in those studies. Another study showed multiple members of subgenus Cuscuta to be nested within multiple clades of subgenus Grammica  , although those data are likely a result of misidentification of taxa and are discussed more extensively in our results.
Conflicting evidence exists as to the photosynthetic ability across the genus. Machado and Zetsche  demonstrated low levels of photosynthetic carbon assimilation in the noticeably chlorophyllous stems of Cuscuta reflexa (subgenus Monogyna) despite apparent loss of all ndh genes  , but found no detectable levels of RuBisCo expression in C. europaea, despite the presence of the gene encoding its large subunit (rbcL) in the plastid genome. Studies further showed that C. reflexa only produces chlorophyll in a specific layer of cells isolated from atmospheric gas exchange, suggesting it only photosynthesizes by recycling carbon dioxide released from respiratory byproducts of carbohydrates from its host source . C. pentagona Engelmann of subgenus Grammica was shown to possess a normal photosynthetic ratio of chlorophyll a to b, contain properly localized photosynthetic proteins and display low levels of carbon assimilation . However, other members of subgenus Grammica seem to possess highly altered plastid genomes; C. gronovii Willdenow and C. subinclusa Durand et Hilgard seemingly lack plastid-encoded polymerase (rpo) genes  , although low levels of transcription of rbcL still take place from nuclear-encoded polymerase promoter sites  , and these species, along with C. campestris Yuncker and C. reflexa still possess normal chlorophyll a and b ratios . In contrast, C. odorata Ruiz et Pavon and C. grandiflora Humbolt, Bonpland et Kunth are achlorophyllous, lack thylakoids and do not produce detectable levels of rbcL transcript or protein . The additional loss of some non-coding data from the plastid genome along with a few minor changes to intact reading frames within Cuscuta and Convolvulaceae have been reported and roughly mapped on a phylogeny of Cuscuta based on a minimal sampling of taxa .
In this study, we examine the phylogeny of the genus Cuscuta by sampling 35 species from all sections of the genus defined by Englemann  with the exceptions of section Epistigma and the monospecific section Cleistococca. Our sampling also includes species from 19 of 29 subsectional groups recognized by Yuncker . We obtain DNA sequences for phylogenetic analysis from two plastid loci (rbcL and rps2) and the nuclear internal transcribed spacer (ITS) region between the 18S and 5.8S ribosomal RNA loci from largely overlapping subsets of taxa to investigate phylogenetic relationships within the genus and test the monophyly of the previously defined subgeneric and subsectional delimitations. We determine genome sizes for species available as fresh tissue in order to address questions of species delimitation and to test whether genome size correlates with published chromosome numbers, which are highly variable . In addition to the plastid loci mentioned above, which correspond to the RuBisCo large subunit and a small ribosomal protein subunit respectively, we sample two more plastid loci representing two other functionally distinct genes (atpE, ATP synthase subunit; rpoA, plastid-encoded polymerase subunit) from smaller subsets of taxa in order to test whether all classes of plastid genes are evolving equally in Cuscuta relative to photosynthetic taxa. Using further polymerase chain reaction (PCR) assays, we test the distribution of major changes to the plastid genome within the genus and combine them with previously published evidence to gain a comprehensive view of photosynthetic evolution within Cuscuta. Finally, we use evidence from the biology and natural history of these parasites to suggest potential hypotheses as to why photosynthesis is retained in most members of the genus despite what superficially appears to be minimal opportunity for gain of photosynthetic carbohydrate.
Figure 2 shows individual parsimony bootstrap consensus cladograms for ITS, rps2 and rbcL and the four-gene combined dataset including matK data. Maximum parsimony bootstrap values (MP) are shown above the nodes and Bayesian posterior probability estimates (PP) are shown below the nodes. The individual gene trees are almost identical in topology, with no well-supported incongruences. Many of the support values are high for individual genes and almost every node is very well supported in the combined analysis. Furthermore, maximum-likelihood analyses were performed on the individual gene datasets; these analyses also gave nearly congruent topologies that agreed at well-supported in-group nodes (Figure 3). Cuscuta was found to be sister to the 'Convolvuloideae' clade  for two of the genes (matK and ITS), and this placement was very well supported in the combined analysis (MP 92, PP 1.0). Within Cuscuta, subgenus Monogyna was monophyletic and sister to all other Cuscuta species (MP 100, PP 1.0), with C. exaltata sister to all other sampled Monogyna species. Section Monogynella was paraphyletic, with C. reflexa of the monotypic section Callianche nested within (MP BP 100, PP 1.0). Subgenus Cuscuta was strongly supported as paraphyletic (MP 98, PP 1.0), with Cuscuta nitida Meyer representing section Pachystigma falling sister to subgenus Grammica, a result also supported by loss of two transfer RNA genes and loss of introns from ycf3 and atpF (see Figure 4). The two sampled species in section Eucuscuta were monophyletic (MP 100, PP 1.0). Subgenus Grammica was clearly monophyletic (MP 100, PP 1.0), although many highly supported nodes reject the monophyly of sections Eugrammica and Cleistogrammica. The basal lineage of subgenus Grammica was not clearly resolved, with the consensus showing a clade including subsection Odoratae (C. chilensis Ker-Gawler) with subsection Acutilobae (C. foetida Humboldt, Bonpland et Kunth) and a clade with subsections Indecorae, Umbellatae and Leptanthae in a polytomy together with a clade containing the remainder of the sampled subsections of subgenus Grammica. Subsection Californicae and subsection Tinctoriae were not monophyletic in the combined four-gene tree, but the monophyly of all other subsections cannot be disputed by these data. Our data are congruent at well-supported nodes with a study that sampled many additional species of subgenus Grammica utilizing two short loci (including ITS) .
Nuclear genome size results
Genome size estimates were highly variable within Cuscuta and did not appear to be related to previously published chromosome numbers overall (Table 1). Species in subgenus Monogyna, which generally show intermediate chromosome numbers between the other two subgenera  , have extremely large nuclear genomes according to our results. Low numbers of plastid clones relative to nuclear clones in a genomic fosmid library used to generate the full plastid genome sequence of Cuscuta exaltata help confirm these data . Within subgenus Cuscuta section Eucuscuta, genome sizes of Cuscuta europaea L. and C. epilinum Weihe actually did appear to correlate with karyotypes and known ploidy levels  , with the apparent recent triploid C. epilinum having a genome size consistent with these data relative to C. europaea. Estimated nuclear genome sizes within subgenus Grammica are the most variable, with an estimate for Cuscuta pentagona (1.16 picograms/2C) being the smallest of all sampled species and C. indecora Choisy (65.54 pg/2C) being the largest. There does not appear to be a standard genome size within this subgenus, although closely related species in subsection Oxycarpae, subsection Cephalanthae and subsection Lepidanche all possess proportional nuclear genome size, with three size classes perhaps reflecting different ploidy levels. Interestingly, accessions of C. gronovii from different geographic localities showed quite striking differences in genome size, even within two collections made within the state of Pennsylvania. Smaller, secondary peaks were detected in many species, suggesting that these stem tips were growing so rapidly as to have many cells at different stages of mitosis with different overall DNA content depending on phase. Alternatively, the parasites could be undergoing endoreduplication, a process frequent in metabolically active cells of eukaryotes by which the genome of those cells is doubled within the nucleus .
Plastid genome variation assays
Major changes to the plastid genome reported in this and previous studies are mapped on the cladogram in Figure 4. PCR and sequencing of the region between petD and rps11 showed that taxa across subgenus Grammica contained only residual rpoA pseudogene sequence, although the length of the remaining intergenic region was surprisingly constant across those taxa (data not shown). This confirmed previous hybridization data that failed to detect rpo (plastid-encoded RNA polymerase) genes [18, 20] and showed loss of transcription from known plastid-encoded polymerase promoter sites . PCR data also detected an inversion in the large single-copy region of C. reflexa  and C. japonica  that is a synapomorphy in all sampled species of subgenus Monogyna, as is a constriction of the large single-copy boundary of the inverted repeat region into ycf2. A two-kilobase inversion in the large single-copy region of the plastid genome was found in both sampled members of subgenus Cuscuta subsection Eucuscuta. Long PCR covering many intergenic regions demonstrated that the substantial reduction of non-coding DNA is shared across subgenus Grammica, with all species in the subgenus seemingly converging on a minimal length (Figure 5). Sequences from Cuscuta lupuliformis, in subgenus Monogyna, show much less reduction in intergenic regions. Members of subgenus Cuscuta, which still possess a full set of seemingly functional rpo genes, show intermediate levels of intergenic sequence loss; this indicates that intergenic constriction does not completely result from a loss of plastid-encoded polymerase promoter regions.
Finally, we attempted to study plastid genes in C. chilensis. C. chilensis is an achlorophyllous relative of C. odorata, a species which appears to lack rbcL . Unlike the results from C. odorata, we were unable to amplify rrn16 from C. chilensis using many combinations of primers. Furthermore, hybridization of various ribosomal protein gene and rrn16 PCR products from other species within Cuscuta subgenus Grammica to a filter containing over 1,500 Cuscuta chilensis clones from a genomic fosmid library returned no positive hits. Positive control amplifications of Cuscuta chilensis mitochondrial genes and hybridization of mitochondrial probes to the Cuscuta chilensis library showed that organellar DNA was present in our DNA extraction and library.
Tests of selective constraint
With such variability in gene content across Cuscuta, it was important to determine whether remaining genes are still under selective constraint and how these patterns of constraint vary across genes, across the taxonomic range of Cuscuta and between Cuscuta and its photosynthetic relatives. Unconstrained maximum-likelihood trees are shown in Figure 6. Trees with all branches constrained to the same non-synonymous to synonymous rate ratio were significantly worse than fully unconstrained trees for atpE, rbcL and rps2 (Table 2), indicating lineage-specific heterogeneity in selective constraint for these genes. No significant difference was observed between the likelihoods of rpoA trees when trees with all branches constrained to an identical non-synonymous to synonymous rate ratio were compared with unconstrained trees. Of the four hypotheses tested for atpE (significant constraint differences between outgroups and all Convolvulaceae including Cuscuta, differences between Cuscuta and non-parasites, differences between subgenera Cuscuta+Grammica and all other taxa, and differences between subgenus Grammica and all other taxa), constraining an independent non-synonymous to synonymous rate ratio for all Cuscuta from the rest of the tree most improved the likelihood scores, with the resulting likelihood no longer being significantly different from the fully unconstrained tree. For rbcL, all of the clades tested in the same manner remained significantly worse than the unconstrained tree, with the greatest improvement coming when subgenera Cuscuta and Grammica together were given a separate non-synonymous to synonymous rate ratio. In this case, as is apparent in the unconstrained tree, the non-synonymous to synonymous rate ratio actually decreases within Cuscuta, with all species under higher levels of purifying selection than the autotrophic outgroups. For rps2, yet a third pattern was observed. Of the hypotheses tested, a change in non-synonymous to synonymous rate ratio across Convolvulaceae improves the likelihood the most, again to the extent that it is no longer significantly different to the unconstrained tree, suggesting that a relaxation of constraint may have occurred in this gene before the evolution of parasitism. A similar result was found in the independently derived parasitic plant family Orobanchaceae, where significant rate increases in rps2 are seen even in very photosynthetic lineages before evolution of holoparasitism . For rpoA, there was no significant difference between the fully constrained and fully unconstrained trees, and no appreciable changes occurred under any of the proposed hypothetical shifts in non-synonymous to synonymous rate ratio.
Morphological, biogeographical and taxonomic interpretation of phylogeny
Although subgenus Grammica is clearly monophyletic in our study, it has been suggested that it is paraphyletic, with members of subgenus Cuscuta nested in multiple clades within Grammica . That study also included data from plastid rbcL and nuclear ITS, allowing us to compare sequences for taxa shared with our study. As their phylogenies show strong conflict with ours and make no sense from a morphological standpoint, and because data reportedly gathered from the same species as vouchered specimens from our study clearly represent unrelated taxa, we conclude that multiple taxa were misidentified in . This likely also affected their conclusion that loss of photosynthetic genes is distributed randomly on the phylogeny, when a clear stepwise and more parsimonious loss of photosynthetic genes is evident from our results. Cuscuta species can be difficult to identify when in flower (see Figure 7) and nearly impossible to identify from vegetative material, which was the source of tissue used for DNA isolations .
Yuncker believed that the morphological features of subgenus Grammica were the ancestral states owing to the species-richness of that subgenus; subgenus Grammica is clearly in a highly derived position within the genus and cannot be considered a potentially ancestral group. However, once the tree is re-rooted to the proper node (Figure 8), subsectional relationships within sections largely agree with interpretation of phylogenetic relationships proposed by Yuncker. Artificial relationships found to be non-monophyletic mostly result from interpretation of two morphological characters: stigma morphology and capsule dehiscence. Elongated stigmas appear to be a derived state in C. reflexa, which is nested within a clade of species with much stouter stigmas. In contrast, the globose stigmas seen in subgenus Grammica are apparently derived from elongate stigmas, such as those seen in subgenus Cuscuta. Stigma morphology appears to be quite plastic within the genus and a full range of intermediates between subgenus Cuscuta and subgenus Grammica exist. Thus, it is not surprising that section Pachystigma (represented by C. nitida in our dataset), with intermediate stigma morphology, is actually sister to subgenus Grammica and should be included in that subgenus. In fact, a species within section Pachystigma, Cuscuta cucullata Yuncker, is so similar to the only member of subgenus Grammica found in South Africa, C. appendiculata Engelmann, that Yuncker points out that they may be confused with each other. Although we were unable to sample those two species for our phylogeny, their distribution in South Africa has biogeographical implications for the colonization of the New World by subgenus Grammica from a South African/South American dispersal event. Putatively basal clades of subgenus Grammica are either distributed almost completely in South America (subsection Acutilobae and subsection Odoratae) or contain lineages distributed widely from South to North America (subsection Indecorae and subsection Umbellatae). Interestingly, C. cucullata and C. appendiculata are unique among South African Cuscuta species in having indehiscent capsules, which facilitate floating and water-mediated dispersal of the seeds in many members of subgenus Grammica section Cleistogrammica. Subgenus Grammica has successfully spread across both North and South America since colonizing the New World and now contains many more species than the other two subgenera combined. Whether the ancestor of C. exaltata (subgenus Monogyna) may have taken a similar route to colonize the New World is unknown, although it too shares a morphologically similar relative in South Africa (C. cassytoides Nees von Esenbeck).
While capsule dehiscence was one of the main characters used for monographical work in Cuscuta [1, 2] , our phylogenetic analyses agree with another study  that it is a transient character in the genus with very little systematic value and that the sectional entities of Eugrammica and Cleistogrammica should no longer be recognized. Many species of Cuscuta subgenus Grammica possess irregularly dehiscent capsules that are not easily classified as either indehiscent or circumscissile. Two interesting cases of indehiscent-capsuled species being allied to clades with circumscissile capsules are C. tasmanica Engelmann and C. sandwichiana Choisy. These derived members of subgenus Grammica have independently colonized islands far from the home of their Mexican sister taxa and both are found in coastal habitats. Indehiscent capsules may have also aided their aquatic dispersal events. Other taxa from subgenus Grammica found in the Pacific Rim (e.g. C. australis R. Brown) likely took a similar dispersal route via indehiscent capsules  , although we do not have data for those taxa in our phylogeny. Two other Old World species from subgenus Grammica, Cuscuta chinensis in Asia and Cuscuta kilamanjari in Africa, have dehiscent capsules, and may or may not have dispersed to their present ranges via ancestral indehiscent capsules.
Genome sizes and speciation
Estimates of species number within Cuscuta vary greatly, largely because so few characters exist to distinguish them. The existence of forms with supernumary chromosomes  and such widely scattered estimates of chromosome numbers in the genus  suggest polyploid and aneuploid evolution may occur rather rapidly in this lineage. Species that appear very similar morphologically may occupy very dissimilar ecological niches and exhibit different host preferences. One such example involves C. pentagona, C. campestris, C. polygonorum Engelmann and other relatives in subsection Arveses and subsection Platycarpae. C. campestris is often merged taxonomically with C. pentagona, as the two are distinguished primarily by slight differences in overall flower size and angularity of the calyx. However, our estimates of genome size between accessions identified as either form differed in size by almost a factor of 10 (Table 1). Estimates for C. polygonorum and C. pentagona differ by almost 50%, although those species have also been merged in at least one taxonomic treatment . C. polygonorum can be identified by flowers that are often four-merous and that have a slightly different gynoecium shape than those in C. pentagona. However, the species can usually be distinguished simply by noticeable habitat and host preferences. In such cases, where forms seem to be ecologically distinct as well as morphologically distinguishable, we suggest species-level distinction is likely warranted given the disparate genome sizes. Seemingly different ploidy levels exist within Cuscuta gronovii. Morphological variation in corolla size and shape exist in this species as well (Figure 7), indicating that cryptic species with different chromosome numbers that are incapable of interbreeding may exist. Accelerated rates of nucleotide substitution in the nuclear genome may also promote rapid speciation in subgenus Grammica if acceleration in ribosomal loci such as ITS (Figure 3) and 18S  are correlated with protein-coding rates. As almost all species of Cuscuta readily produce selfed seed even in the absence of pollinators, and pollen is often deposited on the stigma before the corolla opens, drastic changes in the nuclear genome that prevent outcrossing may promote speciation.
Plastid genome evolution in Cuscuta
In contrast to previous descriptions of chloroplast genome evolution in Cuscuta as a 'slippery slope'  or as occurring in a random, uncoordinated manner across the phylogeny  , we find that plastid genome evolution in Cuscuta has occurred in a stepwise fashion, with punctuated modification at various evolutionary time-points followed by long periods of stasis within various clades. Major changes occurred in the ancestor of the genus, the ancestor of subgenus Grammica and within one fully non-photosynthetic clade of subgenus Grammica. Across most species of subgenus Grammica and, as such, the majority of all Cuscuta species, plastid genome content appears to have stabilized on a smaller, but constrained size (see, e.g., Figure 5). Different types of genes appear to be evolving under different levels of constraint. Most surprisingly, rbcL appears to be under much greater purifying selection in Cuscuta than in autotrophic relatives. This effect may largely be a result of much higher overall rates of substitution in Cuscuta for the plastid genome (see branch lengths in Figures 3 and 6), but a need for amino acid stasis in rbcL. This intense conservation of most photosynthetic genes is quite unexpected for a genus that lacks leaves and extensive chlorophyllous surface area. Hibberd et al.  suggest that recycling of internally respired carbon dioxide may be the answer. However, loss of ndh genes could possibly make these parasites extremely susceptible to photorespiration unless extremely high respiratory rates existed near these photosynthetic cells or some other mechanism similar to C4 photosynthesis existed . Furthermore, these plants have seemingly little need to produce carbohydrates, which are readily obtained from the host.
A second pathway involving rbcL in lipid biosynthesis in green seeds of Brassica  suggests a tantalizing explanation for retention of photosynthetic genes in Cuscuta. Chlorophyll is concentrated in the developing ovules of Cuscuta (Figure 1), almost exclusively so in healthy members of subgenera Grammica and Cuscuta. Seeds often have high lipid content as energy reserves for the seedling and to aid in desiccation tolerance and seed longevity, and Cuscuta has been shown to accumulate lipid bodies that fill the majority of the non-nuclear cytoplasm . Most Cuscuta species are annuals and must be prolific producers of highly energetic seeds to ensure at least some offspring will be able to germinate and survive long enough to search out and attach to a host. The seeds are impermeable to water until the epidermal layer is scarified and they can live unimbibed for decades and remain viable. As lipids are less available from vascular extracts from the host and because of the intense demand for lipid production during fruiting, this efficient lipid synthesis pathway is a more plausible explanation for conservation of a photosynthetic apparatus in Cuscuta than residual carbohydrate production. Photosynthetic genes may have additional functions in subgenus Monogyna, where chlorophyllous cells are also concentrated in a thin layer of internal stem tissue .
Loss of photosynthesis in Cuscuta
If photosynthesis is important for seed lipid production in most Cuscuta, then questions remain as to why a few species can survive without chlorophyll and rbcL  (C. chilensis; Figure 1). Reproductive biology of the lineages of Cuscuta that contain these species, subsections Odoratae and Grandiflorae (and possibly Acutilobae), may provide an important clue. Large corolla size (see Cuscuta chilensis; Figure 7) and strong fragrance characterize members of these subsections. In our experience with cultivating C. chilensis, it is incapable of producing selfed seed (from over 100 hand-pollinations), whereas most Cuscuta species readily produce massive quantities of selfed seed without assistance. Observations of various natural populations in Chile showed that pollinator visitation was frequent, with species of Lepidoptera, Hymenoptera and Diptera all moving between flowers with varying amounts of Cuscuta pollen on their bodies. However, seed set in these natural populations was extremely low, with only a small proportion of old flowers containing viable seed. Likewise, seeds are usually sparse or absent on herbarium specimens of species in sections Odoratae and Grandiflorae. An ability to survive on hosts year-round may explain why these species have less demand for a massive seed set and, thus, are able to survive the cost of low fecundity to reap the benefits of self-incompatibility. A decreased demand for massive lipid production during fruiting may have rendered the remaining photosynthetic genes in the ancestor of these Cuscuta species obsolete. Our results and observations suggest in-depth molecular and reproductive physiological study of the large-flowered South American clades of Cuscuta subgenus Grammica will provide further insight into the evolutionary loss of photosynthesis in this parasitic lineage.
By generating a well-supported phylogeny of the economically important parasitic plant genus Cuscuta, we have provided a framework through which to test whether traditional taxonomic divisions of the genus represent monophyletic evolutionary clades, to evaluate which morphological characters are systematically misleading, to formulate biogeographical hypotheses that best explain current distributions of major clades and to interpret molecular phenomenon such as nuclear genome size evolution and plastid genome evolution. Subgenus Cuscuta is paraphyletic with subgenus Grammica nested within it. Subgenus Grammica likely colonized the new world through a dispersal event from South Africa to South America and then radiated throughout both North and South America; subsequent long-distance dispersal events, many possibly aided by transition to floating indehiscent capsules, best explain the few scattered members of subgenus Grammica in Hawaii, Australia, Asia and Africa. Nuclear genome size is highly variable in the genus and may be useful in recognizing new cryptic species. A reduction in plastid genome size appears to have occurred in punctuated steps followed by periods of relative stasis. Although plastid nucleotide substitution rates are quite rapid, photosynthetic genes are very strongly conserved in the majority of Cuscuta species even after the loss of all plastid ndh and RNA polymerase genes. The plastid genome is likely retained primarily for lipid biosynthesis during seed production and is possibly lost completely in a single clade of outcrossing species whose life histories seem to accommodate a reduction in overall seed production.
The quality of available tissue for different Cuscuta species was variable, but a common method using a typical plant CTAB DNA isolation  with 1% polyethylene glycol (molecular weight 8,000) added to the buffer proved effective for live plants grown in the Pennsylvania State University Biology greenhouse, freshly collected wild plants, frozen tissue, silica-gel dried tissue and small samples from herbarium specimens. For some dried material received in silica gel, vouchers were unavailable and we instead identified the species by dissection of rehydrated flowers from the sample. Photographs taken through a dissecting scope of characters necessary for identification are available as vouchers for such species. For two species for which we received no voucher material or flowering and fruiting material for dissection, we verified proper identification of the sample with sequence comparison of vouchered data at loci always variable above the species level. Vouchered specimens were deposited in the Pennsylvania State University Herbarium (PAC). Vouchers, taxon information and GenBank accession numbers for all sequences are presented in Table 3.
PCR and sequencing
Previously designed primers ITS4 and ITS5 were used for amplification and sequencing of the nuclear ITS locus according to a published protocol . A few taxa exhibited sequence polymorphisms, particularly in a highly variable loop region  , which was not confidently alignable across all taxa and was excluded for analyses. This also often resulted in length polymorphisms that required Topo cloning (Invitrogen, Carlsbad, CA) for capillary sequencing. For all taxa with polymorphic ITS loci, we found no evidence of lineage sorting, as all alleles from a given species always formed a clear clade. We used consensus sequences from multiple clone reads to sort true nucleotide polymorphisms from Taq polymerase error in incorporated PCR fragments. True nucleotide polymorphisms were rare and were entered into the data matrix as the predominant locus in our sample. Only one sequence from each species with identified length polymorphisms was used. Plastid rps2 was amplified with primers rps2-661R and either rps2-18F or rps2-47F  or, for recalcitrant taxa, new primers designed from the more readily generated Cuscuta sequences and the available plastid genome sequences of C. exaltata and C. obtusiflora (data analysis in prep). A partial rbcL product was also amplified using published primer sequences  or new primers designed specifically for Cuscuta. For some taxa sampled from herbarium material, internal primer combinations were used to amplify and sequence the gene in parts when necessary. Amplification across atpE was performed using primers atpB-1277F  and trnF-F ; for members of section Eucuscuta, trnT(2)-R  was substituted for trnF-F on the basis of an inversion of those taxa verified by this PCR and a PCR from trnF-F to rps4-32F . rpoA or rpoA pseudogenes were amplified and sequenced with a combination of the newly designed primers petD-endF and rps11-C398F. PCR protocol for rps2, rbcL, atpE, and rpoA all followed the rps2 protocol described by dePamphilis et al. . Long PCR assays of intergenic sequences were conducted using the following primer combinations: psbD-40F  to trnfM-R ; trnC-F  to psbD-45R ; and rps4-32F to atpB-s1277F. PCR from psbA-984F to ndhB-13F  was used to confirm contraction of the inverted repeat in members of subgenus Monogyna. These longer PCR assays were performed using 1 × Taq Extender Buffer, 0.2 mM of each dNTP, 2.5 mM MgCl2, 3.0 μM of each primer, 0.5 units of Taq DNA Polymerase (Promega, Pittsburgh, PA), 0.5 units of Taq Extender (Stratagene, La Jolla, CA) and approximately 500 ng of template DNA in 50 μl total volume. Amplification was accomplished using a thermal-cycling scheme of an initial 94°C denaturation for 2 min, followed by 10 cycles of 94°C for 10 s, 55°C for 30 s and 68°C for 6 min. Sixteen additional cycles were performed under the parameters of 94°C for 20 s, 55°C for 30 s and 68°C for 6 min with an additional 20 s added to this extension time each cycle. A final, additional extension at 68°C for 7 min was also performed. In cases where multiple bands were produced, this process was repeated with the extra MgCl2 removed. All newly designed PCR primers are given in Table 4. All PCR products that were sequenced were cleaned using a Qiaquick PCR Purification Kit (Qiagen, Valencia, CA) or a combination of five units of Exonuclease I and five units of Shrimp Alkaline Phosphatase (USB, Cleveland, OH) in 10 μl volume incubated at 37°C for 1 h followed by 15 min at 80°C to inactivate the enzymes. Sequencing was performed on a Beckman-Coulter CEQ-8000XL machine following the manufacturer's protocol.
ITS sequences were initially aligned using Clustal X  followed by manual adjustment. Protein-coding plastid sequences were easily aligned by eye, with attention paid to codon alignment in the few areas where gaps existed. A consensus of 500 bootstrap trees was created for each gene individually using maximum parsimony in PAUP*4.0b10 . Aligned datasets contained 684 base pairs (bp) for ITS, 1,399 bp for rbcL, and 660 bp for rps2. A combined bootstrap consensus was created using data from these three genes combined with matK data (1,650 aligned bp, 4,393 combined bp)  , although not all taxa are available for every locus owing to gene loss and/or failed amplification. Bayesian posterior probabilities were calculated for each node using Mr. Bayes v3.0b4 . Four cold chains and one chain heated at the default value were run with swapping according to default settings and a general-time reversible (GTR) likelihood model with a gamma and invariant parameter estimated from the data. One million generations were run with sampling every hundredth generation for a total of 10,000 trees. Likelihood estimates were graphed to determine appropriate burn-in values for each gene (200 trees discarded for rps2 and rbcL, 400 trees discarded for ITS, 250 discarded for combined data). In addition, maximum-likelihood phylograms and non-parametric bootstrap values (100 replicates) were generated with the program Garli (Version 0.951) using default search options under the GTR + gamma + I model for each of the three newly reported gene alignments with parameters estimated from the data.
Genome size estimates
Nuclear genome size estimates and standard errors were measured by flow cytometry  using either rice, soybean, tobacco, barley or wheat cultivars of known nuclear genome size as standards. Four replicates were performed for each plant, with the mean estimates and standard deviations (SD) reported in Table 1. Fresh plant material for these measurements was grown in the Pennsylvania State University Biology greenhouse. Cuscuta seeds were germinated after scarification in concentrated H2SO4 and grown with Impatiens walleriana, Solenostemon scutellarioides or Linum usitatissimum (for C. epilinum) as hosts. Fresh stem tip tissue was used for all size estimates reported.
Aligned datasets for atpE, rbcL and rps2 with identical sampling of 12 taxa were imported into HYPHY.99beta (see ). A different set of taxa was used for rpoA, which is missing in all sampled members of subgenus Grammica. A user tree, based on highly supported nodes of the bootstrap consensus tree in Figure 2 that was congruent with all single-gene analyses, was used for all genes (single-gene trees for atpE and rpoA not shown). Synonymous and non-synonymous branch lengths were first calculated with no constraints under the MG96, HKY 3, 4 codon model. Next, a tree with all branches constrained to the same non-synonymous to synonymous ratio was optimized, and a likelihood ratio test (LRT) was performed to determine whether the unconstrained tree had a significantly better likelihood. Likelihood parameters were then reoptimized for trees with the non-synonymous to synonymous rate ratio constrained differently for various clades (i.e. two non-synonymous to synonymous rate ratios on the tree; one for the subclade being tested, one for the remainder of the tree). Clades examined in this manner for atpE, rbcL and rps2 were the Convolvulaceae clade (Ipomoea + Cuscuta), all Cuscuta, all Cuscuta except subgenus Monogyna and the clade comprising the three sampled species of subgenus Grammica. For rpoA, clades examined were Convolvulaceae, Cuscuta, subgenus Cuscuta and Cuscuta nitida. LRTs were confined to testing only hypotheses of change at these nodes of interest rather than performing numerous additional tests and thereby increasing the chance of Type I error.
Yuncker TG: The genus Cuscuta. Mem Torrey Bot Soc. 1932, 18: 113-331.
Engelmann G: Systematic arrangement of the species of the genus Cuscuta, with critical remarks on old species and descriptions of new ones. Trans Acad Sci St Louis. 1859, 1: 453-523.
Choisy JD: De Convolvulaceis Dissertatio Tertia. Mem Soc Phys Hist Nat Geneve. 1841, 9: 261-288.
Kuijt J: Biology of Parasitic Flowering Plants. 1969, Berkeley and Los Angeles: University of California Press
Hibberd JM, Bungard RA, Press MC, Jeschke WD, Scholes JD, Quick WP: Localization of photosynthetic metabolism in the parasitic angiosperm Cuscuta reflexa . Planta. 1998, 205: 506-513. 10.1007/s004250050349.
Kelly CK, Harris D, Perez-Ishiwara R: Is breaking up hard to do? Breakage, growth, and survival in the parasitic clonal plant Cuscuta corymbosa (Convolvulaceae). Am J Bot. 2001, 88: 1458-1468. 10.2307/3558454.
Lyshede OB: Studies on mature seeds of Cuscuta pedicillata and Cuscuta campestris by electron microscopy. Ann Bot (London). 1992, 69: 65-371.
Jeschke WD, Baig A, Hilpert A: Sink-stimulated photosynthesis, increased transpiration and increased demand-dependent stimulation of nitrate uptake: nitrogen and carbon relations in the parasitic association Cuscuta reflexa-Coleus blumei. Cuscuta reflexa-Coleus blumei. 1997, 48: 915-925.
Machado MA, Zetsche K: A structural, functional and molecular analysis of plastids of the holoparasites Cuscuta reflexa and Cuscuta europaea. Planta. 1990, 181: 91-96. 10.1007/BF00202329.
Panda MM, Choudhury NK: Effect of irradiance and nutrients on chlorophyll and carotenoid content and hill reaction activity in Cuscuta reflexa. Photosynthetica. 1992, 26: 585-592.
Haberhausen G, Valentin K, Zetsche K: Organization and sequence of photosynthetic genes from the plastid genome of the holoparasitic flowering plant Cuscuta reflexa . Mol Gen Genet. 1992, 232: 154-161. 10.1007/BF00299148.
Haberhausen G, Zetsche K: Functional loss of ndh genes in an otherwise relatively unaltered plastid genome of the holoparasitic flowering plant Cuscuta reflexa. Plant Mol Biol. 1994, 24: 217-222. 10.1007/BF00040588.
Freyer R, Neckermann K, Maier RM, Kossel H: Structural and functional-analysis of plastid genomes from parasitic plants – loss of an intron within the genus Cuscuta . Curr Genet. 1995, 27: 580-586. 10.1007/BF00314451.
Choudhury NK, Sahu D: Photosynthesis in Cuscuta reflexa : a total plant parasite. Photosynthetica. 1999, 36: 1-9. 10.1023/A:1007025500452.
Sherman TD, Pettigrew WT, Vaughn KC: Structural and immunological characterization of the Cuscuta pentagona L-chloroplast. Plant Cell Physiol. 1999, 40: 592-603.
van der Kooij TAW, Krause K, Dorr I, Krupinska K: Molecular, functional and ultrastructural characterisation of plastids from six species of the parasitic flowering plant genus Cuscuta . Planta. 2000, 210: 701-707. 10.1007/s004250050670.
Berg S, Krupinska K, Krause K: Plastids of three Cuscuta species differing in plastid coding capacity have a common parasite-specific RNA composition. Planta. 2003, 218: 135-142. 10.1007/s00425-003-1082-8.
Krause K, Berg S, Krupinska K: Plastid transcription in the holoparasitic plant genus Cuscuta : parallel loss of the rrn16 PEP-promoter and of the rpoA and rpoB genes coding for the plastid-encoded RNA polymerase. Planta. 2003, 216: 815-823.
Berg S, Krause K, Krupinska K: The rbcL genes of two Cuscuta species, C. gronovii and C. subinclusa, are transcribed by the nuclear-encoded plastid RNA polymerase (NEP). Planta. 2004, 219: 541-546. 10.1007/s00425-004-1260-3.
Stefanovic S, Olmstead RG: Down the slippery slope: plastid genome evolution in Convolvulaceae. J Mol Evol. 2005, 61: 292-305. 10.1007/s00239-004-0267-5.
Revill MJW, Stanley S, Hibberd JM: Plastid genome structure and loss of photosynthetic ability in the parasitic genus Cuscuta . J Exp Bot. 2005, 56: 2477-2486. 10.1093/jxb/eri240.
Kelly CK, Venable DL, Zimmerer K: Host specialization in Cuscuta costaricensis – an assessment of host use relative to host availability. Oikos. 1988, 53: 315-320. 10.2307/3565530.
Kelly CK: Plant fraging – a marginal value model and coiling response in Cuscuta subinclusa . Ecology. 1990, 71: 1916-1925. 10.2307/1937599.
Kelly CK: Resource choice in Cuscuta europaea . Proc Natl Acad Sci USA. 1992, 89: 12194-12197. 10.1073/pnas.89.24.12194.
Runyon JB, Mescher MC, De Moraes CM: Volatile chemical cues guide host location and host selection by parasitic plants. Science. 2006, 313: 1964-1967. 10.1126/science.1131371.
Stefanovic S, Krueger L, Olmstead RG: Monophyly of the Convolvulaceae and circumscription of their major lineages based on DNA sequences of multiple chloroplast loci. Am J Bot. 2002, 89: 1510-1522.
Stefanovic S, Olmstead RG: Testing the phylogenetic position of a parasitic plant (Cuscuta, Convolvulaceae, Asteridae): Bayesian inference and the parametric bootstrap on data drawn from three genomes. Syst Biol. 2004, 53: 384-399. 10.1080/10635150490445896.
Stefanovic S, Austin DF, Olmstead RG: Classification of Convolvulaceae: A phylogenetic approach. Syst Bot. 2003, 28: 791-806.
Sherman TD, Pettigrew WT, Vaughn KC: Structural and immunological characterization of the Cuscuta pentagona L. chloroplast. Plant Cell Physiol. 1999, 40: 592-603.
Pazy B, Plitmann U: Chromosome divergence in the genus Cuscuta and its systematic implications. Caryologia. 1995, 48: 173-180.
Stefanovic S, Kuzmina M, Costea M: Delimitation of major lineages within Cuscuta subgenus Grammica (Convolvulaceae) using plastid and nuclear DNA sequences. Am J Bot. 2007, 94: 568-589.
McNeal JR, Leebens-Mack JH, Arumuganathan K, Kuehl JV, Boore JL, dePamphilis CW: Using partial genomic fosmid libraries for sequencing complete organellar genomes. Biotechniques. 2006, 41: 69-73.
Larkins BA, Dilkes BP, Dante RA, Coelho CM, Woo Y, Liu Y: Investigating the hows and whys of DNA endoreduplication. J Exp Bot. 2001, 52: 183-192. 10.1093/jexbot/52.355.183.
dePamphilis CW, Young ND, Wolfe AD: Evolution of plastid gene rps2 in a lineage of hemiparasitic and holoparasitic plants: Many losses of photosynthesis and complex patterns of rate variation. Proc Natl Acad Sci USA. 1997, 94: 7367-7372. 10.1073/pnas.94.14.7367.
Pazy B: Supernumerary chromosomes and their behaviour in meiosis of the holocentric Cuscuta babylonica Choisy. Bot J Linn Soc. 1997, 123: 173-176. 10.1006/bojl.1996.0076.
Beliz T: A revision of Cuscuta section Cleistogrammica using phenetic and cladistic analyses with a comparison of reproductive mechanisms and host preferences in species from California, Mexico, and Central America. 1986, Berkeley, CA: University of California, Berkeley
Nickrent DL, Starr EM: High rates of nucleotide substitution in nuclear small-subunit (18S) rDNA from holoparasitic flowering plants. J Mol Evol. 1994, 39: 62-70. 10.1007/BF00178250.
Horvath EM, Peter SO, Joet T, Rumeau D, Cournac L, Horvath GV, Kavanagh TA, Schafer C, Peltier G, Medgyesy P: Targeted inactivation of the plastid ndhB gene in tobacco results in an enhanced sensitivity of photosynthesis to moderate stomatal closure. Plant Physiol. 2000, 123: 1337-1349. 10.1104/pp.123.4.1337.
Schwender J, Goffman F, Ohlrogge JB, Shachar-Hill Y: Rubisco without the Calvin cycle improves the carbon efficiency of developing green seeds. Nature. 2004, 432: 779-782. 10.1038/nature03145.
Doyle JJ, Doyle JL: Isolation of plant DNA from fresh tissue. Focus. 1990, 12: 13-15.
Baldwin BG: Phylogenetic utility of the internal transcribed spacers of nuclear ribosomal DNA in plants: an example from the Compositae. Mol Phylogenet Evol. 1992, 1: 3-16. 10.1016/1055-7903(92)90030-K.
Hershkovitz MA, Zimmer EA: Conservation patterns in angiosperm rDNA ITS2 sequences. Nucl Acids Res. 1996, 24: 2857-2867. 10.1093/nar/24.15.2857.
Olmstead RG, Michaels HJ, Scott KM, Palmer JD: Monophyly of the Asteridae and identification of their major lineages inferred from DNA sequences of rbcL . Ann Mo Bot Gard. 1992, 79: 249-265. 10.2307/2399768.
Hoot SB, Culham A, Crane PR: The utility of atpB gene sequences in resolving phylogenetic relationships: Comparisons with rbcLand 18S ribosomal DNA sequences in the Lardizabalaceae. Ann Mo Bot Gard. 1995, 82: 194-207. 10.2307/2399877.
Dumolin-Lapegue S, Pemonge MH, Petit RJ: An enlarged set of consensus primers for the study of organelle DNA in plants. Mol Ecol. 1997, 6: 393-397. 10.1046/j.1365-294X.1997.00193.x.
Demesure B, Sodzi N, Petit RJ: A set of universal primers for amplification of polymorphic noncoding regions of mitochondrial and chloroplast DNA in plants. Mol Ecol. 1995, 4: 129-131.
Nickrent DL, Yan OY, Duff RJ, dePamphilis CW: Do nonasterid holoparasitic flowering plants have plastid genomes?. Plant Mol Biol. 1997, 34: 717-729. 10.1023/A:1005860632601.
Graham SW, Olmstead RG: Utility of 17 chloroplast genes for inferring the phylogeny of basal angiosperms. Am J Bot. 2000, 87: 1712-1730. 10.2307/2656749.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl Acids Res. 1997, 24: 4876-4882. 10.1093/nar/25.24.4876.
Swofford DL: PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods), Version 4.0b10. 2002, Sunderland, MA: Sinauer Associates
Huelsenbeck JP, Ronquist F: MRBAYES: Bayesian inference of phylogeny. Bioinformatics. 2001, 17: 754-755. 10.1093/bioinformatics/17.8.754.
Arumuganathan K, Earle ED: Estimation of nuclear DNA contents of plants by flow cytometry. Plant Mol Biol Rep. 1991, 9: 229-241.
Pond SL, Frost SD, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005, 21 (5): 676-679. 10.1093/bioinformatics/bti079.
The authors thank Daniel Austin, Todd Barkman, Mauricio Bonifacino, Alison Colwell, Peter Endress, Andreas Fleischmann, Julian Hibberd, Greg Jordan, Cliff Morden, Lytton Musselman, Ann Rhoads, Susan Schardt, Kim Steiner, Kyoji Yamada, George Yatskievych, Tony Omeis and the Pennsylvania State University Greenhouse, Tom Wendt and the University of Texas herbarium, and the Pennsylvania State University herbarium for assistance in obtaining plant material, and Daniel Nickrent, David Geiser, Stephen Schaeffer and Andrew Stephenson for helpful comments on the manuscript. This study was supported by a National Science Foundation Doctoral Dissertation Improvement Grant to JRM (DEB-0206659) as well as DEB-0120709 to CWD, and partially performed under the auspices of the US Department of Energy's Office of Science, Biological and Environmental Research Program, and by the University of California, Lawrence Berkeley National Laboratory under Contract No. DE-AC02-05CH11231. A Henry W. Popp Graduate Assistantship for JRM from the Department of Biology at Penn State University is gratefully acknowledged.
JRM collected PCR data, performed analyses and wrote the manuscript. KA performed flow cytometry nuclear genome size estimation. JVK and JLB helped produce complete plastid genome sequences for two Cuscuta species and a photosynthetic outgroup from which some loci were extracted for this study and which were used extensively for primer design. CWD participated in design and coordination of the research and extensively edited the manuscript. All authors read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.