- Research article
- Open Access
Genomics analysis of Aphanomyces spp. identifies a new class of oomycete effector associated with host adaptation
BMC Biology volume 16, Article number: 43 (2018)
Oomycetes are a group of filamentous eukaryotic microorganisms that have colonized all terrestrial and oceanic ecosystems, and they include prominent plant pathogens. The Aphanomyces genus is unique in its ability to infect both plant and animal species, and as such exemplifies oomycete versatility in adapting to different hosts and environments. Dissecting the underpinnings of oomycete diversity provides insights into their specificity and pathogenic mechanisms.
By carrying out genomic analyses of the plant pathogen A. euteiches and the crustacean pathogen A. astaci, we show that host specialization is correlated with specialized secretomes that are adapted to the deconstruction of the plant cell wall in A. euteiches and protein degradation in A. astaci. The A. euteiches genome is characterized by a large repertoire of small secreted protein (SSP)-encoding genes that are highly induced during plant infection, and are not detected in other oomycetes. Functional analysis revealed an SSP from A. euteiches containing a predicted nuclear-localization signal which shuttles to the plant nucleus and increases plant susceptibility to infection.
Collectively, our results show that Aphanomyces host adaptation is associated with evolution of specialized secretomes and identify SSPs as a new class of putative oomycete effectors.
Oomycetes are filamentous eukaryotes that have colonized all terrestrial and oceanic ecosystems . Oomycete evolution started from marine habitats, and their closest cousins are probably free-living phagotrophic protists . The great diversity of lifestyles displayed by oomycetes raised questions about genetic and molecular mechanisms involved in their evolution and rapid adaptation to environmental changes [2, 3]. Oomycete pathogenicity mainly relies on large repertoires of secreted proteins, known as effectors. They show rapid evolution within a given genome as a result of co-evolution with their hosts and are often associated with transfers to unrelated hosts [4, 5]. For instance, protease inhibitors produced by two sister Phytophthora species evolved to target plant proteases of their respective unrelated hosts, linking effector specialization and host diversification . Phytophthora spp. whole genome studies revealed a bipartite genome architecture that evolved at different rates  and where effector genes are associated with transposable elements (TEs) in gene-sparse regions . RxLR and Crinklers (CRNs) are the two main effector families found in these fast-evolving genomic regions of Phytophthora spp. [4, 7]. These two large families of effectors consist of modular proteins with a conserved N-terminus host-addressing signal (i.e., a trafficking sequence), while the variable Cter region harbors the effector function [8, 9]. Besides their role in pathogenicity, numerous RxLR proteins are specifically detected by host plants able to produce cognate resistance proteins to trigger resistance [10, 11]. Specific interactions between RxLR effectors and resistance proteins are on the basis of the gene-for-gene concept, a fundamental process of the plant immune system, also known as effector-triggered immunity (ETI) . In that case, rapid evolution of RxLR effectors allows the pathogen to counteract this surveillance system.
Phylogenetic analyses placed the Aphanomyces genus in the Saprolegnian lineage, which includes several species pathogenic in plants and aquatic animals such as the salmon pathogen Saprolegnia parasitica . In contrast, most of the species in the Peronosporalean lineage are mainly pathogenic on plants, one of the best-studied species being Phytophthora infestans, which causes late blight on solanaceous crops such as potato or tomato . The Aphanomyces genus has been shown to contain three major lineages, including plant pathogens, aquatic animal pathogens, and saprophytic species , making this genus a unique model to understand evolutionary mechanisms involved in adaptation of oomycetes to distantly related hosts and environmental niches. Among the most damaging Aphanomyces species is the legume pathogen Aphanomyces euteiches, which causes significant damage to various legume crops (peas, alfalfas, faba beans, lentils, etc.) . First reported by Drechsler in 1925 as the causal agent pathogen of root rot in peas in Wisconsin (USA) , the pathogen is now recorded in Europe, Australia, New Zealand, and throughout the USA. Aphanomyces euteiches is a major problem affecting the field pea in most pea-growing regions .
Aphanomyces euteiches is composed of distinct subspecific groups based on genotype and host preference (i.e., pea, alfalfa, etc.) [17, 18]. The infection is initiated by oospore germination in close vicinity to a root plant host. The oospores form a germ tube and a long terminal zoosporangium that can release more than 300 primary motile zoospores . The zoospores locate and encyst on the host root within minutes, and the cysts are able to germinate and penetrate the cortical cells within hours. The mycelium then grows intercellularly through the root tissue and forms oospores within a few days of infection . Although the symptoms caused by A. euteiches can be difficult to distinguish from symptoms caused by other root-infecting plant pathogens (such as Pythium or Fusarium), a characteristic colored soft rot of the roots is generally observed . Under optimal field conditions, legume infection by A. euteiches can result in symptoms within 10 days, and oospores can be detected between 7 and 14 days . The oospores of A. euteiches are long-lived and can remain dormant in soil and organic debris for many years , making legume cultivation inefficient. Effective chemical controls for Aphanomyces root rot of legumes are not available, and the development of tolerant cultivars appears to be the more effective management technique available to farmers. Partial resistance in pea or the legume model Medicago truncatula to A. euteiches is mediated by several quantitative trait loci (QTLs). Recent whole genome sequencing data in conjunction with genome-wide association studies (GWASs) on M. truncatula have allowed the identification of promising candidate genes [20, 21] to manage the parasite.
Aphanomyces astaci is an obligate parasite of freshwater decapods, particularly crayfish (but crabs could act as potential vectors as well [22, 23]). It presumably originates from North America, where infected native crayfish do not show disease symptoms. Five distinct A. astaci genotype groups (from A to E) are known in Europe and were isolated from infected European crayfish specimens. A. astaci reproduces asexually through the formation of short-lived bi-flagellated zoospores that spread in aquatic ecosystems . After the encystment of the zoospore on the host cuticle and its germination, the growing hypha penetrates into the cuticle. In resistant crayfish, the hyphal growth is stopped by encapsulation and melanization, resulting from the host immune response. In susceptible crayfish, hyphae penetrate deeper into tissues and organs, generally killing the host . The time required for development varies depending on the A. astaci strain and water temperature . Like plant pathogens, A. astaci presumably secretes a battery of virulence proteins to promote infection and facilitate host adaptation. Indeed, infection experiments have shown that A. astaci strains from group E had a high level of virulence, comparable to that of group B . A. astaci has been nominated among the “100 of the World’s Worst Invasive Alien Species” in the Global Invasive Species Database .
In contrast to species from the Peronosporalean lineage, little is known regarding the molecular mechanisms that govern host adaptation and environmental niche colonization for Aphanomyces species. A previous analysis of a collection of expressed sequence tags (ESTs) from the legume pathogen A. euteiches revealed the uniqueness of its effector repertoire, which is composed of numerous CRN effector genes without any RxLR effectors [28,29,30]. This observation led to the suggestion that CRNs are ancestral in the oomycete lineage and specific to phytopathogenic species [7, 9]. Indeed, CRN coding genes have been detected in all plant pathogenic oomycetes sequenced to date, but never in animal pathogenic species such as S. parasitica . This has also raised the possibility that, besides the RxLR family, CRNs and maybe other oomycete effectors may have important roles in triggering host susceptibility.
Here we report on the whole genome sequencing and annotation of a pea-infecting strain of A. euteiches. To characterize genetic factors involved in plant adaptation by comparative genomics, we generated a first genomic draft of the crustacean pathogen A. astaci. In combination with RNA-seq data, we showed that Aphanomyces adaptation to plant or animal hosts is correlated with the expression of highly specialized secretomes. Moreover, our study led to the identification of a set of small secreted proteins (SSPs) which can be considered as a new class of oomycete effectors.
Aphanomyces spp. genome sequencing and phylogeny relationship
The genome of the A. euteiches ATCC201684 strain was sequenced by combining 454 and Illumina sequencing technologies providing a high-quality reference assembly of 57 Mb (Table 1). The estimated genome size of A. euteiches is consistent with the size range of most previously sequenced oomycete genomes except P. infestans (240 Mb)  and close to the assembled S. parasitica fish pathogen genome (63 Mb) . The GC content (~ 47%) is one of the lowest detected in oomycetes and in close agreement with the Albugo laibachii  and Plasmopara viticola genomes . The benchmarking universal single-copy orthologs (BUSCO) method  was used to estimate the degree of completeness of the assembled gene space. Most of the gene space was covered as 83.1% complete, and a 3.8% partial model of conserved fungal genes was identified within the genome of A. euteiches. The number of genes (19,548) is more important in A. euteiches than in other sequenced oomycetes, while the gene length mean (~ 1.5 kb) is similar to that of various oomycetes [33, 34, 36]. The A. euteiches genome is thus highly compact with the second-highest gene density (one gene per 2.9 kb) reported so far for oomycetes and similar to the S. parasitica genome (one gene per 2.6 kb). The completeness of the A. euteiches sequenced genome and prediction of open reading frames were validated by the analyses of expressed sequences. In total, 87% of ESTs previously generated from A. euteiches mycelium (MYC) and mycelium grown in contact with M. truncatula roots (INT)  mapped to the assembly. In addition, ~ 90% of assembled Illumina reads generated during this study by using MYC and INT samples, mapped to the A. euteiches genome assembly. The completeness of the assembly was further evaluated with the BUSCO method  using the Alveolata-Stramenopiles dataset. The same analysis was performed with eight sequenced oomycetes (Additional file 1: ST1a, b). The A. euteiches assembly contained > 94% of the Stramenopiles dataset and appears as the better one among the sequenced oomycetes. All the data were included in an updated version of the AphanoDB database dedicated to “omics” studies on the Aphanomyces genus . AphanoDB v2.0 provides new tools as a genome browser, gene annotation facilities, and Basic Local Alignment Search Tool (BLAST) tools.
Genome sequences and assemblies of the crayfish pathogen A. astaci and the opportunistic pathogen A. stellatus were generated, giving an estimated size of 45 Mb and 62 Mb (Table 1) with 16,479 and 25,573 predicted genes, respectively. As expected, genome assemblies of A. astaci and A. stellatus compiled only from short reads generated by Illumina are more fragmented and must be considered as drafts.
These data were used to perform a phylogenetic analysis with a gene set recently used to estimate timescale evolution of oomycetes  (see Methods for further details). Aphanomyces species form a group which diverged from other Saprolegniales (Saprolegnia parasitica, Achlya hypogena, Thraustotheca clavata) more than 100 Mya (Fig. 1). This analysis also shows that divergence times are very deep within the Aphanomyces genus, with the three Aphanomyces species diverging from each other more than 50 Mya.
Transposable Elements (TE)
TEs are known to play a prevalent role in the evolution of eukaryotic pathogens [2, 3]. A de novo characterization of TEs was thus performed on all Aphanomyces genomes. The aim of the TE analysis was to uncover and annotate all TE copies in all three Aphanomyces genomes to facilitate future studies that will investigate the origin, evolution, and genomic impact of these elements in more detail. The fraction of Aphanomyces genomes occupied by TEs (5–13%; Additional file 1: ST1d) is lower than that for all other sequenced oomycetes (17–74%) in which TEs have been mined, such as Phytophthora species [32, 39], the downy mildew Plasmopara viticola , and the white rust Albugo laibachii , with the exception of Pythium ultimum (5%) . It is noteworthy that in oomycetes genome size is strongly correlated with TE content (R2 = 0.85), supporting the hypothesis that TEs are a major determinant of genome size in these taxa . In all three Aphanomyces species, the most abundant TEs are DNA transposons of the Tc1-mariner superfamily (up to 2.5% of the genome in A. astaci). Interestingly, none of the TEs that we identified are shared between all three Aphanomyces species, and very few TEs are shared between two species: two are shared between A. astaci and A. euteiches (74 and 83% identity over 741 and 170 bp respectively), two are shared between A. stellatus and A. euteiches (84 and 86% identity over 334 and 1262 bp respectively), and four are shared between A. astaci and A. stellatus (between 78% identity over 1820 bp and 84% identity over 195 bp) (Fig. 2). Furthermore, BLASTN searches using all TEs identified in Aphanomyces spp. as queries against all Repbase TEs did not reveal any significant hit. Together, these results suggest that Aphanomyces species TE families are characterized by a high rate of turnover.
Comparative analysis of A. euteiches and A. astaci proteomes
To identify conserved and specific features of Aphanomyces proteomes, an OrthoMCL analysis  was performed using A. euteiches and A. astaci proteomes and nine deep-sequenced oomycete proteomes (S. parasitica, P. infestans, P. sojae, P. parasitica, P. ramorum, Hyaloperonospora arabidopsidis, A. laibachii, Py. ultimum, and Py. irregulare) (Additional file 1: ST1e). A total of 2296 orthologous groups (34,404 genes) were detected in the 11 genomes defining a “core proteome” set (Additional file 1: ST1f). In this set 104 groups corresponding to 1528 genes defined the “core secretome” (Additional file 1: ST1f). A focus on the A. euteiches secretome (1240 genes, ~ 6% of the proteome, Additional file 1: ST1g) shows that 70% of the sequences harboring a Gene Ontology (GO) term are related to the “hydrolase activity” category (GO:0016787) (Additional file 1: ST1h and Fig. 3a). This category is enriched in enzymes with glycosyl hydrolase or peptidase functions and other typical oomycete secreted proteins, such as protease inhibitors. The secretome of A. astaci is predicted to contain 744 genes (< 5% of the proteome). While this set is probably not complete, 323 sequences harbor a GO annotation and more than 65% (217 sequences) fall into the “hydrolase activity” category with a predominance of “peptidase GO:0008233” function (160 sequences, 73%). A closer examination of both secretomes showed that numerous genes detected in A. euteiches are not reported in A. astaci (Fig. 3a). In addition, about 72% of the A. euteiches-specific secretome (i.e., sequences not present in other oomycetes including A. astaci, 506 genes) did not display a putative functional domain (368 genes), and 80% encoded proteins below 300 amino residues in size (296/368, Additional file 1: ST1i, Fig. 3b). The same observation was made for the A. astaci-specific secretome, where 160 specific genes did not harbor a functional annotation and 86% (138/160, Additional file 1: ST1i) coded for small proteins (Fig. 3b). Interestingly, 59% of these small proteins were predicted to be putative effectors from A. euteiches or A. astaci using EffectorP (Additional file 1: ST1i), a software based on properties shared by fungal effectors . This suggests that Aphanomyces species presented a set of SSPs similar to fungal effectors.
Carbohydrate-Active enZymes (CAZy) and carbohydrate-binding modules (CBMs)
Since comparative analyses suggested that the repertoires of glycosyl hydrolases are divergent between A. euteiches and A. astaci, a global prediction of CAZy domains (glycosyl hydrolases (GH), carbohydrate esterases (CE), polysaccharide lyases (PL), and carbohydrate-binding modules (CBM)) was performed. This analysis revealed that the A. euteiches genome is enriched in gene coding for proteins with CAZy domains compared to the A. astaci genome (Fig. 4, Additional file 1: ST1j). For example, more than 300 genes coding for proteins with at least one predicted GH domain were detected in the A. euteiches genome, a repertoire size similar to those predicted in oomycete genomes such as Phytophthora species , whereas only 109 genes were found in the A. astaci genome.
A closer examination of this repertoire revealed that several enzyme families targeting plant-specific polysaccharides (pectins, hemicellulases) are expanded in A. euteiches and absent in A. astaci. Clearly, the A. astaci genome lacks genes coding for hemicellulases (e.g., GH, 10, 11, CE4 families) and pectinases (e.g., GH28, PL1, PL3, and PL4), which are largely represented in the A. euteiches genome (Fig. 4, Additional file 1: ST1j). Based on the CAZy database, eight GH families correspond to eukaryotic cellulases. The GH5, GH6, and GH7 families, which encode endocellulases and cellobiohydrolases, are highly represented in A. euteiches as compared to other oomycetes. Since oomycete cell walls contain cellulose, these enzymes can play a role in oomycete cell wall remodeling. Interestingly, while GH5 and GH6 cellulases are present in A. astaci and A. euteiches, a large family of GH7 was found only in A. euteiches (39 GH7-encoding gene), suggesting that this class of enzymes could play a role in plant pathogenesis (Fig. 4).
In phytopathogenic fungi it is suggested that α-l-arabinofuranosidases such as GH62 and GH54 are involved in plant penetration and pathogenesis [44, 45]. While GH62 are absent in oomycetes belonging to the Peronosporalean lineage analyzed so far, 12 putative GH62-coding sequences are detected in A. euteiches, while GH54 are missing (Fig. 4, Additional file 1: ST1j). Among GH62, seven are predicted to be secreted. It has also been suggested that α-l-arabinofuranosidases, by degrading arabinofuranose side chains, improve the accessibility of the xylan backbone of hemicellulose to xylanases as GH10 and GH11 , which are present in A. euteiches (22 and 3 sequences respectively). Thus, the presence of genes coding for these enzymes suggests that A. euteiches can express a complete repertoire of hemicellulose-degrading enzymes, distinct from those produced by other oomycetes.
The availability of CAZyme repertoires from distantly related oomycetes offers the opportunity to investigate their origin. Since most of these genes are closely related to fungal genes, it has been suggested that some of them were acquired by oomycetes by horizontal gene transfer (HGT) events from true fungi . In certain cases, this hypothesis was sustained by phylogenetic analyses. This is the case for a pectate lyase gene from P. infestans (EEY64154)  for which seven orthologous genes, corresponding to the PL1 family, were detected in the A. euteiches genome but not in A. astaci (Additional file 1: ST1j). Phylogenetic analysis using these sequences (Additional file 1: ST1k, Additional file 2: Figure S1) supports the hypothesis that these genes were acquired early in the evolution of oomycetes, before the Saprolegniale-Peronosporale divergence, and probably lost during the specialization of Aphanomyces species to animal hosts.
Numerous CAZymes are associated with CBMs to facilitate enzyme activity . In A. euteiches, many CAZymes are associated with CBMs, and 130 CBM1 (cellulose-binding modules) predicted proteins are detected in this species, but only 7 genes in A. astaci (Additional file 1: ST1j, Fig. 4). In contrast, A. astaci harbors a set of genes encoding proteins with a putative chitin-binding domain (PF03067, IPR004302, CBM33, now reclassified as the AA10 domain in CAZy) that are not detected in A. euteiches (Additional file 1: ST1j, Fig. 4). Interestingly, > 80% (19/23) of these domains are associated with a predicted signal peptide. These data suggested that CBM33 from A. astaci might play a role in the interaction with the crustacean shell, as reported for the PlCBP49 CBM33-containing virulence factor of the honey bee pathogen Paenibacillus larvae that degrades chitin .
Aphanomyces spp. cytoplasmic effectors (RxLR and CRN oomycete effectors)
An important discovery regarding oomycete genome sequencing projects resides in the identification of predicted secreted proteins that could be delivered into host cells to aid pathogenicity. Two types of cytoplasmic effectors dominate the oomycete secretome: the RxLR effectors and the Crinklers (CRNs). These effector proteins are characterized by an amino acid signature located at the N-terminus of the sequence. We thus investigated the presence of these host-targeting signals in A. euteiches and A. astaci species. The genomes of both Aphanomyces were searched for RxLRs and CRNs as previously reported [17, 40]. This approach did not allow the identification of RxLR effectors on both Aphanomyces genomes, as previously suggested upon analysis of EST libraries from the same strain of A. euteiches . This analysis sustains the view that RxLR effectors seem to be absent in the Saprolegniale lineage . For predicting CRNs, a combination of automatic searches (hidden Markov model (HMM), regular expression) and manual curation methods was used, and 160 and 31 CRNs and CRN-like genes were detected in the A. euteiches and A. astaci genomes respectively (Additional file 1: ST1k). For A. euteiches, this number is similar to the one reported in phytopathogenic Phytophthora species (> 80 P. capsici; > 200 P. sojae; > 400 P. infestans [32, 51]).
Consistent with previous data on oomycete genomes [7, 52, 53], fewer than 25% of A. euteiches CRNs and none in A. astaci were predicted to be secreted by means of SignalP analysis. By contrast, around 60% of AeCRNs and AeCRN-like genes harbor a predicted nuclear-localization signal (NLS). Among them 16 have both a predicted signal peptide and an NLS. A majority of AeCRNs (> 60%) harbor a LYLAK motif at the N-terminal rather than the canonical Phytophthora LxLFLAK motif, as previously reported upon complementary DNA (cDNA) annotation of A. euteiches [28, 29, 54]. We noticed that the N-terminal trafficking signal is less conserved in A. astaci as compared to CRNs from A. euteiches, and only five sequences harbor a putative header signal in combination with a C-terminal domain (Additional file 1: ST1k). The headers of CRNs are followed by a more diverse C-terminal domain that confers effector activity . Based on sequence similarity, 36 domains were initially defined for the CRN repertoire of P. infestans, and new C-termini domains have been characterized upon the CRN repertoire annotation of various oomycete species [53, 55]. HMM searches and manual assignment on AeCRNs and AeCRN-like sets showed that the necrotic DXZ, D5, and D2 kinase domains are among the most widespread C-termini domains in A. euteiches (Additional file 1: ST1k). The DC domain, reported as a putative DNA-binding helix-hairpin-helix (HhH) motif in PsCRN108 from P. sojae , is also largely represented in A. euteiches but not in A. astaci. The DXZ and DN17 domains frequently detected in phytopathogenic oomycetes and the chytrid Batrachochytrium dendobraditis [32, 55, 57] are well represented in A. astaci.
Dynamic changes of A. euteiches transcriptome during infection of M. truncatula roots
To get an insight into the expression and regulation of A. euteiches genes during different life stages, RNA-seq analyses were performed on RNAs isolated from A. euteiches zoospores and mycelium grown in liquid culture as well as in infected M. truncatula roots harvested 1, 3, and 9 days post inoculation (dpi). For the zoospores and the in vitro mycelium libraries, around 50 M reads were obtained, of which 70–77% were mapped in pairs to gene-coding regions (Additional file 3: ST2a). For M. truncatula-infected roots, 46—71 M reads were obtained, of which most could not be mapped to the A. euteiches genome, since a high amount of M. truncatula material was present in the samples. In addition, since less than 1% of the reads of 1 dpi could be mapped to the A. euteiches genome, this time point was excluded from the analysis (data not shown). For time points 3 and 9 dpi, 6–13% of the reads could be mapped in pairs to gene-coding regions (Additional file 3: ST2a). After filtering for lowly expressed genes and data normalization, a multi-dimensional scaling plot revealed a clear separation between expression patterns of the different life stages of A. euteiches, while the biological repeats of each life stage (mycelium, zoospore, 3 dpi, and 9 dpi) clustered together (Additional file 4: Figure S2). This plot suggested that different gene subsets are expressed during M. truncatula infection.
For the 16,786 (85.8%) genes that were expressed, differential expression between the different life stages was determined. Around 36–37% of the genes were differentially expressed between zoospores and the other life stages, while only 10–11% of the genes showed differential expression between the in vitro grown A. euteiches mycelium and the infected root material (Additional file 3: ST2b). This could reflect a transcriptome remodeling during host infection. To further investigate in what processes the differentially expressed genes are involved, a Gene Ontology (GO) enrichment analysis was performed. During infection, genes that are upregulated between 3 and 9 dpi are mainly involved in “carbohydrate metabolic (GO:0005975)” and “proteolysis (GO:0006508)” processes, while downregulated genes belonged mainly to transport processes (GO:0055085, GO:0030001). The opposite situation is detected when comparing zoospores with in vitro grown mycelium (Additional file 3: ST2c–e), suggesting important physiological differences between zoospores and mycelium life stages of A. euteiches as reported for Phytophthora infestans .
We then investigated the expression pattern of the A. euteiches secretome with a focus on CAZymes, proteases and protease inhibitors, CBMs, and CRNs/CRN-like effectors (Fig. 5a). When zoospores and mycelium stages are compared, most groups of genes encoding secreted proteins show a similar percentage of differential expression with the exception of the “polysaccharide lyase (PL)” category, which displays the highest percentage of differentially expressed genes. In contrast, during infection (3 and 9 dpi) around three times as much of the genes coding for secreted proteins are upregulated, with proteases, glycoside hydrolases (GH), and PL (86% and 76%) showing the highest percentages of upregulated genes (Fig. 5a). Less than 2% of CRN genes are upregulated at 3 and 9 dpi, while 13% are upregulated in zoospores as compared to in vitro grown mycelium, suggesting that a subset of AeCRN is present at the early stage of Medicago infection (Fig. 5a).
Heatmaps to visualize expression level of genes rather than differential expression show that, for GH, several groups can be identified (Fig. 5b, Additional file 3: ST2f) with groups II, IV, and VI containing genes that are upregulated during infection compared to mycelium. These groups contain a large number of genes coding for proteins with predicted cellulase (GH5, GH6, and GH7) and polygalacturonase (GH28) activity. They also include numerous hemicellulase genes (GH10, GH11) and one secreted arabinofuranosidase (GH62). Most of the polysaccharide lyases (PL) are also strongly induced during infection (Fig. 5d, Additional file 3: ST2g). Secreted PL1 are detected in group II or III, where gene expression is less abundant in mycelium as compared to zoospore and infection stages. This pattern is similar to the one observed in Phytophthora capsici, where the PL1 family in combination with PL16 and PL20 (absent in the Ae genome) account for nearly all of the contribution of the 22 PL genes to Phytophthora virulence . Most of the carbohydrate esterases (CE) showed a consistent low to medium expression in the different life stages (Additional file 5: Figure S3A, Additional file 3: ST2i). For the CBM-containing proteins without a predicted catalytic site mostly composed of CBM1 domains (cellulose-binding), several groups of genes can be identified with group II being strongly induced and expressed during infection while groups III and V are specifically upregulated or downregulated respectively in the zoospore stage (Additional file 5: Figure S3B, Additional file 3: ST2j). This expression pattern suggested that A. euteiches-secreted CBM1 domains may contribute either to A. euteiches virulence or microorganism development, as reported for the CBM1-containing protein CBEL from P. parastica that mediates adhesion to cellulosic substrates and contributes to Phytophthora cell wall architecture [60, 61].
For the secreted peptidases, several groups can also be identified (Fig. 5b, Additional file 3: ST2h). Many peptidases show constitutive expression during all life stages (groups I, VII, and VIII), while some are induced in zoospores (groups V and VI) or during M. truncatula root infection (groups II, III, IV, and IX). These last groups contain a high number of serine proteases (S1 family) that could be considered as candidates for pathogenicity factors. Indeed, serine proteases have been characterized as the major extracellular proteolytic enzymes secreted by Phytophthora spp. and indispensable for necrosis . Notably, none of the secreted protease inhibitors is differentially expressed during infection. However, most of the secreted protease inhibitors are highly expressed during all life stages of Ae (Additional file 5: Fig. 3C, Additional file 3: ST2k).
Together, these analyses indicate that expression of several families of genes coding secreted proteins are induced in zoospores and during infection of M. truncatula roots compared to saprophytically grown mycelium. This supports an important role in the pathogenesis of cell wall-degrading enzymes specifically found in A. euteiches and not in A. astaci targeting plant cell wall polysaccharides (pectins, hemicellulose).
Identification and functional characterization of small secreted proteins in A. euteiches (AeSSPs)
As reported above, 40% of A. euteiches-specific genes that encoded putative secreted proteins did not display a functional annotation and are below 300 amino acid residues in size (< 300 genes, Fig. 3b, Additional file 1: ST1i). Consequently, putative AeSSPs represented 24% of the secretome. Around 5% are organized in clusters (≥ 3 adjacent SSP genes, Additional file 6: ST3a). The clusters are distributed all over the genome and comprise 3 to 9 genes each. The clusters mainly contain AeSSP genes from the same OrthoMCL group, indicating that they might have arisen by duplication.
To ascertain the expression of AeSSPs during host infection, we checked the expression data presented above (e.g., mycelium, zoospore, interaction 3 dpi (T3), interaction 9 dpi (T9)). Since we could not exploit RNA-seq data generated at 1 day post infection, we included data from our previous set of cDNA generated from mycelium (MYC library) and A. euteiches grown in contact with roots during 1 or 2 days (INT T1 + T2,  (Additional file 6: ST3a). As shown in Fig. 6a, a large set of AeSSPs are upregulated in zoospores, in mycelium in contact with plant roots, or during infection. In total, 120 AeSSPs have been found to be induced in at least one condition.
We selected in AphanoDB v2.0, the AeSSP cluster from scaffold2_1449549_1653277 (Fig. 6b) and scaffold2_134062_1449527 to evaluate the effector function of AeSSP genes. These clusters are unique, since in addition to their expression at the beginning of the interaction, they contain AeSSPs with a predicted NLS (AeSSP1251 and AeSSP1254, AeSSP1256), suggesting that these proteins could be translocated to the host cell to target nuclear components. To investigate if the AeSSPs can play a role in virulence, Nicotiana benthamiana leaves transiently expressing the AeSSPs were infected with P. capsici (Fig. 7a). In the assay, one side of the leaf was infiltrated with Agrobacterium tumefaciens containing an AeSSP:green fluorescent protein (GFP) fusion construct (AeSSP1250, AeSSP1254, AeSSP1255, and AeSSP1256), while on the other side of the leaf GFP was delivered alone. Quantification of the lesion surfaces at 3 dpi showed that AeSSP1256 significantly enhanced the susceptibility of Nicotiana compared to GFP alone, while no differences were observed for AeSSP1250, AeSSP1254, and AeSSP1255 (Fig. 7b). This indicates that AeSSP1256 may efficiently contribute to oomycete pathogenicity.
To investigate the subcellular localization of AeSSPs, chimeric genes encoding AeSSPs independently with (full-length version) or without (matured form) their own signal peptide under the control of the CaMV35S promoter were generated in fusion with a GFP marker. As shown in Fig. 7c, both versions of AeSSP1250 are detected either in the nucleus or the cytoplasm of Nicotiana cells, while AeSSP1255 versions are mainly cytoplasmic. The NLS-containing AeSSP1254 and AeSSP1256 are nuclear localized without any labeling of the nucleolus. The pattern of fluorescence is similar with the full version or the matured form of AeSSP1254 and AeSSP1256. We noticed that the expression of both versions of AeSSP1256 led to the detection of fluorescence as a ring surrounding the nucleolus in addition to filament-like structures. A closer view of AeSSP1256 in fusion with its own signal peptide confirmed the accumulation around the nucleolus and also as filament-like structures (Fig. 7d). This unexpected nuclear localization is reminiscent of the one observed with the cytoplasmic effector CRN79_188 from Phytophthora capsici upon its transient expression in Nicotiana leaf .
Since in all cases the presence of the native signal peptide did not alter the localization of the corresponding AeSSP, we tested whether the AeSSP entered the plant secretory pathway. A functional endoplasmic reticulum (ER) retention signal (KDEL motif) was added at the C-terminal of the full-length form of AeSSP1256 to trap the protein in the ER. In Nicotiana leaves expressing a +SP:AeSSP1256:GFP:KDEL construct, an extensive network throughout the cytoplasm was labeled as observed with an ER-marker  (Fig. 8a). In contrast, expression of the matured form –SP:AeSSP1256:GFP:KDEL produced fluorescence in the nucleus. This indicates that +SP:AeSSP1256 is delivered to the ER secretion pathway and that the predicted signal peptide of AeSSP1256 is active in N. benthamiana. To confirm AeSSP1256 trafficking in Nicotiana cells, we used the Brefeldin A (BFA) drug. BFA inhibits transport from the ER to the Golgi and causes the formation of membranous islands throughout the cell . As expected, the localization of the full-length version of AeSSP1256 is disrupted by BFA, while the treatment did not affect the localization of the matured form. Thus, only +SP:AeSSP1256 transits through the Golgi in Nicotiana (Fig. 8b). Finally, we checked the effect and subcellular localization of AeSSP1256 in Medicago composite plants expressing the +SP:AeSSP1256:GFP construct. The expression of AeSSP1256 did not provoke any necrosis or alteration of root system development (data not shown). As illustrated in Fig. 8c, AeSSP1256 in fusion with its own signal peptide is nuclear localized 15 days after root transformation by A. rhizogenes, as observed previously in Nicotiana leaf.
In this study, we used a combination of genomics and transcriptomics approaches to identify gene repertoires involved in the adaptation of Aphanomyces pathogens to plant or animal hosts. This work provides the first genomics insight into the Aphanomyces genus, allowing one to precisely locate the phylogenetic position of Aphanomyces spp. within the oomycete lineage and providing clues for understanding animal and plant pathogenic evolution among oomycetes. Phylogenetic analysis suggested that specialization of Aphanomyces species to plant or animal hosts is an ancient event (more than 50 Mya), and analysis of gene contents revealed that phytopathogenic A. euteiches possesses a large and diverse repertoire of genes coding cell wall-degrading enzymes (CWDEs) which target plant cell wall polysaccharides, absent in A. astaci. In turn, A. astaci shows an expansion of protease genes, and during evolution it acquired genes coding proteins predicted to target chitin, the main component of the crayfish shell. These results indicate that host specialization is correlated with the presence of secretomes, which has been shaped by various evolutionary events including gene acquisition, gene losses, and gene amplification. Interestingly, transcriptome analyses showed that most of the genes coding enzymes able to degrade plant cell wall polysaccharides such as pectins and hemicellulases are strongly expressed during pathogenesis, strengthening their role in pathogenesis. Analyses performed on distantly related oomycete species, notably Phytophthora and Saprolegnia, gave similar results [31, 43, 66, 67], pointing out the key roles of degrading enzymes for pathogenesis and adaptation to specific ecological niches.
These results also raised the question of the origin of CWDE sequences in the Aphanomyces genus. Most of the secreted A. euteiches CWDEs acting against plant wall polysaccharides, such as pectinases and hemicellulases, show strong similarities to Phytophthora enzymes. Some of these genes have been suggested to be acquired by lateral gene transfer from a fungal donor, notably in Phytophthora [47, 48]. A striking example is a pectate lyase gene . Interestingly, several paralogs of this gene were found in A. euteiches but not in A.astaci, suggesting that this gene was acquired before the divergence between the Saprolegnian and Peronosporalean lineages and was lost by A. astaci during adaptation to animal hosts. Availability of more oomycete genomes is needed to further define the origin of oomycete CWDEs involved in interactions with plants; however, our results indicate that acquisitions of plant CWDEs in oomycetes occur early during oomycete evolution.
By combining comparative genomics and transcriptomics, we identified a large set of SSPs which may represent new oomycete putative effectors. Among the 296 AeSSP genes, 120 are induced during at least one infection condition analyzed. Functional studies on four of these candidates revealed that these proteins are localized in various subcellular compartments, and one of them enhanced plant susceptibility to oomycete infection. Large repertoires of SSPs have been evidenced upon genome annotation of fungi interacting with plants [68, 69], animals , and insects . Also unexpected was the large repertoire of SSPs predicted in mycorrhizal fungi [72, 73]. SSPs were also recently reported in bacteria such as the plant pathogen Pseudomonas syringae , but up to now no SSPs were described in oomycete genomes. Comparative fungal genomics studies showed evidences of rapid evolution of SSPs in related pathogens with different host ranges [70, 75]. A survey on 136 fungal secretomes archived in the Fungal Secretome Database (FSD) established that microorganisms living in close interaction with their hosts (symbiotic organisms, biotrophs) have commonly higher proportions of species-specific SSPs than necrotrophs or hemibiotrophs . SSPs are frequently lineage specific and associated with host adaptation/specialization and are considered as putative effectors. Therefore, our work makes SSPs new candidates to be crucial players in oomycete adaptation to new hosts, particularly in species lacking the large family of RxLR effectors found in the Peronosporalean lineage.
In this paper, we have analyzed the genomes of two close Aphanomyces species with the aim of identifying genetic determinants involved in host adaptation. However, the divergence between A. euteiches and A. astaci is still substantial, and these two species occupy two distinct ecological niches (soil vs aquatic environments). Thus, it is expected that many of the differences observed do not apply exclusively to pathogenesis but also to adaptation to these ecological niches. Importantly, it has been suggested that effectors can also play a role in competition or co-operation with other microorganisms occupying the same ecological niche. Sequencing of more Aphanomyces species is underway and will certainly help to refine the set of genes involved in host pathogenesis.
Aphanomyces spp. isolates and DNA preparation
The mycelia of A. euteiches isolate ATCC201684, A. astaci (Genotype E, strain Li07, provided by A. Petrusek, Czech Republic), and A. stellatus isolate CBS 578.67 were grown for 4 days in liquid YG medium (2.5% yeast extract, 5% glucose) at 23 °C in the dark. Biological samples were frozen in liquid nitrogen and the DNA extracted as reported in .
Aphanomyces euteiches genome sequencing and assembly
CNS-Genoscope, Evry, France performed the sequencing and assemblies of the A. euteiches ATCC201684 genome, using a combination of Sanger and Illumina technologies. For 454 libraries of A. euteiches, DNA was extracted and fragmented to a range of 5–10 kb or around 20 kb using a HydroShear instrument. Fragments were end-repaired, and the extremities were ligated to 454 circularization adapters. Fragments were size selected respectively to 8 kb or 20 kb through regular gel electrophoresis and circularized using Cre-Lox recombination. Circular DNA was fragmented again by nebulization or using the Covaris E210 instrument (Covaris, Inc., Woburn, MA, USA). Fragments were end-repaired and ligated with library adapters. Mate-pair libraries were amplified and purified. Single-stranded libraries were isolated, then bound to capture beads and amplified in an oil emulsion (emPCR). The libraries were then loaded on a picotiter plate and sequenced using a GS FLX sequencer according to the manufacturer’s protocol. We also prepared 454 single-end read libraries according to the Roche standard procedure using RL (GS FLX Titanium Rapid Library Preparation Kit, Roche Diagnostics, Indianapolis, IN, USA). The libraries were sequenced using a 1/4 Pico Titer Plate on a 454 GS FLX instrument with Titanium chemistry (Roche Diagnostics). For the Illumina library, 2 μg of genomic DNA (gDNA) of A. euteiches was sheared to a 150–700-bp range using the Covaris E210 instrument (Covaris, Inc.). Sheared DNA was used for Illumina library preparation according to a semiautomatized protocol. Briefly, end repair, A tailing, and Illumina-compatible adapter (BiooScientific) ligation were performed using the SPRIworks Library Preparation System and SPRI TE instrument (Beckman Coulter, Simsbury, CT, USA) according to the manufacturer’s protocol. A 300–600-bp size selection was applied in order to recover most of the fragments. The DNA fragments were amplified by 10 cycles of PCR using Kapa HiFi HotStart DNA Polymerase (Life Technologies) and Illumina adapter-specific primers. The libraries were purified with 0.8× AMPure XP beads (Beckman Coulter). After library profile analysis with an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA) and qPCR quantification, the library was sequenced using 100 base-length read chemistry in paired-end mode on the Illumina HiSeq2000 (Illumina, Canton, MA, USA). All reads were assembled with Newbler version vMapAsmResearch-04/19/2010-patch-08/17/2010. The 6319 contigs were linked into 349 scaffolds. The sequence quality of scaffolds from the Newbler assembly was improved as described previously  by automatic error corrections with Solexa/Illumina reads (146-fold genome coverage), which have a different bias in error type compared with 454 reads. Finally, the assembly was gap closed using Illumina data and GapCloser . Putative misassemblies were identified and corrected using default parameters of REAPR (version 1.0.17) . Assembly completeness was estimated using BUSCO v3  based on a set of common fungal genes (F) or Alveolata/Stramenopiles genes (AS), aka benchmarking universal single-copy orthologs (BUSCOs). We found 83.1% (F)/78.6% (AS) complete BUSCOs, 3.8% (F)/0% (AS) duplicated BUSCOs, and 3.1% (F)/1.7% (AS) fragmented BUSCOs, leading to 13.8% (F)/19.7% (AS) missing BUSCOs in A. euteiches. Statistics for different sequencing technologies performed in this study for the A. euteiches ATCC201684 genome are presented below.
Aphanomyces astaci and Aphanomyces stellatus genome sequencing and assemblies
The GeT-PlaGe core facility, Toulouse, France realized Illumina sequencing of the A. astaci and A. stellatus genomes, and the assemblies were performed by the Genotoul Bioinformatic Platform, Toulouse, France. DNA-seq libraries were prepared using an Illumina TruSeq DNA v2 Library Prep Kit following the manufacturer’s instructions. DNA was fragmented by sonication, size selection was performed using E-Gel 0.8% (Thermo Fisher Scientific, Waltham, MA, USA), and the adapters were ligated. Ten PCR cycles were applied to amplify the library before final purification with Agencourt AMPure XP beads (Beckman Coulter). Library quality was assessed using an Agilent Bioanalyzer, and the libraries were quantified by qPCR using the Kapa Library Quantification Kit. DNA-seq experiments were performed on an Illumina HiSeq2000 Sequencer using a paired-end read length of 2 × 100 pb with the HiSeq v3 Reagent Kit. The raw reads have been quality checked and stored in NG6 . FastQC (version 0.10.0) (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used to produce quality metrics and bwa aln (version 0.6.1-r104) to search Escherichia coli-, yeast-, and phage-contaminated reads. The reads of A. astaci and A. stellatus were assembled using MaSuRCa, version 2.0 , and the assembly metrics were calculated using the assemblathon_stats.pl script .
Structural and functional annotation of Aphanomyces genomes
The assembled data of Aphanomyces were annotated with Augustus v2.757  trained with the assembled RNA-seq transcript generated in this study and the publicly available ESTs previously obtained from A. euteiches  using autoAug.pl  and PASA . RNA-seq reads were first assembled with trinityrnaseq_r2012–10-05 . The accuracy of the prediction was evaluated by mapping the RNA-seq reads to the genomes using bwa . Genes were annotated using BLASTP against the RefSeq database . For protein family classification, InterProScan  and the Pfam protein domain database  were used. Gene ontologies were classified based on InterProScan annotation IDs. Kyoto Encyclopedia of Genes and Genomes (KEGG) and Enzyme Commission (EC number) data were obtained with KEGG Automatic Annotation Server (KAAS) predicted protein sequences  and were also mapped to the Clusters of Orthologous Groups (COG) classification system . Predictions of signal peptides were performed using SignalP 4.1 . Carbohydrate-Active enZymes (CAZymes) in the protein models were predicted using the dbCAN pipeline . Orthology comparisons between the predicted Aphanomyces genomes and protein datasets from nine deeply sequenced oomycete genomes were performed via OrthoMCLcompanion  using standard parameters. Interspersed repeat sequences were de novo identified in the genomes of A. euteiches, A. stellatus, and A. astaci using the RepeatModeler 1.0.8 pipeline . Short (< 1000 bp) Class II transposons that were devoid of recognizable open reading frames were classified as miniature inverted-repeat transposable elements (MITEs). When possible, we assigned them to a superfamily based on the nature of their target site duplication (TSD): piggybac (TTAA motif), Tc1-mariner (TA motif), and PIF-Harbinger (TTA or TAA motif) . The remaining RepeatModeler unclassified consensus sequences (producing no BLASTX hits and having no distinct boundaries) were classified as “unclear”. A library of repeated sequences was then constructed for each species and used to mask each genome with RepeatMasker 4.0.5 . To characterize the global evolutionary dynamics of Aphanomyces TEs, we plotted the frequency distribution of the percent divergence between each consensus and all their cognate copies. To assess the extent to which the TEs are related to each other and to other known TEs, we used each library as a query to perform BLASTN searches on the other libraries and on the Repbase  library of TEs (e-value cut-off = 10–20). Gene annotations were visualized in Apollo . All the data generated in this work were incorporated in an updated version (version 2.0) of AphanoDB [37, 100].
Preparation of RNA material
Seeds of M. truncatula Gaertn. F83005.5 were scarified, sterilized, and in vitro cultured as previously described [101, 102]. Roots of 10-day-old plants were inoculated with A. euteiches ATCC201684 zoospores as described in . Infected roots were harvested 1 dpi, 3 dpi, and 9 dpi. In parallel, flasks with 50 mL PG medium (20.0 g/L animal peptone; 5.0 g/L glucose) were inoculated with 1000 zoospores of A. euteiches. Mycelium samples were collected from the PG medium 1 dpi, 3 dpi, and 9 dpi. Zoospores were collected by centrifugation at 14.000× g at 4 °C for 45 min. For all samples three biological replicates were collected. RNA was isolated using the RNA Plant Mini Kit (QIAGEN, Hilden, Germany) according to the manufacturer’s protocol, except for the DNAse treatment (RNase-free DNase, QIAGEN), which was done on the column for 20 min. The RNA concentration was determined, and the quality of the RNA was verified using a Fragment Analyzer (Advanced Analytical, Ankeny, IA, USA).
Illumina sequencing of RNA samples generated from A. euteiches mycelium (MY) and mycelium grown in contact with M. truncatula roots (INT) , was performed by CNS-Genoscope, Evry, France. Starting with 2 μg of total RNA, double-stranded cDNA was first generated using the TruSeq RNA sample prep kit (Illumina, Canton, MA, USA), and then paired-end libraries were prepared using NEBNext Sample Reagent Set (New England Biolabs, Ipswich, MA, USA). Briefly, the messenger RNAs (mRNAs) were polyA-selected, chemically fragmented, and converted into single-stranded cDNA using random hexamer priming. The second strand was generated to create double-stranded cDNA. The cDNA was then end-repaired and 3’-adenylated, and Illumina adapters were added. The ligation products were purified and the DNA fragments (> 200 pb) were PCR-amplified using Platinum Pfx DNA Polymerase (Life Technologies) and Illumina adapter-specific primers. After library profile analysis by the Agilent 2100 Bioanalyzer (Agilent Technologies) and qPCR quantification (MxPro, Agilent Technologies), the libraries were sequenced on an Illumina HiSeq2000 instrument using 101 base-length read chemistry in a paired-end mode. Statistics for INT and MY libraries are reported below.
In this study new RNA samples were generated from A. euteiches mycelium, zoospores, or infected M. truncatula roots (1 dpi, 3 dpi, and 9 dpi) according to [103, 104]. RNA-seq libraries were prepared according to Illumina’s protocols using the Illumina TruSeq Stranded mRNA Sample Prep Kit to analyze mRNA, at the GeT-PlaGe core facility, Toulouse. Briefly, mRNA was selected using poly-T beads, and cDNA was generated using random hexamer priming, and adapters were then added. Ten cycles of PCR were applied to amplify the libraries. Library quality was assessed using an Agilent Bioanalyzer, and the libraries were quantified by qPCR using the Kapa Library Quantification Kit. RNA-seq experiments have been performed on an Illumina HiSeq2500 using a paired-end read length of 2 × 100 pb with the Illumina TruSeq SBS Sequencing Kits v3. Statistics are presented in the manuscript.
RNA-sequencing data analysis
Reads obtained from the different life stages of A. euteiches (zoospore, mycelium) or infected roots (1 dpi, 3 dpi, and 9 dpi), were mapped to the A. euteiches genome with CLC Genomics Workbench 10.1.1 (QIAGEN). Predefined settings were used except for the similarity fraction, which was set at 0.98, and the read counts per gene (paired reads count as one) were exported. Using the R package edgeR , the data were filtered for genes with low counts across all libraries (counts per million (cpm) > 2 in at least three libraries) and then trimmed mean of M (TMM) normalized , the tagwise dispersion was estimated, and the differential expression was calculated. Using the generalized linear model likelihood ratio test, p values were obtained, and multiplicity correction was performed by applying the Benjamini-Hochberg method [107, 108]. The multi-dimensional scaling plot was created using the 500 genes with the highest dispersion over all libraries. The enrichment of GO terms in up- or downregulated genes was tested using a Fisher’s exact test with the classical algorithm in the topGO R package . Heatmaps were created using the gplots R package , and clustering was performed according to the complete linkage method with Euclidean distance measure.
Phylogenetic and divergence time analysis of Aphanomyces species
To assess the phylogenetic position and divergence times of A. astaci, A. euteiches, and A. stellatus within oomycetes, we followed the methodology of Matari and Blair , who produced a robust timetree of oomycetes. We used the amino acid alignments of 40 genes that were assembled for 17 oomycetes and one outgroup species (Tetrahymena thermophila), to which we added the sequences of the three Aphanomyces species. These 40 datasets were selected among 70 initial alignments of genes involved in regulation of gene expression, because there was strong evidence supporting their orthology relationships and they contained minimal missing data . Amino acid sequences were retrieved from the proteomes of the three Aphanomyces species using sequences of the three Saprolegniales (Thraustotheca clavata, Saprolegnia parasitica, and Achlya hypogyna) as queries in BLASTP searches. Most genes could be identified for the three Aphanomyces species, except six in A. astaci (HMG-CBF-BFY, MnmA, p15, Ssl1, TAF6, TAF12) and one in A. stellatus (TFIIB). The 40 alignments were then concatenated and subjected to a maximum likelihood phylogenetic analysis in PhyML v3.0  using the WAG model of amino acid substitutions as in . The robustness of the tree was evaluated by performing 1000 bootstrap replicates. Divergence times were conducted in BEAST v1.8.2  using the same parameters as in . Briefly, we performed 50 million generations run using the PhyML-generated tree as a guide tree and treating each of the 40 datasets as a separate partition with the WAG substitution model, a Yule speciation process (uniform distribution; 0–100; initial value 0.01), and the random local clock model, which was shown to best fit the data by . The calibration strategy follows that of  and consists of three calibration points modeled with a gamma prior distribution: (1) diatom node: 5–95% quantiles = 74–100 Myrs, (2) diatom + Ectocarpus node: 5–95% quantiles = 176–202 Myrs, and (3) oomycetes + ochrophytes node: (5–95% quantiles = 418–550 Myrs). The root age was modeled using a uniform prior distribution (408–1750 Myrs; initial value 635). We used Tracer v1.6  to visualize convergence and determine the burn-in and FigTree v1.4  to generate the dated tree of oomycetes.
Construction of plasmid vectors and Agrobacterium-mediated transformations
The AeSSPs sequences were amplified by PCR from A. euteiches gDNA with specific primers (Additional file 6: ST3b). The CACC cloning site was added to each forward primer. PCR products were purified using the PCR Clean-Up Kit (Promega, Madison, WI, USA) and introduced in the pENTRY-D-TOPO vector (pENTR/D-TOPO Cloning Kit, Invitrogen). Positive clones were introduced in a pK7FWG2 vector (Invitrogen) or pAMpAT/YFP vector. After sequencing, positive clones were introduced in A. tumefaciens GV3101 and A. rhizogenes ARqua1. For the KDEL fusion, the GFP construct was amplified from the obtained pK7FWG2 vector by PCR using primers that introduced a C-terminal SEKDEL sequence (Additional file 6: ST3b). Cloning was performed as previously reported using a pK2GW7 vector (Invitrogen). For leaf infiltration, A. tumefaciens GV3101-transformed strains were syringe-infiltrated as described in . For M. truncatula roots transformation, A. rhizogenes ARqua1 strains were used and confocal imaging was performed at 28 dpi, as described in .
Brefeldin A treatment and P. capsici infection assay
Brefeldin A (BFA, Sigma-Aldrich, St. Louis, MO, USA) treatment was performed 24 h after agroinfiltration of N. benthamiana. Leaf discs were vacuum infiltrated with TBS1X (50 mM Tris-HCl; 150 mM NaCl pH 7.6) and 15 μg/mL BFA for 30 min, and incubated for 24 h at room temperature. Leaf discs were washed in TBS1X before confocal imaging. For the infection assay, Phytophthora capsici LT3112 was grown on V8 agar plates for 7 days at 22 °C. Zoospore preparation and inoculation on agroinfiltrated N. benthamiana leaf and symptom measurement were performed as reported in .
Imaging was performed on Leica DM6B-Z-CS or Leica AOBS TCS SP2 SE laser scanning confocal microscopes. Excitation wavelengths and emission filters were 488 (GFP) or 514 nm (YFP) long-pass. Images were acquired with a 40× water immersion lens or a 25× Fluotar Visir water objective, and corresponded to Z projections of scanned tissues. Image processing was performed using ImageJ software including three-dimensional reconstruction to compute projections of serial confocal sections.
Cell wall-degrading enzyme
Small Secreted Protein
Beakes GW, Glockling SL, Sekimoto S. The evolutionary phylogeny of the oomycete “fungi”. Protoplasma. 2011;249(1):3–19.
Raffaele S, Kamoun S. Genome evolution in filamentous plant pathogens: why bigger can be better. Nat Rev Microbiol. 2012;10(6):417–30.
Judelson HS. Dynamics and innovations within oomycete genomes: insights into biology, pathology, and evolution. Eukaryot Cell. 2012;11(11):1304–12.
Dong S, Raffaele S, Kamoun S. The two-speed genomes of filamentous pathogens: waltz with plants. Curr Opin Genet Dev. 2015;35:57–65.
Raffaele S, Farrer RA, Cano LM, Studholme DJ, MacLean D, Thines M, Jiang RH, Zody MC, Kunjeti SG, Donofrio NM, et al. Genome evolution following host jumps in the Irish potato famine pathogen lineage. Science. 2010;330:1540–3.
Dong S, Stam R, Cano LM, Song J, Sklenar J, Yoshida K, Bozkurt TO, Oliva R, Liu Z, Tian M, et al. Effector specialization in a lineage of the Irish potato famine pathogen. Science. 2014;343(6170):552–5.
Amaro TM, Thilliez GJ, Motion GB, Huitema E. A perspective on CRN proteins in the genomics age: evolution, classification, delivery and function revisited. Front Plant Sci. 2017;8:99.
Whisson SC, Boevink PC, Wang S, Birch PR. The cell biology of late blight disease. Curr Opin Microbiol. 2016;34:127–35.
Schornack S, van Damme M, Bozkurt TO, Cano LM, Smoker M, Thines M, Gaulin E, Kamoun S, Huitema E. Ancient class of translocated oomycete effectors targets the host nucleus. Proc Natl Acad Sci U S A. 2010;107(40):17421–6.
Engelhardt S, Boevink PC, Armstrong MR, Ramos MB, Hein I, Birch PR. Relocalization of late blight resistance protein R3a to endosomal compartments is associated with effector recognition and required for the immune response. Plant Cell. 2012;24(12):5142–58.
Du Y, Berg J, Govers F, Bouwmeester K. Immune activation mediated by the late blight resistance protein R1 requires nuclear localization of R1 and the effector AVR1. New Phytol. 2015;207(3):735–47.
Hein I, Gilroy EM, Armstrong MR, Birch PR. The zig-zag-zig in oomycete-plant interactions. Mol Plant Pathol. 2009;10(4):547–62.
Dieguez-Uribeondo J, Garcia MA, Cerenius L, Kozubikova E, Ballesteros I, Windels C, Weiland J, Kator H, Soderhall K, Martin MP. Phylogenetic relationships among plant and animal parasites, and saprotrophs in Aphanomyces (Oomycetes). Fungal Genet Biol. 2009;46(5):365–76.
Gaulin E, Jacquet C, Bottin A, Dumas B. Root rot disease of legumes caused by Aphanomyces euteiches. Mol Plant Pathol. 2007;8(5):539–48.
Jones FR, Drechsler C. Root rot of peas in the United States caused by Aphanomyces euteiches. J Agric Res. 1925;30:293–325.
Gaulin E, Bottin A, Jacquet C, Dumas B. Aphanomyces euteiches and legumes. In: Lamour K, Kamoun S, e, editors. Oomycetes genetics and genomics: diversity, interactions, and research tools. Hoboken: Wiley; 2009.
Malvick D, Grau C. Characteristics and frequency of Aphanomyces euteiches races 1 and 2 associated with alfalfa in the Midwestern United States. Plant Dis. 2001;85:740–4.
Wicker E, Rouxel F, Hullé M. Pathogenic characteristics and frequency of Aphanomyces euteiches from pea in France. Plant Pathol. 2001;50:433–42.
Scott W. A monograph of the genus Aphanomyces. Virginia Agr Exp Sta Tech Bull. 1961;151:1–95.
Bonhomme M, André O, Badis Y, Ronfort J, Burgarella C, Chantret N, Prosperi JM, Briskine R, Mudge J, Debéllé F, et al. High-density genome-wide association mapping implicates an F-box encoding gene in Medicago truncatula resistance to Aphanomyces euteiches. New Phytol. 2014;201(4):1328–42.
Pilet-Nayel ML, Muehlbauer FJ, McGee RJ, Kraft JM, Baranger A, Coyne CJ. Quantitative trait loci for partial resistance to Aphanomyces root rot in pea. Theor Appl Genet. 2002;106:28–39.
Svoboda J, Mrugala A, Kozubikova-Balcarova E, Kouba A, Dieguez-Uribeondo J, Petrusek A. Resistance to the crayfish plague pathogen, Aphanomyces astaci, in two freshwater shrimps. J Invertebr Pathol. 2014;121:97–104.
Svoboda J, Mrugala A, Kozubikova-Balcarova E, Petrusek A. Hosts and transmission of the crayfish plague pathogen Aphanomyces astaci: a review. J Fish Dis. 2017;40:127–40.
Souty-Grosset C, Holdich DM, Noel PY, Reynolds JD, Haffner P. (eds) Atlas of crayfish in Europe. Muséum National d'Histoire Naturelle, Paris. 2006
Dieguez-Uribeondo J, Huang TS, Cerenius L, Soderhall K. Physiological adaptation of an Aphanomyces astaci strain isolated from the freshwater crayfish Procambarus clarkii. Mycol Res. 1995;99:574–8.
Becking T, Mrugala A, Delaunay C, Svoboda J, Raimond M, Viljamaa-Dirks S, Petrusek A, Grandjean F, Braquart-Varnier C. Effect of experimental exposure to differently virulent Aphanomyces astaci strains on the immune response of the noble crayfish Astacus astacus. J Invertebr Pathol. 2015;132:115–24.
Global Invasive Species Database. http://www.iucngisd.org/gisd/100_worst.php.
Gaulin E, Madoui MA, Bottin A, Jacquet C, Mathé C, Couloux A, Wincker P, Dumas B. Transcriptome of Aphanomyces euteiches: new oomycete putative pathogenicity factors and metabolic pathways. PLoS One. 2008;3(3):e1723.
Ramirez-Garces D, Camborde L, Pel MJ, Jauneau A, Martinez Y, Neant I, Leclerc C, Moreau M, Dumas B, Gaulin E. CRN13 candidate effectors from plant and animal eukaryotic pathogens are DNA-binding proteins which trigger host DNA damage response. New Phytol. 2016;210(2):602–17.
Camborde L, Jauneau A, Brière C, Deslandes L, Dumas B, Gaulin E. Detection of nucleic acid-protein interactions in plant leaves using fluoresecnce lifetime time imaging microscopy. Nature Protocol. 2017;12(9):1933–50.
Jiang RH, de Bruijn I, Haas BJ, Belmonte R, Lobach L, Christie J, van den Ackerveken G, Bottin A, Bulone V, Diaz-Moreno SM, et al. Distinctive expansion of potential virulence genes in the genome of the oomycete fish pathogen Saprolegnia parasitica. PLoS Genet. 2013;9(6):e1003272.
Haas B, Kamoun S, Zody MC, Jiang RH, Handsaker RE, Cano LM, Grabherr M, Kodira CD, Raffaele S, Torto-Alalibo T, et al. Genome sequence and analysis of the Irish potato famine pathogen Phytophthora infestans. Nature. 2009;461(7262):393–8.
Kemen E, Gardiner A, Schultz-Larsen T, Kemen AC, Balmuth AL, Robert-Seilaniantz A, Bailey K, Holub E, Studholme DJ, Maclean D, et al. Gene gain and loss during evolution of obligate parasitism in the white rust pathogen of Arabidopsis thaliana. PLoS Biol. 2011;9:e1001094.
Sharma R, Xia X, Cano LM, Evangelisti E, Kemen E, Judelson H, Oome S, Sambles C, van den Hoogen DJ, Kitner M, et al. Genome analyses of the sunflower pathogen Plasmopara halstedii provide insights into effector evolution in downy mildews and Phytophthora. BMC Genomics. 2015;16(1):741.
Waterhouse RM, Seppey M, Simao FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. BUSCO applications from quality assessments to gene prediction and phylogenomics. Mol Biol Evol. 2017;35(3):543–8.
Yin L, An Y, Qu J, Li X, Zhang Y, Dry I, Wu H, Lu J. Genome sequence of Plasmopara viticola and insight into the pathogenic mechanism. Sci Rep. 2017;7:46553.
Madoui M, Gaulin E, Mathe C, Clemente H, Couloux A, Wincker P, Dumas B. AphanoDB: a genomic resource for Aphanomyces pathogens. BMC Genomics. 2007;8:471.
Matari NH, Blair JE. A multilocus timescale for oomycete evolution estimated under three distinct molecular clock models. BMC Evol Biol. 2014;14:101.
Tyler B, Tripathy S, Zhang X, Dehal P, Jiang R, Aerts A, Arredondo F, Baxter L, Bensasson D, Beynon J, et al. Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science. 2006;313(5791):1261–6.
Lévesque CA, Brouwer H, Cano L, Hamilton JP, Holt C, Huitema E, Raffaele S, Robideau GP, Thines M, Win J et al. Genome sequence of the necrotrophic plant pathogen Pythium ultimum reveals original pathogenicity mechanisms and effector repertoire. Genome Biol 2010, 11(7):R73.
Li L, Stoeckert CJ Jr, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13(9):2178–89.
Sperschneider J, Gardiner DM, Dodds PN, Tini F, Covarelli L, Singh KB, Manners JM, Taylor JM. EffectorP: predicting fungal effector proteins from secretomes using machine learning. New Phytol. 2016;210(2):743–61.
Brouwer H, Coutinho PM, Henrissat B, de Vries RP. Carbohydrate-related enzymes of important Phytophthora plant pathogens. Fungal Genet Biol. 2014;72:192–200.
Lanver D, Berndt P, Tollot M, Naik V, Vranes M, Warmann T, Munch K, Rossel N, Kahmann R. Plant surface cues prime Ustilago maydis for biotrophic development. PLoS Pathog. 2014;10(7):e1004272.
Yajima W, Liang Y, Kav NN. Gene disruption of an arabinofuranosidase/beta-xylosidase precursor decreases Sclerotinia sclerotiorum virulence on canola tissue. Mol Plant-Microbe Interact. 2009;22(7):783–9.
de Vries RP, Kester HC, Poulsen CH, Benen JA, Visser J. Synergy between enzymes from Aspergillus involved in the degradation of plant cell wall polysaccharides. Carbohydr Res. 2000;327(4):401–10.
Savory F, Leonard G, Richards TA. The role of horizontal gene transfer in the evolution of the oomycetes. PLoS Pathog. 2015;11(5):e1004805.
Richards TA, Soanes DM, Jones MD, Vasieva O, Leonard G, Paszkiewicz K, Foster PG, Hall N, Talbot NJ. Horizontal gene transfer facilitated the evolution of plant parasitic mechanisms in the oomycetes. Proc Natl Acad Sci U S A. 2011;108(37):15258–63.
Larroque M, Barriot R, Bottin A, Barre A, Rougé P, Dumas B, Gaulin E. The unique architecture and function of cellulose-interacting proteins in oomycetes revealed by genomic and structural analyses. BMC Genomics. 2012;13:605.
Garcia-Gonzalez E, Poppinga L, Funfhaus A, Hertlein G, Hedtke K, Jakubowska A, Genersch E. Paenibacillus larvae chitin-degrading protein PlCBP49 is a key virulence factor in American Foulbrood of honey bees. PLoS Pathog. 2014;10(7):e1004284.
Shen D, Liu T, Ye W, Liu L, Liu P, Wu Y, Wang Y, Dou D. Gene duplication and fragment recombination drive functional diversification of a superfamily of cytoplasmic effectors in Phytophthora sojae. PLoS One. 2013;8(7):e70036.
Adhikari BN, Hamilton JP, Zerillo MM, Tisserat N, Lévesque CA, Buell CR. Comparative genomics reveals insight into virulence strategies of plant pathogenic oomycetes. Plos One. 2013;8(10):e75072.
As-sadi F, Carrere S, Gascuel Q, Hourlier T, Rengel D, Le Paslier MC, Bordat A, Boniface MC, Brunel D, Gouzy J, et al. Transcriptomic analysis of the interaction between Helianthus annuus and its obligate parasite Plasmopara halstedii shows single nucleotide polymorphisms in CRN sequences. BMC Genomics. 2011;12:498.
Gaulin E. Effector mediated communication of filamentous plant pathogens with their hosts. Advances Botanical Research. 2017;82:161–85.
Stam R, Jupe J, Howden AJ, Morris JA, Boevink PC, Hedley PE, Huitema E. Identification and characterisation CRN effectors in Phytophthora capsici shows modularity and functional diversity. PLoS One. 2013;8(3):e59517.
Song T, Ma Z, Shen D, Li Q, Li W, Su L, Ye T, Zhang M, Wang Y, Dou D. An oomycete CRN effector reprograms expression of plant HSP genes by targeting their promoters. PLoS Pathog. 2015;11(12):e1005348.
Sun G, Yang Z, Kosch T, Summers K, Huang J. Evidence for acquisition of virulence effectors in pathogenic chytrids. BMC Evol Biol. 2011;11:195.
Ah-Fong AM, Kim KS, Judelson HS. RNA-seq of life stages of the oomycete Phytophthora infestans reveals dynamic changes in metabolic, signal transduction, and pathogenesis genes and a major role for calcium signaling in development. BMC Genomics. 2017;18(1):198.
Fu L, Zhu C, Ding X, Yang X, Morris DP, Tyler BM, Zhang XG. Characterization of cell death-inducing members of the pectate lyase gene family in Phytophthora capsici and their contributions to infection of pepper. Mol Plant-Microbe Interact. 2015;28(7):766–75.
Gaulin E, Drame N, Lafitte C, Torto-Alalibo T, Martinez Y, Ameline-Torregrosa C, Khatib M, Mazarguil H, Villalba-Mateos F, Kamoun S, et al. Cellulose binding domains of a Phytophthora cell wall protein are novel pathogen-associated molecular patterns. Plant Cell. 2006;18(7):1766–77.
Gaulin E, Jauneau A, Villalba F, Rickauer M, Esquerre-Tugaye M, Bottin A. The CBEL glycoprotein of Phytophthora parasitica var. nicotianae is involved in cell wall deposition and adhesion to cellulosic substrates. J Cell Sci. 2002;115(23):4565–75.
Hosseini S, Elfstrand M, Heyman F, Funck Jensen D, Karlsson M. Deciphering common and specific transcriptional immune responses in pea towards the oomycete pathogens Aphanomyces euteiches and Phytophthora pisi. BMC Genomics. 2015;16:627.
Stam R, Howden AJ, Delgado-Cerezo M, Amaro TM MM, Motion GB, Pham J, Huitema E. Characterization of cell death inducing Phytophthora capsici CRN effectors suggests diverse activities in the host nucleus. Front Plant Sci. 2013;4:387.
Nelson BK, Cai X, Nebenfuhr A. A multicolored set of in vivo organelle markers for co-localization studies in Arabidopsis and other plants. Plant J. 2007;51(6):1126–36.
Lam SK, Cai Y, Tse YC, Wang J, Law AH, Pimpl P, Chan HY, Xia J, Jiang L. BFA-induced compartments from the Golgi apparatus and trans-Golgi network/early endosome are distinct in plant cells. Plant J. 2009;60(5):865–81.
Ospina-Giraldo MD, Griffith JG, Laird EW, Mingora C. The CAZyome of Phytophthora spp.: a comprehensive analysis of the gene complement coding for carbohydrate-active enzymes in species of the genus Phytophthora. BMC Genomics. 2010;11:525.
Zerillo MM, Adhikari BN, Hamilton JP, Buell CR, Lévesque CA, Tisserat N. Carbohydrate-active enzymes in Pythium and their role in plant cell wall and storage polysaccharide degradation. PLoS One. 2013;8(9):e72572.
Duplessis S, Cuomo CA, Lin YC, Aerts A, Tisserant E, Veneault-Fourrey C, Joly DL, Hacquard S, Amselem J, Cantarel BL, et al. Obligate biotrophy features unraveled by the genomic analysis of rust fungi. Proc Natl Acad Sci U S A. 2011;108(22):9166–71.
O'Connell RJ, Thon MR, Hacquard S, Amyotte SG, Kleemann J, Torres MF, Damm U, Buiate EA, Epstein L, Alkan N, et al. Lifestyle transitions in plant pathogenic Colletotrichum fungi deciphered by genome and transcriptome analyses. Nat Genet. 2012;44(9):1060–5.
Meerupati T, Andersson KM, Friman E, Kumar D, Tunlid A, Ahren D. Genomic mechanisms accounting for the adaptation to parasitism in nematode-trapping fungi. PLoS Genet. 2013;9(11):e1003909.
Hu X, Xiao G, Zheng P, Shang Y, Su Y, Zhang X, Liu X, Zhan S, St Leger RJ, Wang C. Trajectory and genomic determinants of fungal-pathogen speciation and host adaptation. Proc Natl Acad Sci U S A. 2014;111(47):16796–801.
Martin F, Selosse MA. The Laccaria genome: a symbiont blueprint decoded. New Phytol. 2008;180(2):296–310.
Kamel L, Tang N, Malbreil M, San Clemente H, Le Marquer M, Roux C, Frei Dit Frey N. The comparison of expressed candidate secreted proteins from two arbuscular mycorrhizal fungi unravels common and specific molecular tools to invade different host plants. Front Plant Sci. 2017;8:124.
Shindo T, Kaschani F, Yang F, Kovacs J, Tian F, Kourelis J, Hong TN, Colby T, Shabab M, Chawla R, et al. Screen of non-annotated small secreted proteins of Pseudomonas syringae reveals a virulence factor that inhibits tomato immune proteases. PLoS Pathog. 2016;12(9):e1005874.
Condon BJ, Leng Y, Wu D, Bushley KE, Ohm RA, Otillar R, Martin J, Schackwitz W, Grimwood J, MohdZainudin N, et al. Comparative genome structure, secondary metabolite, and effector coding capacity across Cochliobolus pathogens. PLoS Genet. 2013;9(1):e1003233.
Kim KT, Jeon J, Choi J, Cheong K, Song H, Choi G, Kang S, Lee YH. Kingdom-wide analysis of fungal small secreted proteins (SSPs) reveals their potential role in host association. Front Plant Sci. 2016;7:186.
Aury JM, Cruaud C, Barbe V, Rogier O, Mangenot S, Samson G, Poulain J, Anthouard V, Scarpelli C, Artiguenave F, et al. High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies. BMC Genomics. 2008;9:603.
Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, et al. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience. 2012;1(1):18.
Hunt M, Kikuchi T, Sanders M, Newbold C, Berriman M, Otto TD. REAPR: a universal tool for genome assembly evaluation. Genome Biol. 2013;14(5):R47.
Mariette J, Escudie F, Allias N, Salin G, Noirot C, Thomas S, Klopp C. NG6: integrated next generation sequencing storage and processing environment. BMC Genomics. 2012;13:462.
Zimin AV, Marcais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. The MaSuRCA genome assembler. Bioinformatics. 2013;29(21):2669–77.
Script to calculate basic set of metrics from a genome assembly. http://korflab.ucdavis.edu/datasets/Assemblathon/Assemblathon2/Basic_metrics/assemblathon_stats.pl.
Stanke M, Waack S. Gene prediction with a hidden Markov model and a new intron submodel. Bioinformatics. 2003;19(Suppl 2):ii215–25.
Pipeline for training and running the gene finder AUGUSTUS automatically. https://fossies.org/linux/augustus/scripts/README.autoAug.
Haas BJ, Delcher AL, Mount SM, Wortman JR, Smith RK Jr, Hannick LI, Maiti R, Ronning CM, Rusch DB, Town CD, et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003;31(19):5654–66.
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, Adiconis X, Fan L, Raychowdhury R, Zeng Q, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29(7):644–52.
Li H, Durbin R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010;26(5):589–95.
Altschul S, Madden T, Schaffer A, Zhang J, Zhang Z, Miller W, Lipman D. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25(17):3389–402.
Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236–40.
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, Potter SC, Punta M, Qureshi M, Sangrador-Vegas A, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res. 2016;44(D1):D279–85.
Moriya Y, Itoh M, Okuda S, Yoshizawa A, Kanehisa M. KAAS: an automatic genome annotation and pathway reconstruction server. Nucleic Acids Res. 2007;35:W182–5.
Tatusov RL, Fedorova ND, Jackson JD, Jacobs AR, Kiryutin B, Koonin EV, Krylov DM, Mazumder R, Mekhedov SL, Nikolskaya AN, et al. The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003;4:41.
A server to predict the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms. http://www.cbs.dtu.dk/services/SignalP/.
Yin Y, Mao X, Yang J, Chen X, Mao F, Xu Y. dbCAN: a web resource for automated carbohydrate-active enzyme annotation. Nucleic Acids Res. 2012;40(Web Server issue):W445–51.
A pipeline to analyse genomics data. https://bbric.toulouse.inra.fr/hub/cgi/hub.cgi.
A de-novo repeat family identification and modeling package. http://www.repeatmasker.org/.
Wicker T, Sabot F, Hua-Van A, Bennetzen JL, Capy P, Chalhoub B, Flavell A, Leroy P, Morgante M, Panaud O, et al. A unified classification system for eukaryotic transposable elements. Nat Rev Genet. 2007;8(12):973–82.
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005;110(1–4):462–7.
Lewis SE, Searle SM, Harris N, Gibson M, Lyer V, Richter J, Wiel C, Bayraktaroglu L, Birney E, Crosby MA, et al. Apollo: a sequence annotation editor. Genome Biol. 2002;3(12):Research0082.
A genomic database dedicated to Aphanomyces genus. http://www.polebio.lrsv.ups-tlse.fr/aphanoDB/.
Boisson-Dernier A, Chabaud M, Garcia F, Becard G, Rosenberg C, Barker DG. Agrobacterium rhizogenes-transformed roots of Medicago truncatula for the study of nitrogen-fixing and endomycorrhizal symbiotic associations. Mol Plant-Microbe Interact. 2001;14(6):695–700.
Djebali N, Jauneau A, Ameline-Torregrosa C, Chardon F, Jaulneau V, Mathe C, Bottin A, Cazaux M, Pilet-Nayel ML, Baranger A, et al. Partial resistance of Medicago truncatula to Aphanomyces euteiches is associated with protection of the root stele and is controlled by a major QTL rich in proteasome-related genes. Mol Plant-Microbe Interact. 2009;22(9):1043–55.
Badreddine I, Lafitte C, Heux L, Skandalis N, Spanou Z, Martinez Y, Esquerré-Tugayé M, Bulone V, Dumas B, Bottin A. Cell wall chitosaccharides are essential components and exposed patterns of the phytopathogenic oomycete Aphanomyces euteiches. Eukaryot Cell. 2008;7(11):1980–93.
Badis Y, Bonhomme M, Lafitte C, Huguet S, Balzergue S, Dumas B, Jacquet C. Transcriptome analysis highlights preformed defences and signalling pathways controlled by the prAe1 quantitative trait locus (QTL), conferring partial resistance to Aphanomyces euteiches in Medicago truncatula. Mol Plant Pathol. 2015;16(9):973–86.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11(3):R25.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc. 1995;57(1):289–300.
McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012;40(10):4288–97.
Alexa A, Rahnenfuhrer J: topGO: Enrichment analysis for Gene Ontology. R package version 2.20.0. 2010.
Warnes G, Bolker B, Bonebakker L, Gentleman R, Liaw W, Lumley T, Maechler M, Magnusson A, Moeller S, Schwartz M, et al. gplots: various R programming tools for plotting data. R package version 2.17.0. 2015. https://cran.r-project.org/web/packages/gplots/index.html.
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol. 2010;59(3):307–21.
Drummond AJ, Suchard MA, Xie D, Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol Biol Evol. 2012;29(8):1969–73.
A cross-platform program for Bayesian analysis of molecular sequences. http://beast.bio.ed.ac.uk.
We would like to thank Adam Petrusek (Charles University in Prague, Czech Republic) for the gift of the Aphanomyces astaci strain.
EG was supported by an IBiSA-Genoscope project (APHANO-GP: AP09/10-No.19) and an Agence Nationale pour la Recherche JCJC grant (APHANO-Effect, ANR-12-JSV6-0004-01). This work was performed in collaboration with the GeT core facility, Toulouse, France (http://get.genotoul.fr), and was supported by France Génomique National Infrastructure, funded as part of the “Investissement d’avenir” program (ANR-10-INBS-09).
Availability of data and materials
Assemblies, sequencing, annotation, and expression data are available in AphanoDB v2.0 (http://www.polebio.lrsv.ups-tlse.fr/aphanoDB/). Genome assemblies and raw data are also available at the EMBL | European Nucleotide Archive (ENA) (https://www.ebi.ac.uk/ena/browse) under study numbers PRJEB24016, PRJEB24021, and PRJEB24018. Transcriptomic data are available at the National Center for Biotechnology Information (NCBI) | Gene Expression Omnibus (GEO) (https://www.ncbi.nlm.nih.gov/geo/) under accession number [GEO:GSE109500]. Aphanomyces spp. strains are available on request.
Ethics approval and consent to participate
Does not apply for this study.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
ST1. Genome features and annotations. 1a. Genome resources and nomenclature used in this study. 1b. BUSCO analysis. 1c. TE analysis. 1d. Orthology analysis (OrthoMCL) summary (nine oomycetes and Ae, Aa). 1e. OrthoMCL, group ID (11 oomycete proteomes). 1f. Ae, Aa: predicted secreted genes ID. 1g. Ae, Aa: GO analysis (secretome). 1h. Ae, Aa: specific secretomes analysis. 1i. CAZyome analysis. 1j. Fungal and oomycete pectate lyase sequences. 1k. CRN effectors predicted in Ae and Aa genomes. (XLSX 2962 kb)
Figure S1. Phylogenetic analysis of the A. euteiches PL1 gene family. A maximum likelihood phylogeny analysis was performed on an oomycete and fungal PL1 sequence alignment using the neighbor joining construction methods and the WAG protein substitution model. Bootstrap analysis was performed with 1000 replicates. (PPTX 89 kb)
ST2. A. euteiches expression analysis. 2a. Statistics of RNA-seq experiments. 2b. Percentage of up- and downregulated A. euteiches genes. 2c. Overrepresented GO terms in zoospores vs mycelium. 2d. Overrepresented GO terms at 3 dpi vs mycelium. 2e. Overrepresented GO terms at 9 dpi vs mycelium. 2f. Expression data for heatmap construction (GH). 2g. Expression data for heatmap construction (PL). 2h. Expression data for heatmap construction (proteases). 2i. Expression data for heatmap construction (CE). 2j. Expression data for heatmap construction (CBM). 2k. Expression data for heatmap construction (protease inhibitor). (XLSX 131 kb)
Figure S2. RNA-seq samples relationship. Multi-dimensional scaling plot (Euclidean distance; top = 500 genes) showing the leading log2-fold change (leading logFC) between the normalized samples of A. euteiches. Three biological replicates per condition. (PPTX 85 kb)
Figure S3. Gene expression of A. euteiches. Heatmaps of the Log2 RPKM values of carbohydrate esterases (A), carbohydrate-binding module (B) and proteases inhibitors (C). Colors on the left of the heatmaps indicate the subgroup the CE, the CBM or the PInh belongs to and if present the group of the corresponding class, and colors on the right indicate if the genes are significantly up- (red) or down- (blue) regulated in zoospores, infected roots 3 dpi and infected roots 9 dpi compared to expression in mycelium grown in vitro. (PPTX 99 kb)
ST3. A. euteiches small secreted proteins (AeSSPs). 3a. AeSSP classification and expression. 3b. Primers used in this study. (XLSX 37 kb)
About this article
Cite this article
Gaulin, E., Pel, M.J.C., Camborde, L. et al. Genomics analysis of Aphanomyces spp. identifies a new class of oomycete effector associated with host adaptation. BMC Biol 16, 43 (2018). https://doi.org/10.1186/s12915-018-0508-5
- Host adaptation