Recombination and insertion events involving the botulinum neurotoxin complex genes in Clostridium botulinum types A, B, E and F and Clostridium butyricumtype E strains
© Hill et al; licensee BioMed Central Ltd. 2009
Received: 10 September 2009
Accepted: 5 October 2009
Published: 5 October 2009
Clostridium botulinum is a taxonomic designation for at least four diverse species that are defined by the expression of one (monovalent) or two (bivalent) of seven different C. botulinum neurotoxins (BoNTs, A-G). The four species have been classified as C. botulinum Groups I-IV. The presence of bont genes in strains representing the different Groups is probably the result of horizontal transfer of the toxin operons between the species.
Chromosome and plasmid sequences of several C. botulinum strains representing A, B, E and F serotypes and a C. butyricum type E strain were compared to examine their genomic organization, or synteny, and the location of the botulinum toxin complex genes. These comparisons identified synteny among proteolytic (Group I) strains or nonproteolytic (Group II) strains but not between the two Groups. The bont complex genes within the strains examined were not randomly located but found within three regions of the chromosome or in two specific sites within plasmids. A comparison of sequences from a Bf strain revealed homology to the plasmid pCLJ with similar locations for the bont/bv b genes but with the bont/a4 gene replaced by the bont/f gene. An analysis of the toxin cluster genes showed that many recombination events have occurred, including several events within the ntnh gene. One such recombination event resulted in the integration of the bont/a1 gene into the serotype toxin B ha cluster, resulting in a successful lineage commonly associated with food borne botulism outbreaks. In C. botulinum type E and C. butyricum type E strains the location of the bont/e gene cluster appears to be the result of insertion events that split a rarA, recombination-associated gene, independently at the same location in both species.
The analysis of the genomic sequences representing different strains reveals the presence of insertion sequence (IS) elements and other transposon-associated proteins such as recombinases that could facilitate the horizontal transfer of the bonts; these events, in addition to recombination among the toxin complex genes, have led to the lineages observed today within the neurotoxin-producing clostridia.
Clostridium botulinum is a taxonomic designation for at least four diverse groups of Gram positive spore-forming anaerobic bacteria that produce the most potent naturally occurring toxin known, botulinum neurotoxin (BoNT). Production of BoNT has been the single criterion for inclusion within the C. botulinum species and was adopted in order to prevent scientific and medical confusion regarding the intoxication known as botulism. However, this single criterion has resulted in a species designation that encompasses clades of strains that should be considered as four separate species. Phylogenetic analysis of 16S rrn genes of C. botulinum strains clearly separates them into four Groups (I-IV) and supports this historical classification scheme based upon biochemical and biophysical parameters . Group I contains proteolytic serotype A, B and F strains, as well as bivalent (bv) Ab, Ba, Af, and Bf strains; Group II consists of nonproteolytic (np) and saccharolytic serotype B, E and F strains; Group III consists of serotype C and D strains; and Group IV consists solely of serotype G strains . Group IV has been recognized as a distinct species and its members have been given the additional name of C. argentinense . Further Group designations (V and VI) have been proposed for other clostridial species found to express BoNT, such as the BoNT/F-producing C. baratii strains and the BoNT/E-producing C. butyricum strains .
The 16S rrn dendrogram also shows that the tetanus toxin-producing Clostridia, C. tetani, occupies a distinct clade when compared to the other clostridial species. This species was one of the first clostridial genomes to be sequenced revealing the presence of the tetanus toxin within a 74 kb plasmid . Recent genomic sequences of different C. botulinum strains have revealed single or bivalent bonts are located within plasmids as often as within the chromosome [9–11]. Unlike tetanus toxin, which appears uniform from strain to strain, bont gene sequence comparisons have identified multiple variants that are recognized as serotypes and subtypes.
Comparisons of the BoNT/A-G protein sequences in strains representing the different Groups show that BoNT protein identities range from 34%-64% among the seven serotypes . In addition, the variation observed in BoNT protein sequences within the serotypes, except in type G, has resulted in designations of BoNT subtypes within a serotype (for example subtypes A1-A5 within BoNT/A).
The discordant phylogeny of the serological classification of the toxins with the 16S rrn analyses and Group designations indicates that the bont genes have been horizontally transferred between various clostridial lineages. Horizontal gene transfer events are observed within other bacterial species and contribute to bacterial evolution . Although the exact transfer mechanisms active within the clostridia remain unclear, the regions flanking the bont and toxin complex genes include partial and complete insertion sequence (IS) elements and gene duplication events indicative of mobile element activity. In addition, the genes of several bonts are located within plasmids or phage [9–11]. These findings suggest possible mechanisms that could enable the horizontal transfer of bont . Recombination events within the bont genes (mosaic bont/c/d and bont/a1/a3 for example) and within the ntnh gene that precedes the bont gene have been observed and contribute significantly to BoNT diversity [5, 6, 13, 14]. Although the three plasmids that contain bont/a3, bont/a4, bont/bv b or bont/b1 genes are largely homologous, each shows regions of inversions and deletions .
Because the toxin complex genes appear to move among the clostridia, they cannot be used to infer the phylogenetic relationships of the host bacteria. However, the sequences and the locations of the bont gene clusters provide clues to earlier gene transfer and recombination events. In order to better understand these events, we compared the available genomic sequences of several strains within the Group I, II and VI designations. Chromosome and plasmid synteny were analysed and the specific locations and sequences flanking the bont complex genes were examined within C. botulinum types A, B, E and F strains and a C. butyricum type E strain. Plasmid locations for the bont/np b gene within the Eklund 17 B strain and for the bont/bv b and bont/f genes within the bivalent Bf strain were identified. A detailed examination of the toxin complex genes and their flanking regions revealed recombination and insertion events that have contributed to the diversity observed today.
Chromosomal and plasmid synteny
List of analyzed genomes.
Locus tag ID2
NC 004557/NC 004565
Plasmid synteny was also examined by comparing the bont-containing plasmids (pCLK with bont/a3, pCLJ with bont/a4 and bont/bv b, pCLD with bont/b1) from Group I to each other and to the Group II pCLL with bont/np b and pE88 in C. tetani. These plasmids each contain genes encoding: 329 proteins (pCLK); 195 proteins (pCLD); 305 proteins (pCLJ); 54 proteins (pCLL); and 59 proteins (pE88). Although the plasmids containing bont/a3, bont/a4 and bont/b1 vary in size (148 kb - 270 kb), Figure 2 panel 2a shows large regions of conserved organization among these plasmids and a small inversion (16.7 kb) that contains the bont/a3 relative to the bont/a4.
The genomic sequence of the Group II B strain, Eklund 17B, revealed the location of the bont/np b within a small (47.6 kb) plasmid, pCLL, that was unique when compared to other bont-containing plasmids. Synteny plots show that pCLL differs from pCLK (Figure 2 panel 2b) and pE88, the plasmid within C. tetani that contains tetanus toxin (Figure 2 panel 2c). None of the C. botulinum plasmids (pCLK, pCLJ or pCLD) shared synteny to C. tetani pE88 (data not shown).
Although the sequence data for the Bf strain is incomplete, four Bf contigs share synteny to the bivalent pCLJ that contains bont/a4 and bont/bv b (Figure 2 panel 2d). The same inversion (16.7 kb) identified in panel 2a is observed when the contigs are compared to pCLJ. The evidence for the plasmid location of bont/bv b and bont/f is supported by the sequence homology of the four contigs to pCLJ and a detailed examination of the location of the bont/bv b and bont/f is described later.
These results show that the Group I C. botulinum A, B and F strains share a similar chromosome organization to each other and to C. sporogenes but not to the Group II nonproteolytic B strain or serotype E strains, the Group VI BoNT/E-producing C. butyricum or C. tetani. The plasmids containing bont/a3, bont/a4, bont/bv b, bont/b1 or bont/f gene clusters also show similarity to each other but not to the C. tetani pE88 or pCLL containing bont/np b. Comparisons between the Group II C. botulinum BoNT/E or npBoNT/B-producing strains revealed that their chromosomal backgrounds share synteny with each other but not with the Group VI C. butyricum type E strain. These relationships confirm the different genomic backgrounds within C. botulinum and C. tetani and support the 16S rrn analyses and historical C. botulinum Group designations.
Components of the BoNT gene clusters
Figure 3 shows that the ha gene cluster is found within serotype A subtype BoNT/A1 strains and all of the serotype B strains, including the gene cluster harboring the silent bont/(b) gene within BoNT/A1(B) strains. The orfX gene cluster is found within all of the other strains examined here, including BoNT/A2, BoNT/A3, BoNT/A4 strains and the bont/a1 gene cluster within the BoNT/A1(B) strain. It is also found within the proteolytic BoNT/F Langeland strain, the bont/f gene cluster in the bivalent Bf strain, the BoNT/E1 and BoNT/E3 strains and the BoNT/E-producing C. butyricum strain.
The bont/a1 gene appears to be the only bont so far identified within either of the two types of toxin complexes. The bont/a1 gene in strains ATCC 3502, ATCC 19397 and Hall is located within the ha cluster and the bont/a1 within the BoNT/A1(B) strain, as well as several other BoNT/A1 strains, is located within the orfX cluster . It appears that the location of the bont/a1 gene within the ha cluster resulted from a recombination event in the middle of the serotype B ntnh gene that has been previously reported . The first half of the ntnh gene in the BoNT/A1 strain is 99.7% identical to the ntnh within serotype B strains. After a recombination event occurring at approximately 1,965 nucleotides from the start codon of the 3,594 bp gene, the second half of the ntnh gene is equally similar to the ntnh gene in serotypes A2, A3 and A4 (90 to 95% identity) strains. This event has resulted in a bont/a1 gene residing within an ha cluster that contains a hybrid or recombinant B/A ntnh gene.
The ntnh recombination event locating the bont/a1 gene within the ha cluster has resulted in a very successful lineage that is frequently identified in botulism cases. The many strains representing this event, such as ATCC 3502, ATCC 19397 and Hall, contribute to the acceptance of the ha cluster in association with the bont/a1 gene. However, the orfX cluster is more likely to be the ancestral toxin gene cluster containing the bont/a1 gene, as indicated by the location of the other bont/a subtype genes (bont/a2 bont/a3 and bont/a4) and the bont/a1 gene of the silent B strains within the orfX cluster. In addition, the bont/a1 genes within the ha cluster are located in a different region of the chromosome from the bont/a1 genes in the orfX cluster, as described below.
Location of the BoNTs within the chromosome
The arsC gene is part of a group of genes (arsA, arsB, arsC, arsD, and arsR) that encodes for proteins involved in arsenic reduction. BoNT/A1, BoNT/A1(B), BoNT/A2, and BoNT/F strains contain all five genes, but BoNT/A3, BoNT/Ba4 and BoNT/B1 strains lack genes for arsA, arsB and arsD. Recently, it has been shown that certain BoNT/B2 strains lacking the full gene complement are sensitive to arsenic, while BoNT/B2 strains containing all five genes are relatively resistant to arsenic .
The gene sequence of the split rarA in the serotype E strains can be spliced together to encode an intact fully functional protein. The location of the split (codon 102) is in the same site in both the C. botulinum and C. butyricum strains. Interestingly, the inserted sequences not only contain the bont/e gene cluster, but also contain another rarA gene that is intact. Therefore, these strains retain an intact copy of rarA in addition to the one that is split.
Figure 8(b) compares the nucleotide sequences of the spliced and intact rarA gene in these strains and other species. The dendrogram shows that the intact (inserted) rarA are almost identical to each other in the BoNT/E-producing C. botulinum and C. butyricum strains, suggesting a common source. However, the sequences of the spliced rarA genes within these C. botulinum and C. butyricum strains are not identical. The spliced rarA within the Beluga and Alaska E43 strains are almost identical to each other and very similar to the Eklund 17B strain rarA sequence. The different sequences of the rarA genes that are split by the bont/e insertion in C. botulinum and C. butyricum show that these were separate events occurring in different bacterial backgrounds.
The mechanism of the insertion event likely involves the rarA protein, which is a resolvase involved in recombination or insertion events of transposons. Transposon activities within Gram positive bacteria are not well characterized but are known to be responsible for genetic exchange of antibiotic resistance genes and/or genomic islands in other bacteria such as Staphylococcus aureus methicillin resistance, for example . The rarA insertion site was likely targeted by the presence of a rarA gene within the inserted region. The presence of an IS element and a transposon resolvase involved in horizontal gene transfer suggests that either or both could have played a role in the insertion of the bont/e gene cluster into the chromosome.
Location of the BoNTs within plasmids
The plasmid location of the bont/a3, bont/a4, bont/bv b and bont/b1 genes from the analysed strains has been previously described [9, 10]. The bont/np b gene was recently identified by pulsed field gel electrophoresis to be located within a small plasmid . The genomic sequence data for the Eklund 17B strain verified the presence of bont/np b within a unique 47.6 kb plasmid. In addition, the location of the bont/bv b and bont/f within a plasmid (pBf) in the Bf strain was identified based upon synteny results and the high sequence homology of four Bf strain contigs (ABDP01000018.1, ABDP01000023.1, ABDP01000034.1 and ABDP01000069.1) with pCLJ, pCLD and pCLK. The comparisons of pCLJ to the Bf contig sequences yielded the following results: 99% identity, 89% coverage to contig ABDP01000023.1 (68.4 kb) that contains bont/f; 99% identity, 81% coverage to contig ABDP01000018.1 (84.3 kb) that contains bont/bv b; 96% identity, 52% coverage to contig ABDP01000034.1 (16.8 kb); and 98% identity, 65% coverage to contig ABDP01000069.1 (0.8 kb).
The second plasmid site within the Group I strains contains either the bont/bv b or the bont/b1 gene. However, the bont/np b is located within a very different plasmid and host background from the proteolytic strains. Examination of the regions flanking the bont/np b reveals that downstream is an IS element, a transposon-associated resolvase and site-specific recombinase. Like bont/e, the bont/np b is another example where a bont is in proximity to a transposon-associated protein involved in recombination and insertion events within a Group II background.
Recombination within the ntnhgene
Another recombination event was observed in the BoNT/A2-producing 7I03-H strain associated with an infant botulism case in Japan, evident from its location within the dendrogram. The first 2000 bases in this recombinant ntnh are almost identical with a BoNT/C1 ntnh (99.6% identity) and the final 1582 bases are 99.1% identical to the ntnh of the BoNT/A2 Kyoto-F strain (designated C/A ntnh) . The site of this recombination event is in the same region, but not in the same site, as the hybrid B/A ntnh described above.
The dendrogram also illustrates that the ntnh gene of the BoNT/A2-4 subtypes and the serotype F Langeland strain are very similar to each other, yet their bonts differ. A comparison of the ntnh genes for BoNT/A2 Kyoto-F and BoNT/F Langland shows them to be 97.0% identical for the first 3,443 nucleotides, but the identity decreases to 51.0% in the final 58 nucleotides. This finding indicates that a possible recombination event has occurred either in the 3' terminus of the ntnh gene and/or in the intergenic region between the ntnh and bont genes. The occurrence of this recombination event is also supported by the location of the serotype F Langeland ntnh gene with the ntnh genes of BoNT/A2,/A3, and/A4 strains in the dendrogram, and not with the ntnh genes of other serotype F strains.
These examples show the ability of the ntnh gene from the toxin complex of serotypes A and C, B and A and the 3' terminus or the intergenic region between an A ntnh and the bont/f genes to recombine; such recombination events have contributed to the variation observed. These events also illustrate the proximity of bacteria containing these genes to each other within an anaerobic environment that allows exchange and recombination.
Comparisons of the complete and shotgun sequence data from strains representing the Group I and II strains of C. botulinum and a C. butyricum type E strain were performed in order to further understand the variation observed among the BoNT-producing clostridia and to examine the unusual attributes observed within the species. These include the presence of similar bonts in different genomic backgrounds (bont/e in C. botulinum and C. butyricum for example), the presence of different bonts in similar backgrounds (serotype A proteolytic B and F C. botulinum strains) and the existence of bivalent strains. New technologies have made genomic sequencing more affordable and rapidly provide a wealth of sequence information that molecularly describes an organism. This study utilized the clostridial genomic sequence data and generated comparisons of: the 16S rrn genes from various clostridial species; the genomic synteny among strains; the locations of bont toxin clusters; and the components in their flanking regions. The data ties previous historical research with molecular results and increases our understanding of the species.
The molecular data supports the historical species Group I-IV classification system for C. botulinum based upon biochemical and physical properties. Comparisons of the organization of the genomic sequences in synteny plots presented here confirm that serotype A, B, and F of proteolytic Group I strains share a similar C. sporogenes genetic background. Likewise, the genomic organization within the Group II nonproteolytic strains that express B, E and F toxins share similarity to each other. The 16S rrn dendrogram shows that the different Groups I-IV within the C. botulinum species designation are clearly as distinct as other clades of clostridia that have been classified or named as separate species.
The location of the bont gene in these strains revealed that the sites are not randomly distributed in the host genomes. The bont and associated cluster genes are located within plasmids of varying sizes (47.6 - 270 kb) as well as within the chromosome. Franciosa et al. recently examined the location of the toxin cluster in 63 BoNT/B-producing C. botulinum strains using pulsed field gel electrophoresis; they discovered that each of the toxin gene clusters were located within plasmids ranging in size from ~55 to ~245 kb .
The presence of the toxin cluster within either plasmids, or within the chromosome in strains of the same or different serotypes, is consistent with horizontal transfer events mediated by plasmids or phage and recombination events mediated by mobile genetic elements such as transposons. These events result in the integration of the bont genes into different locations (plasmids, chromosome) and different host backgrounds (Group I-VI), as is observed within the BoNT-producing clostridia. The detailed examination of the bont locations reveals that these events occur with a greater frequency by homologous or targeted transposition rather than random or novel integration events.
The species also appears to undergo active recombination within the toxin complex genes, particularly at multiple sites within the ntnh gene. Examples of recombination include: (1) ntnh - the hybrid B/A ntnh placing the bont/a1 within the ha cluster of serotype B and the hybrid C/A ntnh placing the bont/a2 following a C/A ntnh hybrid; (2) bont - the hybrid bont/a2 gene consisting of bont/a1 and bont/a3; bont/c/d and bont/d/c hybrids; and (3) ntnh/bont - the site between the ntnh and bont genes placing a bont/f following a bont/a2 ntnh gene. These recombination events compound the confusion of the taxonomy of the species and make it difficult to clearly describe the strains with the current nomenclature. Clearly, from the examples listed above, the multiple recombination events have significantly contributed to the genetic diversity observed in the bonts.
This study provides the first molecular information to explain the unusual observation of a bont/e within both C. botulinum and C. butyricum type E strains. By examining the bont/e location within the two species, an insertion event was identified which targeted the same rarA gene. The rarA is a transposon-associated gene with recombinase activity that could explain the precise excision and integration of the bont/e in the two species. Interestingly, the comparison of sequences of the spliced and intact rarA genes revealed that this insertion event occurred separately in the two species, yet the inserted region containing the bont/e gene was from a common source.
Other transposon-associated proteins were identified downstream from the bont/np b where an IS element, resolvase and site-specific recombinase are located. Unfortunately, Gram positive transposons are not well characterized and elude detection because they lack perfect inverted repeats flanking the transposed region or are not replicated in the process . Although specific transposons were not identified near the toxin complex genes, transposon-associated proteins were found. The identification of these proteins, the presence of the toxin complex in different host backgrounds, its location within the chromosome as often as within plasmids and the identification of specific targeted insertion sites in the same or different species implicate transposon activity as at least one mechanism for bont movement.
The genomic analyses also discovered the location of two bont genes within plasmids, the bont/np b in the Eklund 17B strain and the bont/bv b and bont/f within the Bf strain. The bont/np b-containing plasmid could have been horizontally transferred to a Group II bacterial background, or it could have been the result of a transposon-mediated insertion into a unique plasmid. Likewise the bont/bv b and bont/f location within a plasmid homologous to the bivalent pCLJ with the bont/a4 replaced with bont/f shows that the two sequenced bivalent strains contain bonts in similar locations and that the bonts are distant to each other. It is interesting that, within the two sequenced bivalent strains, the bonts are within either an ha cluster or an orfX cluster. These different clusters could provide differing protection or expression of the bont.
The finding that the ntnh gene has recombined to place the bont/a1 within the ha cluster associated with BoNT/B strains helps resolve the perception of the 'normal' toxin cluster associated with bont/a1 strains. The success of the ha toxin cluster strains, as evidenced by their widespread isolation in conjunction with human botulism cases, indicates that the ha components must confer some cultural or toxicity advantage that is not yet clearly understood.
This study, which compares 15 clostridial genomic sequences, was undertaken in order to identify the underlying events that result in the genetic diversity within the C. botulinum species. As more genomic sequences become available, additional clues to understanding this complex species and its many toxin types and subtypes will be uncovered. This molecular analysis provided: (1) a 16S rrn dendrogram of the clostridial species that included recently sequenced members; (2) synteny plots that visualize chromosomal and plasmid gene organization; (3) the identification of common locations of the bonts within the chromosome and plasmid; (4) the components of the bont-containing regions that identify common features: (5) a description of an insertion event mediated by a transposon-associated resolvase placing bont/e in both C. botulinum and C. butyricum type E strains; (6) plasmid analyses which show that the bonts within the Bf strain and npBoNT/B strain are located within a plasmid; and (7) the identification and examples of recombination within the ntnh gene, bont gene and the region between these two genes.
The findings illustrate that the bonts within the clostridia insert, recombine and are exchanged both within a species and among species. The presence of bont genes within stable plasmids that are not lost suggests the genes confer some survival advantage to the host bacteria. Whether the bont gene is within a plasmid or chromosome, a single or bivalent arrangement or within the orfX or ha toxin gene cluster, the toxin has been both retained in, and spread among, a variety of different clostridial species termed Groups. The toxin complex genes have undergone recombination, insertion and horizontal gene transfer events that have yielded many variations of the bont gene, thereby producing the toxin serotypes and subtypes. Horizontal gene transfer events and genomic rearrangements are important mechanisms for bacterial survival and evolution. Within the clostridia these attributes have enabled the bont genes to continue to survive in different clostridial host backgrounds and environments.
Table 1 lists the strains examined in this study. They represent C. botulinum A, B, E and F serotypes and subtypes, including two bivalent strains (BoNT/Ba4, BoNT/Bf), a strain containing both bont/a1 and bont/b gene clusters where the bont/b gene is not expressed (BoNT A1(B)) and a BoNT/E4-expressing C. butyricum; a C. tetani and a C. sporogenes strain was included for comparison . Some genomic sequences were complete or in several large contigs and others were whole genome shotgun sequences.
Annotation of the assembled genome sequence was carried out with the genome annotation system GenDB  and RAST server . A combined gene prediction strategy was applied by means of the GLIMMER 2.0 system and the CRITICA program suite  along with postprocessing by the RBSfinder tool . tRNA genes were identified with tRNAscan-SE . The deduced proteins were functionally characterized by automated searches in public databases, including SWISS-PROT and TrEMBL , Pfam , TIGRFAM , InterPro , and KEGG . Additionally, SignalP , helix-turn-helix  and TMHMM  were applied. Finally, each gene was functionally classified by assigning clusters of orthologous groups (COG) number and corresponding COG category  and gene ontology numbers .
Genome and plasmid comparisons
Homology searches were conducted at the nucleotide and amino acid sequence level using BLAST . In order to obtain a list of orthologs from bacteroidete genomes, a Perl script that determines bidirectional best hits was written; for example, genes g and h were considered orthologs if h was the best BLASTP hit for g and vice versa. E values of 10-15 were acceptable. A gene was considered strain specific if it had no hits with an E value of 10-5 or less. Additional genomic comparisons and dotplot analyses were performed with genome alignment tools, such as MUMmer2 , NUCmer  and the web interface Artemis Comparison Tool (ACT) http://www.webact.org.
The comparison of toxin gene island insertion patterns was identified using the ACT alignment program at the default settings. Predicted toxin gene island insertion sites were identified from sequence alignments and breakpoint sites were further manually curated. Gene definition was manually annotated by inspecting BLASTP results and sequence alignments. The gene name and locus ID were assigned based on the NCBI Reference Sequence file. Insertion sequence (IS) elements were identified and classified by using the IS Finder database .
Plasmid analysis of the Bf contigs was performed by using BLASTN with the pCLJ sequence and 70 Bf contigs. All sequences scoring above the E value cutoff at 1e-20 were extracted for further comparison using the PROmer program from MUMmer package. Four putative pBf sequences from contigs ABDP01000018.1, ABDP01000023.1, ABDP01000034.1 and ABDP01000069.1 were aligned to pCLJ sequences. MUMmerplot was used to display the four contigs (pBf) that were ordered according to pCLJ reference coordinates.
DNA alignments were created with a combination of Sequencer software http://www.genecodes.com/, PAUP http://paup.csit.fsu.edu/, MUSCLE http://www.drive5.com/muscle/, CLUSTAL-W http://www.ebi.ac.uk/clustalw/ and hand editing with BioEdit http://www.mbio.ncsu.edu/BioEdit/bioedit.html software and were gap stripped then analysed using PHYLIP http://evolution.genetics.washington.edu/phylip.html with dnadist with the F84 model of evolution and a transition to transversion ratio of 2.0 (default) and neighbor joining algorithms. Dendrograms were rendered with FigTree http://tree.bio.ed.ac.uk/software/figtree/. Intra- and inter-serotype BoNT gene recombination was explored with SimPlot http://sray.med.som.jhmi.edu/SCRoftware/simplot/ and BioEdit.
nonproteolytic botulinum neurotoxin B
Aremis Comparison Tool
This work was funded in part by the Department of Homeland Security Science and Technology Directorate under contract number HSHQDC-08-C-00158. The DOE Joint Genome Institute at Los Alamos National Laboratory acknowledges the support of the Intelligence Technology Innovation Center for this research. This work was partially supported by NIAID cooperative agreement U01 AI056493.
The opinions, interpretations and recommendations are those of the authors and are not necessarily those of the USArmy.
We are grateful to Yi-Feng Chang for his assistance in the genome annotation. We also thank Dr Stephen Arnon for his constructive edits and discussions.
- Hutson RA, Thompson DE, Lawson PA, Schocken-Itturino RP, Bottger EC, Collins MD: Genetic interrelationships of proteolytic Clostridium botulinum types A, B, and F and other members of the Clostridium botulinum complex as revealed by small-subunit rRNA gene sequences. Antonie van Leeuwenhoek. 1993, 64: 273-283. 10.1007/BF00873087.View ArticlePubMedGoogle Scholar
- Smith LD, Sugiyama H: Chapter III. Cultural and serological characteristics. Botulism: the organism, its toxins, the disease. Edited by: Smith LD, Sugiyama H. 1988, Springfield: Charles C Thomas, 23-37. 2Google Scholar
- Summanen P: Recent taxonomic changes for anaerobic gram-positive and selected gram-negative organisms. Clin Infect Dis. 1993, 16 (suppl 4): S168-174.View ArticlePubMedGoogle Scholar
- Hatheway CL, Ferreira JL: Detection and identification of Clostridium botulinum neurotoxins. Adv Exp Med Biol. 1996, 391: 481-498.View ArticlePubMedGoogle Scholar
- Collins MD, East AK: Phylogeny and taxonomy of the food-borne pathogen Clostridium botulinum and its neurotoxins. J Appl Microbiol. 1998, 84: 5-17. 10.1046/j.1365-2672.1997.00313.x.View ArticlePubMedGoogle Scholar
- Hill KK, Smith TJ, Helma CH, Ticknor LO, Foley BT, Svensson RT, Brown JL, Johnson EA, Smith LA, Okinaka RT, Jackson PJ, Marks JD: Genetic diversity among botulinum neurotoxin-producing clostridial strains. J Bacteriol. 2007, 189: 818-832. 10.1128/JB.01180-06.PubMed CentralView ArticlePubMedGoogle Scholar
- Carter AT, Paul CJ, Mason DR, Twine SM, Alston MJ, Logan SM, et al: Independent evolution of neurotoxin and flagellar genetic loci in proteolytic Clostridium botulinum. BMC Genomics. 2009, 10: 115-10.1186/1471-2164-10-115.PubMed CentralView ArticlePubMedGoogle Scholar
- Bruggemann H, Baumer S, Fricke WF, Wiezer A, Liesegang H, Decker I, et al: The genome sequence of Clostridium tetani, the causative agent of tetanus disease. Proc Natl Acad Sci. 2003, 100: 1316-21. 10.1073/pnas.0335853100.PubMed CentralView ArticlePubMedGoogle Scholar
- Smith TJ, Hill KK, Foley BT, Detter JC, Munk AC, Bruce DC, et al: Analysis of the neurotoxin complex genes in Clostridium botulinum A1-A4 and B1 strains: BoNT/A3,/Ba4 and/B1 clusters are located within plasmids. PLoS ONE. 2007, 2: e1271-10.1371/journal.pone.0001271.PubMed CentralView ArticlePubMedGoogle Scholar
- Marshall KM, Bradshaw M, Pellett S, Johnson EA: Plasmid encoded neurotoxin genes in Clostridium botulinum serotype A subtypes. Biochem Biophys Res Comm. 2007, 361: 49-54. 10.1016/j.bbrc.2007.06.166.PubMed CentralView ArticlePubMedGoogle Scholar
- Franciosa G, Maugliani A, Scalfaro C, Aureli P: Evidence that plasmid-borne botulinum neurotoxin type B genes are widespread among Clostridium botulinum serotype B strains. PLOSOne. 2009, 4: e4829-View ArticleGoogle Scholar
- Kelly BG, Vespermann A, Bolton DJ: The role of horizontal gene transfer in the evolution of selected foodborne bacterial pathogens. Food Chem Toxicol. 2009, 47: 951-68. 10.1016/j.fct.2008.02.006.View ArticlePubMedGoogle Scholar
- Dineen SS, Bradshaw M, Johnson EA: Neurotoxin gene clusters in Clostridium botulinum type A strains: sequence comparison and evolutionary implications. Curr Microbiol. 2003, 46: 345-352. 10.1007/s00284-002-3851-1.View ArticlePubMedGoogle Scholar
- Moriishi K, Koura M, Fujii N, Fujinaga Y, Inoue K, Syuto B, Oguma K: Molecular cloning of the gene encoding the mosaic neurotoxin, composed of parts of botulinum neurotoxin types C1 and D, and PCR detection of this gene from Clostridium botulinum type C organisms. Appl Environ Microbiol. 1996, 62: 662-667.PubMed CentralPubMedGoogle Scholar
- American type culture collection catalogue of bacteria and phages. 1992, Maryland: American Type Culture Collection, 18
- Lewis KH, Hill EV: Practical media and control measures for producing highly toxic cultures of Clostridium botulinum, Type A. J Bacteriol. 1947, 53: 213-230.PubMed CentralPubMedGoogle Scholar
- Sebaihia M, Peck MW, Minton NP, Thomson NR, Holden MT, Mitchell WJ, et al: Genome sequence of a proteolytic (Group I) Clostridium botulinum strain Hall A and comparative analysis of the clostridial genomes. Genome Res. 2007, 17: 1082-1092. 10.1101/gr.6282807.PubMed CentralView ArticlePubMedGoogle Scholar
- Jacobson MJ, Lin G, Raphael B, Andreadis J, Johnson EA: Analysis of neurotoxin cluster genes in Clostridium botulinum strains producing botulinum neurotoxin serotype A subtypes. Appl Environ Microbiol. 2008, 74: 2778-2786. 10.1128/AEM.02828-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Matsumura T, Jin Y, Kabumoto Y, Takegahara Y, Oguma K, Lencer WI, Fujinaga Y: The HA proteins of botulinum toxin disrupt intestinal epithelial intercellular junctions to increase toxin absorption. Cell Microbiol. 2008, 10: 355-364.PubMedGoogle Scholar
- Raphael BH, Luquez C, McCroskey LM, Joseph LA, Jacobson MJ, Johnson EA, Maslanka SE, Andreadis JD: Genetic homogeneity of Clostridium botulinum type A1 strains with unique toxin gene clusters. Appl Environ Microbiol. 2008, 74: 4390-4397. 10.1128/AEM.00260-08.PubMed CentralView ArticlePubMedGoogle Scholar
- East AK, Bhandari M, Stacey JM, Campbell KD, Collins MD: Organization and phylogenetic interrelationships of genes encoding components of the botulinum toxin complex in proteolytic Clostridium botulinum types A, B and F: evidence of chimeric sequences in the gene encoding the nontoxic nonhemagglutinin component. Int J Syst Bacteriol. 1996, 46: 1105-1112.View ArticlePubMedGoogle Scholar
- Lindstrom M, Hinderink K, Somervuo P, Kivinieme K, Nevas M, Chen Y, Auvinen P, Carter AT, Mason DR, Peck MW, Korkeala H: Comparative genomic hybridization analysis of two predominant Nordic Group I (proteolytic) Clostridium botulinum type B clusters. Appl Environ Microbiol. 2009, 75: 2643-51. 10.1128/AEM.02557-08.PubMed CentralView ArticlePubMedGoogle Scholar
- Dineen SS, Bradshaw M, Karasek CE, Johnson EA: Nucleotide sequence and transcriptional analysis of the type A2 neurotoxin gene cluster in Clostridium botulinum. FEMS Microbiol Lett. 2004, 235: 9-16. 10.1111/j.1574-6968.2004.tb09561.x.View ArticlePubMedGoogle Scholar
- Gill SR, Fouts DE, Archer GL, Mongodin EF, DeBoy RT, Ravel J, Paulsen IT, Kolonay JF, Brinkac L, Beanan M, Dodson RJ, Daugherty SC, Madupu R, Angiuoli SV, Durkin AS, Haft DH, Vamathevan J, Khouri H, Utterback T, Lee C, Dimitrov G, Jiang L, Qin H, Weidman J, Tran K, Kang K, Hance IR, Nelson KE, Fraser CM: Insights on evolution of virulence and resistance from the complete genome analysis of an early methicillin-resistant Staphylococcus aureus strain and a biofilm-producing methicillin-resistant Staphylococcus epidermidis strain. J Bacteriol. 2005, 187: 2426-2438. 10.1128/JB.187.7.2426-2438.2005.PubMed CentralView ArticlePubMedGoogle Scholar
- Kubota T, Shinakawa S, Kozaki S, Isogai E, Isogai H, Kimura K, Fujii N: Mosaic type of the nontoxic-nonhemagglutinin component gene in Clostridium botulinum type A strain isolated from infant botulism in Japan. Biochem Biophys Res Comm. 1996, 224: 843-848. 10.1006/bbrc.1996.1110.View ArticlePubMedGoogle Scholar
- Transposable elements in gram-positive bacteria. Mobile DNA. Edited by: Berg DE, Howe MM. 1989, Washington, D.C: American Society for Microbiology, 269-288.
- Meyer F, Goesmann A, McHardy AC, Bartels D, Bekel T, Clausen , Kalinowski J, Linke B, Rupp O, Giegerich R, Puhler A: GenDB--an open source genome annotation system for prokaryote genomes. Nucleic Acids Res. 2003, 31: 2187-2195. 10.1093/nar/gkg312.PubMed CentralView ArticlePubMedGoogle Scholar
- Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Relch C, Stevens R, Vassleva O, Vonstein V, Wilke A, Zagnitko O, et al: The RAST server: rapid annotations using subsystems technology. BMC Genomics. 2008, 9: 75-10.1186/1471-2164-9-75.PubMed CentralView ArticlePubMedGoogle Scholar
- McHardy AC, Goesmann A, Pühler A, Meyer F: Development of joint application strategies for two microbial gene finders. Bioinformatic. 2004, 20: 1622-1631. 10.1093/bioinformatics/bth137.View ArticleGoogle Scholar
- Suzek BE, Ermolaeva MD, Schreiber M, Salzberg SL: A probabilistic method for identifying start codons in bacterial genomes. Bioinformatics. 2001, 17: 1123-1130. 10.1093/bioinformatics/17.12.1123.View ArticlePubMedGoogle Scholar
- Lowe TM, Eddy SR: tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 1997, 25: 955-964. 10.1093/nar/25.5.955.PubMed CentralView ArticlePubMedGoogle Scholar
- Boeckmann B, Bairoch A, Apweiler R, Blatter M-C, Estreicher A, Gasteiger E, Martin MJ, Michoud K, O'Donovan C, Phan I, Pilbout S, Schneider M: The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res. 2003, 31: 365-370. 10.1093/nar/gkg095.PubMed CentralView ArticlePubMedGoogle Scholar
- Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer EL: The Pfam protein families database. Nucleic Acids Res. 2002, 30: 276-280. 10.1093/nar/30.1.276.PubMed CentralView ArticlePubMedGoogle Scholar
- Haft DH, Loftus BJ, Richardson DL, Yang F, Eisen JA, Paulsen IT, White O: TIGRFAMs: a protein family resource for the functional identification of proteins. Nucleic Acids Res. 2001, 29: 41-43. 10.1093/nar/29.1.41.PubMed CentralView ArticlePubMedGoogle Scholar
- Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Barrell D, Bateman A, Binns D, Biswas M, Bradley P, Bork P, Bucher P, Copley RR, Courcelle E, Das U, Durbin R, Falquet L, Fleischmann W, Griffiths-Jones S, Haft D, Harte N, Hulo N, Kahn D, Kanapin A, Krestyaninova M, Lopez R, Letunic I, Lonsdale D, Silventoinen V, Orchard SE, Pagni M, Peyruc D, Ponting CP, Selengut JD, Servant F, Sigrist CJ, Vaughan R, Zdobnov EM: The InterPro database, 2003 brings increased coverage and new features. Nucleic Acids Res. 2003, 31: 315-318. 10.1093/nar/gkg046.PubMed CentralView ArticlePubMedGoogle Scholar
- Kanehisa M, Goto MS: KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000, 28: 27-30. 10.1093/nar/28.1.27.PubMed CentralView ArticlePubMedGoogle Scholar
- Nielsen H, Krogh A: Prediction of signal peptides and signal anchors by a hidden Markov model. Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology (ISMB 6). 1998, California: AAAI Press, 122-130.Google Scholar
- Dodd IB, Egan JB: Improved detection of helix-turn-helix DNA-binding motifs in protein sequences. Nucleic Acids Res. 1990, 18: 5019-5026. 10.1093/nar/18.17.5019.PubMed CentralView ArticlePubMedGoogle Scholar
- Krogh A, Larsson B, von Heijne G, Sonnhammer EL: Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. J Mol Biol. 2001, 305: 567-580. 10.1006/jmbi.2000.4315.View ArticlePubMedGoogle Scholar
- Tatusov RL, Galperin MY, Natale DA, Koonin EV: The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Res. 2000, 28: 33-36. 10.1093/nar/28.1.33.PubMed CentralView ArticlePubMedGoogle Scholar
- Harris MA, Clark J, Ireland A, Lomax J, Ashburner M, Foulger R, Eilbeck K, Lewis S, Marshall B, Mungall C, Richter J, Rubin GM, Blake JA, Bult C, Dolan M, Drabkin H, Eppig JT, Hill DP, Ni L, Ringwald M, Balakrishnan R, Cherry JM, Christie KR, Costanzo MC, Dwight SS, Engel S, Fisk DG, Hirschman JE, Hong EL, Nash RS, Sethuraman A, Theesfeld CL, Botstein D, Dolinski K, Feierbach B, Berardini T, Mundodi S, Rhee SY, Apweiler R, Barrell D, Camon E, Dimmer E, Lee V, Chisholm R, Gaudet P, Kibbe W, Kishore R, Schwarz EM, Sternberg P, Gwinn M, Hannick L, Wortman J, Berriman M, Wood V, de la Cruz N, Tonellato P, Jaiswal P, Seigfried T, White R: The gene ontology (GO) database and informatics resource. Nucleic Acids Res. 2004, 32: 258-261. 10.1093/nar/gkh066.View ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Delcher AL, Phillippy A, Carlton J, Salzberg SL: Fast algorithms for large-scale genome alignment and comparison. Nucleic Acids Res. 2002, 30: 2478-2483. 10.1093/nar/30.11.2478.PubMed CentralView ArticlePubMedGoogle Scholar
- Kurtz S, Phillippy A, Delcher A, Smoot M, Shumway M, Antonescu C, Salzberg SL: Versatile and open software for comparing large genomes. Genome Biology. 2004, 5: R12-10.1186/gb-2004-5-2-r12.PubMed CentralView ArticlePubMedGoogle Scholar
- Carver TJ, Rutherford KM, Berriman M, Rajandream MA, Barrell BG, Parkhill J: ACT: the Artemis Comparison Tool. Bioinformatics. 2005, 21: 3422-3423. 10.1093/bioinformatics/bti553.View ArticlePubMedGoogle Scholar
- Siguier P, Perochon J, Lestrade L, Mahillon J, Chandler M: ISfinder: the reference centre for bacterial insertion sequences. Nucleic Acids Res. 2006, 34: D32-36. 10.1093/nar/gkj014.PubMed CentralView ArticlePubMedGoogle Scholar
- Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, Ingersoll R, Sheppard HW, Ray SC: Full-length human immunodeficiency virus type 1 genomes from subtype C-infected seroconverters in India, with evidence of intersubtype recombination. J Virol. 1999, 73: 152-60.PubMed CentralPubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.