Phylogeographic reconstruction of a bacterial species with high levels of lateral gene transfer
- Talima Pearson1,
- Philip Giffard2, 3,
- Stephen Beckstrom-Sternberg1, 4,
- Raymond Auerbach1, 15,
- Heidie Hornstra1,
- Apichai Tuanyok1,
- Erin P Price1, 4,
- Mindy B Glass5,
- Benjamin Leadem1,
- James S Beckstrom-Sternberg4,
- Gerard J Allan6,
- Jeffrey T Foster1,
- David M Wagner1,
- Richard T Okinaka1, 7,
- Siew Hoon Sim8,
- Ofori Pearson9,
- Zaining Wu10,
- Jean Chang10,
- Rajinder Kaul10,
- Alex R Hoffmaster5,
- Thomas S Brettin11,
- Richard A Robison12,
- Mark Mayo2,
- Jay E Gee5,
- Patrick Tan8, 13,
- Bart J Currie2, 14 and
- Paul Keim1, 4
© Pearson et al; licensee BioMed Central Ltd. 2009
Received: 24 July 2009
Accepted: 18 November 2009
Published: 18 November 2009
Phylogeographic reconstruction of some bacterial populations is hindered by low diversity coupled with high levels of lateral gene transfer. A comparison of recombination levels and diversity at seven housekeeping genes for eleven bacterial species, most of which are commonly cited as having high levels of lateral gene transfer, shows that the relative contribution of homologous recombination versus mutation for Burkholderia pseudomallei is more than twice that for Streptococcus pneumoniae and is thus the highest value yet reported in bacteria. Despite the potential for homologous recombination to increase diversity, B. pseudomallei exhibits a relative lack of diversity at these loci. In these situations, whole genome genotyping of orthologous shared single nucleotide polymorphism loci, discovered using next generation sequencing technologies, can provide very large data sets capable of estimating core phylogenetic relationships. We compared 43 whole genome sequences of B. pseudomallei and its closest relatives and searched them for single nucleotide polymorphisms in orthologous shared regions to use in phylogenetic reconstruction.
Bayesian phylogenetic analyses of >14,000 single nucleotide polymorphisms yielded completely resolved trees for these 43 strains with high levels of statistical support. These results enable a better understanding of a separate analysis of population differentiation among >1,700 B. pseudomallei isolates as defined by sequence data from seven housekeeping genes. We analyzed this larger data set for population structure and allele sharing that can be attributed to lateral gene transfer. Our results suggest that, despite an almost panmictic population, we can detect two distinct populations of B. pseudomallei that conform to a biogeographic pattern found in many plant and animal species: separation along Wallace's Line, a biogeographic boundary between Southeast Asia and Australia.
We describe an Australian origin for B. pseudomallei, characterized by a single introduction event into Southeast Asia during a recent glacial period, and variable levels of lateral gene transfer within populations. These patterns provide insights into mechanisms of genetic diversification in B. pseudomallei and its closest relatives, and provide a framework for integrating the traditionally separate fields of population genetics and phylogenetics for other bacterial species with high levels of lateral gene transfer.
Efforts to understand the evolutionary history of organisms have benefited from the availability of increasing amounts of molecular data, especially whole genome sequences (WGSs). The availability of multiple WGSs has led to more accurate reconstructions of phylogenetic relationships within several bacterial species [1–9], but all of these studies have been limited by a small number of WGSs (19 or fewer genomes). The availability of multiple WGSs per species is currently quite rare, but the cost of generating WGSs continues to decline and it is anticipated that future phylogenetic studies will routinely employ multiple WGSs.
Due to their short evolutionary history and clonality, Bacillus anthracis  and Mycobacterium tuberculosis  were ideal models for pioneering phylogenetic work using multiple WGSs, but hurdles in phylogenetic reconstruction persist for other species. The genomes of these two species exhibit almost no homoplasy (the appearance of similar character states in unrelated samples due to evolutionary convergence or parallelisms) due to their recent species derivation and complete clonality. Thus, character differences, as measured by single nucleotide polymorphisms (SNPs), are assumed to have arisen only once in their evolutionary history. Also, these two species exhibit no evidence of conspecific lateral gene transfer (LGT), which can cause apparent homoplasy by placing alleles with common origins in different genetic backgrounds. In contrast, most bacterial species, including Burkholderia pseudomallei, have a longer history of mutation accumulation, as well as a history of LGT [11–13], which increase the probability of homoplasy and apparent homoplasy, respectively. Thus, for all but the most recently emerged and clonal species, fine-scale phylogenetic reconstruction has been elusive using common genetic markers. Recent sequencing efforts for B. pseudomallei and other closely related species provided the opportunity for pioneering phylogenetic work on a species with high levels of LGT.
B. pseudomallei causes the severe disease melioidosis and is widely distributed in soil and fresh water in Southeast Asia and tropical Australia. Animal to animal transmission is rare but a wide variety of animals can be infected [16, 17], reseeding nearby areas [17, 18] and providing limited dispersion for this otherwise immobile species. These small-scale movements should be reflected in the population structure of B. pseudomallei, with geographic barriers such as oceans being traversed rarely or not at all. A monophyletic group of isolates within the B. pseudomallei group has diverged to become an equine pathogen, B. mallei, which does not survive well in soil. Like B. pseudomallei, the closely related B. thailandensis and B. oklahomensis live in soil but are much less pathogenic and are phylogenetically distinct from B. pseudomallei/B. mallei.
Various molecular methods have been used for phylogenetic reconstruction of these Burkholderia species, with different levels of success. Multiple-locus variable number tandem repeat (VNTR) analysis (MLVA) of B. pseudomallei and B. mallei is effective for determining relationships among very closely related isolates, but not broad patterns of relatedness [20, 21]. Multilocus sequence typing (MLST) of seven housekeeping genes  can be used to identify epidemiologically linked isolates of the same sequence type (ST) and determine phylogenetic relationships at a species level , but efforts to infer relationships among STs within B. pseudomallei have yielded little statistical support [16, 23, 24]. This is due to homologous recombination within and possibly among B. thailandensis, B. pseudomallei, and B. mallei [11–13, 24, 25], as well as limitations of restricted gene sampling in highly recombining populations . Microarray-based comparative gene hybridization analysis of 23 strains from these three Burkholderia species avoided problems associated with limited gene sampling by targeting close to 7,000 open reading frames discovered from the WGS of a single B. pseudomallei isolate, K96243 . However, the subsequent phylogeny derived from this work was heavily weighted towards isolates from Southeast Asia  and may not be representative of the global evolution of these species. Thus, the need for a comprehensive phylogeny with extensive character sampling persists.
Whole genome SNP phylogenies are highly accurate in terms of defining both branching order and branch lengths, despite collapsed secondary branches that lead to isolates that have not been sequenced [5, 28, 29]. SNPs are more evolutionarily informative than most other types of molecular markers due to intrinsically slow mutation rates, few character states, and extensive distribution across the entire genome. In addition, a large number of shared, orthologous SNP loci facilitate robust characterization of deep relationships and high resolution among closely related individuals [28, 30, 31]. However, because SNPs are relatively rare and scattered throughout a genome, WGSs from multiple strains are required for identification.
Here, we construct a robust, large-scale phylogeny of B. pseudomallei and its close relatives, overcoming significant problems associated with LGT and SNP discovery by using a large number of orthologous SNP loci distributed across the entire genome and shared among 43 fully sequenced genomes. We compare these findings to broad population patterns determined using sequence data from seven housekeeping genes in a global collection of >1,700 isolates. To our knowledge, this broad-scale integration of phylogenetics from whole genome sequence comparisons and population genetics from extensive MLST data in a spatial context is unprecedented for this species and provides a model for assessing the dispersal and differentiation of other bacteria that have high levels of recombination.
Results and Discussion
Phylogenetic patterns revealed by shared, orthologous SNPs derived from WGSs
All B. pseudomallei bifurcations received the highest possible level of statistical support (Figure 2). The Australian population of B. pseudomallei is more ancient than either the Asian B. pseudomallei group or the B. mallei group. The Australian isolates form a deep paraphyletic group within the B. pseudomallei/B. mallei phylogeny, and the most divergent isolates in this group are five isolates from Australia and E208 from Ecuador (Figure 2). The remaining isolates form two clades: the B. mallei clade and a group of B. pseudomallei dominated by Asian isolates. The phylogenetic position of B. mallei confirms previous research suggesting that this species arose from a B. pseudomallei lineage and experienced a recent radiation . Our results suggest that none of the B. pseudomallei isolates included in this study are closely related to the B. mallei clone, and all Asian B. pseudomallei isolates fall into a single coherent group that diverged relatively recently, after the split with B. mallei.
LGT events may have had only a slight effect on phylogenetic topology, suggesting that while most events are spread evenly throughout the B. pseudomallei phylogenetic space, some highways of gene sharing may exist. Homoplastic SNPs are located throughout shared genomic regions and are interspersed with non-homoplastic SNPs, suggesting that all regions may have been subject to LGT events and that such events likely involved small stretches of DNA (Additional file 4a). Our test of residual homoplasies (Materials and Methods) resulted in a tree that is remarkably similar to the previous topology (Additional file 4b,c). Topological similarities have three possible explanations: 1) a lack of preferential transfer of DNA always involving the same lineages (no highways of gene sharing), 2) convergent or reverse mutations evenly spread throughout the phylogeny, and 3) sequencing errors evenly spread throughout phylogenetic space. Stochastic variation in homoplastic mutations and sequencing errors may explain topological differences within the B. mallei clade, which was mostly supported by non-homoplastic characters. Stochastic variation in all three parameters may be sufficient to cause topological differences among B. pseudomallei relationships; however, as three of the four changed bifurcations remain statistically robust, we suspect that some LGT events consistently involved the same lineages. The congruency between trees created with homoplastic SNPs and all orthologous shared SNPs further increases our confidence that the depicted phylogenetic relationships provide a reasonable hypothesis for the actual patterns of descent for the individual isolates. As the addition of more taxa increases phylogenetic accuracy, we suspect that the actual patterns of vertical descent will become even more distinct as more genomes are sequenced.
This confirms the appropriateness of depicting the core evolutionary history of this set of organisms with a phylogenetic tree, rather than a network, as individual LGT events do not involve a large enough portion of the genome to disrupt the core phylogenetic patterns. In other words, despite high levels of LGT, the underlying core evolutionary trajectory can be determined and follows a bifurcating pattern. Indeed, as the evolutionary history of individual genes is determined, mapping these events onto the core phylogenetic tree will provide insights into gene flow within a species and will cause the core tree to appear more network-like. These analyses reveal the value and necessity of whole genome orthologous SNPs for defining patterns of descent even in organisms that are not completely clonal and have high levels of LGT. Sequence comparisons from a small number of genes, as in MLST schemes, would be grossly insufficient for defining these relationships.
Gene flow dynamics revealed by MLST data
While the use of 23 WGSs from a single species is a high number for microbial phylogenetic studies, this number will seem increasingly small as sequencing becomes cheaper and faster. However, until the day when hundreds of genomes from a single species are sequenced, phylogenetic analyses will suffer from limited taxon sampling that may not represent natural diversity. It is therefore imperative that such phylogenetic information be integrated with data from a wider sampling of isolates, even though genotyping data will be more limited. Despite the limitations of MLST and the program eBURST for inferring exact evolutionary relationships between individual isolates [26, 36], a large amount of MLST data nonetheless represents a valuable resource from which population-level trends can be gleaned. Analysis of allele frequencies can facilitate recognition of distinct populations, and comparisons of allelic diversity among populations are informative since ancient populations are expected to be more diverse than more recent populations, wherein genetic diversity is limited by founder effects. Also, levels of intra- and inter-population connectedness, as measured by allele sharing, can be suggestive of levels of horizontal gene transfer and relatedness. Lastly, MLST data can be used to estimate levels of recombination that provide insights into LGT frequencies by measuring homoplasy, the standardized index of association, and the relative contribution of recombination and mutation in generating diversity. We therefore analyzed MLST data from >1,700 isolates of B. thailandensis, B. pseudomallei, and B. mallei from an online database (http://bpseudomallei.mlst.net) downloaded on July 28, 2008. B. pseudomallei isolates in this database were collected from human and animal infections as well as a variety of environmental sources.
As might be expected, STs that are found in clinical and animal cases are a subset of those found in the environment. Approximately 47% of STs are from Southeast Asia, 45% are from Australasia, and 8% are from other geographic regions.
The approaches described here provide a framework for phylogenetic analysis of species such as B. pseudomallei with high levels of LGT and homoplasy. Selecting only SNPs from whole genome comparisons eliminated faster evolving loci that are more prone to homoplasy , and sampling >14,000 SNPs spread across the genome reduced the confounding effects of convergent evolution and LGT. Deleted and duplicated genomic regions in Burkholderia are frequent [11, 21, 50, 13] and can lead to missing data and sampling paralogous rather than orthologous loci [51, 52], respectively. We therefore selected loci that were always present but not duplicated in any of the sequenced genomes. High clade credibility values coupled with a non-conflicting phylogenetic pattern of homoplastic SNPs provided confidence in the phylogenetic hypotheses presented here. The inferred phylogenies are a meaningful approximation of descent, and not simply a depiction of the inevitable stochastic variation in similarity that would be present in products of any finite random sampling of an infinitely panmictic population.
The validity of using phylogenetic trees to depict the evolutionary history of organisms exhibiting LGT has been hotly debated, with some authors championing web-like structures to depict instances of reticulate evolution  and others suggesting the importance and appropriateness of discerning patterns of vertical inheritance . Certainly, intra- and interspecific genetic exchange has shaped the genome of extant Burkholderia isolates. However, although a large proportion of the genome may have been shaped this way over evolutionary time, only a very small portion of the genome is laterally inherited from generation to generation. Thus, a phylogenetic tree remains a valid way of representing the major patterns of descent for these species. On such a tree, the small connective threads that depict LGT and discordant individual gene phylogenies can subsequently be strung as individual genes are studied.
It is likely that the most recent common ancestor to B. pseudomallei existed on the Australian continent. Our phylogenetic analyses indicate a tendency for Australian B. pseudomallei isolates to be associated with a more ancient common ancestor compared to other isolates. This pattern is also supported by completely independent MLST results from 599 B. pseudomallei STs, which showed that the Australasian population is defined by greater allelic diversity and fewer shared alleles. The presence of B. thailandensis isolates in Australia and the phylogenetic position of Burkholderia sp. MSMB43 point to the possibility that Burkholderia sp. MSMB43, B. thailandensis, B. pseudomallei, and B. mallei isolates are all descendants from an Australian B. thailandensis-like isolate, although this pattern is based on very few B. thailandensis and Burkholderia sp. isolates. As more B. thailandensis isolates are discovered, their phylogenetic and geographic associations will be critical for confirming or rejecting this provisional hypothesis.
The monophyletic B. mallei clade diverged from B. pseudomallei before the current Southeast Asian population was established (Figure 2). The long branch leading to B. mallei strains suggests a long passage of time before a rapid radiation led to the extant population. A high consistency index among SNPs from whole genome comparisons of B. mallei strains provides evidence for a completely clonal mode of descent for this species since its relatively recent radiation, in contrast with B. pseudomallei. The lack of LGT among B. mallei isolates is not surprising given the loss of recombination opportunities associated with host sequestration and inability to thrive in the environment; it is likely that LGT between B. mallei and B. pseudomallei has not occurred for these same reasons. Although host specialization may account for the differential rates of LGT between the B. pseudomallei and B. mallei populations, other barriers may influence LGT among B. pseudomallei populations.
The mechanistic basis for the high recombination frequencies observed in Southeast Asian populations of B. pseudomallei, compared to Australian populations, is of considerable interest. As sequences diverge, the likelihood of homologous recombination decreases [55–58]. Therefore, the greater genetic distances among Australian B. pseudomallei strains may, in part, explain lower levels of LGT in this population versus the more closely related and more connected Southeast Asian population. However, B. thailandensis shares more alleles with the Southeast Asian population of B. pseudomallei than with the Australian population (7:1), providing some evidence that LGT between species does occur despite genetic divergence. Different levels of LGT among populations may be due to greater abundance of B. thailandensis in Southeast Asia, providing greater opportunities for physical contact and LGT. In Australia, the typically lower abundance of B. pseudomallei in the environment may account for lower rates of LGT in comparison to the Southeast Asian population. Large, intensively farmed artificial wetlands such as the rice paddy fields of Thailand may favor high cell densities and mobility of strains. Conversely, the largely tropical savannah areas of Northern Australia, dispersed over vast distances with limited low density grazing and human populations, would be expected to impede gene flow. A third scenario is that these populations may have evolved differential intrinsic LGT rates; however, we have no evidence to support this hypothesis.
B. pseudomallei is subdivided into two distinct subpopulations with distinct geographic distributions that are separated by Wallace's Line. For hundreds of years naturalists have noted a tendency for plant and animal populations to be divided along Wallace's Line  but, to our knowledge, no prokaryotic examples have been reported. Two mutually exclusive hypotheses may explain the biogeographic separation of the Australian B. pseudomallei population from the more recent Asian population along Wallace's Line, both of which are reliant on the geological history of the region. Islands on the western side of Wallace's Line are part of the Eurasian tectonic plate, whereas those on the eastern side are on the Australian plate . Perhaps B. pseudomallei was introduced into Southeast Asia after the late Miocene (approximately 12 million years ago (Ma)) collision of these two plates in the vicinity of Wallace's Line. Conversely, like other species, the biogeographic separation may have begun with the divergence of an ancestral population living in Gondwanaland. This initial divergence would be related to plate tectonic motion approximately 140 Ma when the Indian subcontinent split from Gondwanaland. Populations could have been subsequently introduced into Asia during the collision of the Indian plate and the Eurasian plate that began approximately 55 Ma  and then spread to the western edge of Wallace's Line. It was previously postulated that B. pseudomallei may have originated in Gondwanaland and dispersed with the breakup of that ancient supercontinent (the Gondwana hypothesis), or alternatively dispersed from Australia to Southeast Asia via the later Miocene land bridges that partially linked those regions . However, low MLST allelic diversity and sharing of prevalent alleles between strains from Australia and Southeast Asia suggests that B. pseudomallei may actually be a much younger species . 
A founding population must therefore have crossed Wallace's Line more recently than the late Miocene. Such an event would have to be rare to allow for genetic divergence to occur; indeed, B. pseudomallei does not survive well in sea water [66, 67]. Although all molecular clock estimates are fraught with potential inaccuracies regarding estimates of mutation fixation rates and generation times, these two dispersion hypotheses differ by more than an order of magnitude (<12 Ma and >140 Ma), making it likely that even a rough estimate of divergence times can discriminate between these two hypotheses. Indeed, using a range of mutation rates and generation times similar to those determined in other bacterial species, our molecular clock estimates support the hypothesis of a founding population of B. pseudomallei crossing Wallace's Line and becoming isolated from the larger population, with subsequent spread throughout Southeast Asia (Additional file 6). The range of our estimates for the time of divergence between the two populations (16 thousand years ago (Ka) - 225 Ka) coincides with the times of recent glacial periods when low sea levels would have maximized the potential for dispersion amongst what are now islands in the Malay Archipelago. We also dated the last common B. pseudomallei ancestor to between 24.9 Ka and 346 Ka and the divergence of B. thailandensis and B. pseudomallei to between 307 Ka and 4.27 Ma.
Our results demonstrate that, given large amounts of molecular data and extensive sampling, past evolutionary and biogeographic events can be reconstructed despite relatively high levels of LGT. Our use of evolutionarily informative SNPs derived from WGSs is imperative for maximizing phylogenetic resolution and reduces the likelihood that individual LGT events will corrupt the overall phylogeny, as can be expected with limited genomic sampling. Despite the problems with using limited genomic sampling schemes for determining fine scale phylogenetic patterns of relatedness in B. pseudomallei, such schemes are widely accessible and thus result in large data sets. Fortunately, the resolution of MLST data is sufficient for determining broad patterns of population dynamics and distribution for B. pseudomallei and adds this species to the growing list of bacterial species in which biogeographic structuring has been demonstrated [68, 69]. More comprehensive phylogenetic and population studies will set the stage for framing and addressing further questions about single gene evolution, dispersal, and population sub-structuring.
SNPs were detected using an in-house pipeline starting with pairwise genomic comparisons using MUMmer (Stefan Kurtz, Hamburg, Germany). We ensured orthology by requiring each SNP locus (with 100 bp on either side of the SNP) to be present in every sequenced genome; any locus that was duplicated in any genome was discarded. SNP loci with an additional SNP within 7 bp were also discarded (but see Additional file 3) to avoid possible artifacts of slight alignment errors. An average quality score of ≥15 was required for the 10 bases on each side of a SNP.
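The filtering criteria above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the in-house pipeline: the `SnpCandidate` record, its field names, and the strain name `strainB` are hypothetical, the copy counts and flanking quality scores are assumed to have been extracted from the pairwise alignments beforehand, "within 7 bp" is read as a distance of 7 or fewer positions, and the quality average is taken over both flanks pooled together.

```python
from dataclasses import dataclass
from typing import Dict, List

MIN_SNP_SPACING = 7   # loci with another SNP within 7 bp are discarded
MIN_AVG_QUAL = 15     # required average quality over the flanking bases


@dataclass
class SnpCandidate:
    """Hypothetical record describing one candidate SNP locus."""
    position: int                      # coordinate in a reference genome
    copies: Dict[str, int]             # genome -> matches of the SNP +/- 100 bp
    flank_quals: Dict[str, List[int]]  # genome -> qualities of 10 bases per side


def is_orthologous(snp: SnpCandidate, genomes: List[str]) -> bool:
    # Present in every genome and duplicated in none: exactly one copy each.
    return all(snp.copies.get(g, 0) == 1 for g in genomes)


def has_close_neighbor(snp: SnpCandidate, candidates: List[SnpCandidate]) -> bool:
    # True if any other candidate SNP lies within MIN_SNP_SPACING positions.
    return any(
        other is not snp
        and abs(other.position - snp.position) <= MIN_SNP_SPACING
        for other in candidates
    )


def has_good_quality(snp: SnpCandidate) -> bool:
    # Assumption: both flanks are pooled into one average per genome.
    for quals in snp.flank_quals.values():
        if not quals or sum(quals) / len(quals) < MIN_AVG_QUAL:
            return False
    return True


def filter_snps(candidates: List[SnpCandidate], genomes: List[str]) -> List[SnpCandidate]:
    """Keep only candidates that pass all three filters."""
    return [
        snp for snp in candidates
        if is_orthologous(snp, genomes)
        and not has_close_neighbor(snp, candidates)
        and has_good_quality(snp)
    ]


genomes = ["K96243", "strainB"]
one_each = {"K96243": 1, "strainB": 1}
good_q = {"K96243": [30] * 20, "strainB": [20] * 20}
low_q = {"K96243": [10] * 20, "strainB": [30] * 20}

candidates = [
    SnpCandidate(1000, one_each, good_q),                     # passes
    SnpCandidate(2000, {"K96243": 1, "strainB": 2}, good_q),  # duplicated
    SnpCandidate(3000, {"K96243": 1}, good_q),                # absent in one genome
    SnpCandidate(4000, one_each, good_q),                     # neighbor at 4005
    SnpCandidate(4005, one_each, good_q),                     # neighbor at 4000
    SnpCandidate(5000, one_each, low_q),                      # low flank quality
]
kept = filter_snps(candidates, genomes)
print([snp.position for snp in kept])  # [1000]
```

Checking neighbors against the full candidate list, rather than against the loci that survive the other filters, is a deliberate choice in this sketch: a nearby SNP signals a possible alignment problem whether or not that neighbor is itself retained.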
The evolutionary model that best fit the SNP data derived from whole genome comparisons was determined by Modeltest 3.6 (David Posada, Vigo, Spain) using the Akaike Information Criterion. The best fit model was used in MrBayes 3.1 (John Huelsenbeck, Bret Larget, Paul van der Mark, Fredrik Ronquist, Donald Simon; Tallahassee, Florida, USA) for phylogenetic inference. The Markov Chain Monte Carlo algorithm was run for 2,000,000 generations and sampled every 100 generations. A burn-in set of 2,000 trees was discarded. For all data sets, we ran the default of four chains, and the log likelihood converged on a stable value well before 2,000 trees. The program SplitsTree4 (Daniel Huson and David Bryant; Tübingen, Germany) was used to compute a Neighbor-Net network using uncorrected distances and equal angle splits.
Analysis of residual homoplasies
Combining loci with different evolutionary histories, due to LGT, can result in a tree that reflects neither the history of individual genes, nor the history of the group of organisms. To determine whether LGT changed the core phylogenetic relationships among isolates, we created trees using only homoplastic SNPs. Each homoplastic SNP allele will have been inherited in a vertical manner (from mother cell to daughter cell) for the vast majority of its history, and only transferred laterally a few times. As such, the phylogenetic information content for homoplastic loci reflects both the evolutionary history of vertical descent as well as the history of LGT or convergent mutations. If LGT events involve small genomic regions and occur among a variety of lineages, the portion of phylogenetic information content due to LGT will be incongruent across genomic regions, whereas the portion that reflects the patterns of vertical descent will remain congruent. The incongruent information due to LGT will be diluted by conflicting LGT patterns from other loci, allowing the portion of phylogenetic information due to vertical descent to dictate tree structure. Thus, we identified 8,213 non-homoplastic SNPs as those loci whose allelic differences could be explained by a single change, given the phylogeny of Figure 2b. These non-homoplastic loci (Additional file 4a) were excluded from a Bayesian phylogenetic analysis of only the 6,331 remaining homoplastic SNPs as performed for Figure 2b (Additional file 4b). The differences between the tree using all 14,544 SNPs and the 6,331 homoplastic SNPs are highlighted (Additional file 4c).
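One way to decide whether a SNP can be "explained by a single change" on a given tree is to count the minimum number of state changes with Fitch parsimony. The text does not say how the counting was actually done, so the sketch below uses Fitch's algorithm as a stand-in technique. The tree is assumed to be a rooted, strictly bifurcating nested tuple of taxon names, and the taxon names and allele patterns are invented for illustration.

```python
def fitch_changes(tree, alleles):
    """Minimum number of state changes needed to explain `alleles` on `tree`.

    `tree` is a nested 2-tuple of taxon names (assumed representation);
    `alleles` maps each taxon name to its base at one SNP locus.
    """
    changes = 0

    def visit(node):
        nonlocal changes
        if isinstance(node, str):        # leaf: its state set is the observed base
            return {alleles[node]}
        left, right = node
        a, b = visit(left), visit(right)
        common = a & b
        if common:                       # children agree: no change needed here
            return common
        changes += 1                     # children disagree: one change required
        return a | b

    visit(tree)
    return changes


def is_homoplastic(tree, alleles):
    # Following the criterion in the text: a locus is non-homoplastic when a
    # single change explains it. Note that this also flags sites with three
    # or more alleles, which need at least two changes.
    return fitch_changes(tree, alleles) > 1


# Hypothetical five-taxon tree and two hypothetical SNP patterns.
tree = ((("t1", "t2"), ("t3", "t4")), "t5")
clean = {"t1": "A", "t2": "A", "t3": "G", "t4": "G", "t5": "G"}
mixed = {"t1": "A", "t2": "G", "t3": "A", "t4": "G", "t5": "G"}

print(fitch_changes(tree, clean), is_homoplastic(tree, clean))  # 1 False
print(fitch_changes(tree, mixed), is_homoplastic(tree, mixed))  # 2 True
```

Applied to every locus, a test of this kind partitions the SNP set into the two classes described above; here the 8,213 non-homoplastic and 6,331 homoplastic loci sum to the full set of 14,544 SNPs.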
Population structure analyses
The program Structure 2.2  was used to analyze 601 STs across seven loci assuming one to five populations for 10 iterations each. A burn-in of 1,000 replications was discarded and 5,000 additional replications were analyzed. The burn-in period was sufficient for stabilization of log likelihood values. The plot shown (Figure 3) for each value of K is based on the run with the highest likelihood value. Likewise, FST values for K = 2 were calculated from the Structure 2.2 run with the highest likelihood value and show the divergence of each population from an estimate of ancestral allele frequencies. The level of divergence between populations (ΦPT) was computed with 999 permutations using GenAlEx (Rod Peakall and Peter Smouse. Canberra, Australia) . For this test, STs were assigned to either an Australian or Southeast Asian population based on the geographic region designated on the MLST database.
Calculation of other population metrics
The program START2 (Keith Jolley; Oxford, UK) was used to calculate the standardized index of association using only STs. Nei's diversity index (D = 1 − ∑(allele frequency)²) was calculated for each MLST locus and averaged across all seven loci for each species or population. To calculate the relative contribution of recombination and mutation to allelic variation, we used the methods described elsewhere, except that we used the program eBURST to identify the most likely ancestral ST for each clonal complex. For the purposes of these calculations, the B. pseudomallei STs were divided into Australasian and Southeast Asian populations based on 95% assignment by the program Structure (see 'Population structure analyses' above). The two STs assigned to B. mallei were excluded from these groups. The Neisseria MLST database contained both N. gonorrhoeae and N. meningitidis STs, which were separated based on species labelling within the database. The Campylobacter database also contained both C. jejuni and C. coli STs, some of which were incorrectly labelled (EPP unpublished data). We therefore identified errors based on phylogenetic grouping and eliminated STs with ambiguous assignments. The recombination to mutation ratios for C. jejuni that we report here are similar to previously published values.
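The diversity calculation described above is simple enough to show directly. The following is a minimal sketch that assumes each ST is represented as a tuple of allele numbers, one per locus. The function names and the toy profiles are hypothetical, and no small-sample correction is applied because the formula in the text does not include one.

```python
from collections import Counter


def nei_diversity(alleles):
    """D = 1 - sum(p_i^2), where p_i is the frequency of allele i at one locus."""
    n = len(alleles)
    counts = Counter(alleles)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def mean_diversity(profiles):
    """Average the per-locus D across all loci of a set of ST profiles."""
    loci = list(zip(*profiles))          # transpose: one tuple of alleles per locus
    return sum(nei_diversity(locus) for locus in loci) / len(loci)


# Toy example with two loci and four STs (real MLST profiles have seven loci).
profiles = [(1, 1), (1, 2), (2, 3), (2, 4)]
print(nei_diversity([1, 1, 2, 2]))   # 0.5
print(mean_diversity(profiles))      # 0.625
```

Because the index is computed over STs rather than isolates, each allelic profile contributes once regardless of how many isolates share it.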
Molecular clock calculations
Molecular clock estimates were performed by a set of in-house Perl and Java scripts. First, all protein-coding gene sequences (excluding pseudogenes) from both chromosomes of B. pseudomallei strain K96243 were downloaded as gene references. A BLAST-like Alignment Tool (BLAT) database  was constructed for all B. pseudomallei, B. mallei, and B. thailandensis genomes used in the comparisons and then BLAT was performed for each K96243 gene against the combined Burkholderia database. The BLAT output was used to align the gene sequences for all taxa relative to the coding direction in each K96243 gene. Numbers of observed synonymous (sSNPs) and potential synonymous SNP sites (sSites) were calculated for each taxon pair. Ages of divergence were calculated for each taxon pair using the following formula: Age = sSNPs/(MR × sSites × generations × 2). Divergence times at a given bifurcation were calculated by averaging the times calculated for every taxon pair that shared that bifurcation point. As mutation rates (MR) and generations per year are unknown for B. pseudomallei, B. mallei, and B. thailandensis, we used a range of MR [78, 79] and generations per year  to calculate the divergence point of the Australian and Asian populations of B. pseudomallei.
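The age formula and the averaging over taxon pairs can be written out as follows. This is a sketch only: the function names are hypothetical, the mutation rate is assumed to be expressed per site per generation, "generations" is taken to mean generations per year so that ages come out in years, and the numbers in the example are placeholders rather than values from this study.

```python
def divergence_age(s_snps, s_sites, mutation_rate, generations_per_year):
    """Age = sSNPs / (MR x sSites x generations x 2), in years.

    The factor of 2 accounts for mutations accumulating along both
    lineages since their common ancestor.
    """
    return s_snps / (mutation_rate * s_sites * generations_per_year * 2)


def bifurcation_age(pairs, mutation_rate, generations_per_year):
    """Mean age over all taxon pairs that share one bifurcation point.

    `pairs` is a list of (sSNPs, sSites) tuples, one per taxon pair.
    """
    ages = [
        divergence_age(s_snps, s_sites, mutation_rate, generations_per_year)
        for s_snps, s_sites in pairs
    ]
    return sum(ages) / len(ages)


def age_range(pairs, mutation_rates, generations_options):
    """Smallest and largest mean age over all parameter combinations."""
    ages = [
        bifurcation_age(pairs, mr, gens)
        for mr in mutation_rates
        for gens in generations_options
    ]
    return min(ages), max(ages)


# Placeholder inputs: two taxon pairs spanning the same bifurcation.
pairs = [(120, 1_000_000), (80, 1_000_000)]
youngest, oldest = age_range(
    pairs,
    mutation_rates=[1e-9, 1e-10],      # per site per generation (illustrative)
    generations_options=[100, 300],    # generations per year (illustrative)
)
print(round(youngest), round(oldest))
```

Because age is inversely proportional to both the mutation rate and the number of generations per year, the widest interval always comes from pairing the two largest values (youngest age) and the two smallest values (oldest age).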
Abbreviations
BLAT: BLAST-like alignment tool
LGT: lateral gene transfer
MLST: multilocus sequence typing
MLVA: multiple-locus VNTR analysis
SNP: single nucleotide polymorphism
VNTR: variable number tandem repeat
WGS: whole genome sequence.
Acknowledgements
We would like to thank Richard Lenski for helpful comments on a previous version of this manuscript. This work was supported by the U.S. Department of Homeland Security S&T CB Division Bioforensics R&D Program, NIH-NIAID grants U54AI-56359 and U01AI-075568, and Project Grant (no. 383504) from the Australian National Health and Medical Research Council. Use of products/names does not constitute endorsement by DHS or USG. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- Achtman M, Morelli G, Zhu P, Wirth T, Diehl I, Kusecek B, Vogler AJ, Wagner DM, Allender CJ, Easterday WR, et al: Microevolution and history of the plague bacillus, Yersinia pestis. Proc Natl Acad Sci USA. 2004, 101: 17837-17842. 10.1073/pnas.0408026101.
- Alland D, Whittam TS, Murray MB, Cave MD, Hazbon MH, Dix K, Kokoris M, Duesterhoeft A, Eisen JA, Fraser CM, Fleischmann RD: Modeling bacterial evolution with comparative-genome-based marker systems: application to Mycobacterium tuberculosis evolution and pathogenesis. J Bacteriol. 2003, 185: 3392-3399. 10.1128/JB.185.11.3392-3399.2003.
- Holt KE, Parkhill J, Mazzoni CJ, Roumagnac P, Weill FX, Goodhead I, Rance R, Baker S, Maskell DJ, Wain J, et al: High-throughput sequencing provides insights into genome variation and evolution in Salmonella Typhi. Nat Genet. 2008, 40: 987-993. 10.1038/ng.195.
- Kennedy AD, Otto M, Braughton KR, Whitney AR, Chen L, Mathema B, Mediavilla JR, Byrne KA, Parkins LD, Tenover FC, et al: Epidemic community-associated methicillin-resistant Staphylococcus aureus: recent clonal expansion and diversification. Proc Natl Acad Sci USA. 2008, 105: 1327-1332. 10.1073/pnas.0710217105.
- Pearson T, Busch JD, Ravel J, Read TD, Rhoton SD, U'Ren JM, Simonson TS, Kachur SM, Leadem RR, Cardon ML, et al: Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing. Proc Natl Acad Sci USA. 2004, 101: 13536-13541. 10.1073/pnas.0403844101.
- Schoen C, Blom J, Claus H, Schramm-Gluck A, Brandt P, Muller T, Goesmann A, Joseph B, Konietzny S, Kurzai O, et al: Whole-genome comparison of disease and carriage strains provides insights into virulence evolution in Neisseria meningitidis. Proc Natl Acad Sci USA. 2008, 105: 3473-3478. 10.1073/pnas.0800151105.
- Smith EE, Buckley DG, Wu Z, Saenphimmachak C, Hoffman LR, D'Argenio DA, Miller SI, Ramsey BW, Speert DP, Moskowitz SM, et al: Genetic adaptation by Pseudomonas aeruginosa to the airways of cystic fibrosis patients. PNAS. 2006, 103: 8487-8492. 10.1073/pnas.0602138103.
- Zhang W, Qi W, Albert TJ, Motiwala AS, Alland D, Hyytia-Trees EK, Ribot EM, Fields PI, Whittam TS, Swaminathan B: Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms. Genome Res. 2006, 16: 757-767. 10.1101/gr.4759706.
- Vogler AJ, Birdsell D, Price LB, Bowers JR, Beckstrom-Sternberg SM, Auerbach RK, Beckstrom-Sternberg JS, Johansson A, Clare A, Buchhagen JL, et al: Phylogeography of Francisella tularensis: Global Expansion of a Highly Fit Clone. J Bacteriol. 2009, 191: 2474-2484. 10.1128/JB.01786-08.
- Gutacker MM, Smoot JC, Migliaccio CA, Ricklefs SM, Hua S, Cousins DV, Graviss EA, Shashkina E, Kreiswirth BN, Musser JM: Genome-wide analysis of synonymous single nucleotide polymorphisms in Mycobacterium tuberculosis complex organisms: resolution of genetic relationships among closely related microbial strains. Genetics. 2002, 162: 1533-1543.
- Holden MT, Titball RW, Peacock SJ, Cerdeno-Tarraga AM, Atkins T, Crossman LC, Pitt T, Churcher C, Mungall K, Bentley SD, et al: Genomic plasticity of the causative agent of melioidosis, Burkholderia pseudomallei. Proceedings of the National Academy of Sciences of the United States of America. 2004, 101: 14240-14245. 10.1073/pnas.0403302101.
- Tuanyok A, Auerbach RK, Brettin TS, Bruce DC, Munk AC, Detter JC, Pearson T, Hornstra H, Sermswan RW, Wuthiekanun V, et al: A horizontal gene transfer event defines two distinct groups within Burkholderia pseudomallei that have dissimilar geographic distributions. Journal of bacteriology. 2007, 189: 9044-9049. 10.1128/JB.01264-07.
- Tuanyok A, Leadem BR, Auerbach RK, Beckstrom-Sternberg SM, Beckstrom-Sternberg JS, Mayo M, Wuthiekanun V, Brettin TS, Nierman WC, Peacock SJ, et al: Genomic islands from five strains of Burkholderia pseudomallei. BMC Genomics. 2008, 9: 566. 10.1186/1471-2164-9-566.
- White NJ: Melioidosis. The Lancet. 2003, 361: 1715-1722. 10.1016/S0140-6736(03)13374-0.
- Cheng AC, Currie BJ: Melioidosis: epidemiology, pathophysiology, and management. Clin Microbiol Rev. 2005, 18: 383-416. 10.1128/CMR.18.2.383-416.2005.
- Godoy D, Randle G, Simpson AJ, Aanensen DM, Pitt TL, Kinoshita R, Spratt BG: Multilocus sequence typing and evolutionary relationships among the causative agents of melioidosis and glanders, Burkholderia pseudomallei and Burkholderia mallei. J Clin Microbiol. 2003, 41: 2068-2079. 10.1128/JCM.41.5.2068-2079.2003.
- Sprague LD, Neubauer H: Melioidosis in Animals: A Review on Epizootiology, Diagnosis and Clinical Presentation. Journal of Veterinary Medicine Series B. 2004, 51: 305-320. 10.1111/j.1439-0450.2004.00797.x.
- Low Choy J, Mayo M, Janmaat A, Currie BJ: Animal melioidosis in Australia. Acta Tropica. 2000, 74: 153. 10.1016/S0001-706X(99)00065-0.
- Glass MB, Steigerwalt AG, Jordan JG, Wilkins PP, Gee JE: Burkholderia oklahomensis sp. nov., a Burkholderia pseudomallei-like species formerly known as the Oklahoma strain of Pseudomonas pseudomallei. Int J Syst Evol Microbiol. 2006, 56: 2171-2176. 10.1099/ijs.0.63991-0.
- Pearson T, U'Ren JM, Schupp JM, Allan GJ, Foster PG, Mayo MJ, Gal D, Choy JL, Daugherty RL, Kachur S, et al: VNTR analysis of selected outbreaks of Burkholderia pseudomallei in Australia. Infect Genet Evol. 2007, 7: 416-423. 10.1016/j.meegid.2006.12.002.
- U'Ren JM, Schupp JM, Pearson T, Hornstra H, Friedman CL, Smith KL, Daugherty RR, Rhoton SD, Leadem B, Georgia S, et al: Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping. BMC Microbiol. 2007, 7: 23. 10.1186/1471-2180-7-23.
- Maiden MCJ, Bygraves JA, Feil E, Morelli G, Russell JE, Urwin R, Zhang Q, Zhou J, Zurth K, Caugant DA, et al: Multilocus sequence typing: A portable approach to the identification of clones within populations of pathogenic microorganisms. PNAS. 1998, 95: 3140-3145. 10.1073/pnas.95.6.3140.
- Cheng AC, Godoy D, Mayo M, Gal D, Spratt BG, Currie BJ: Isolates of Burkholderia pseudomallei from Northern Australia Are Distinct by Multilocus Sequence Typing, but Strain Types Do Not Correlate with Clinical Presentation. J Clin Microbiol. 2004, 42: 5477-5483. 10.1128/JCM.42.12.5477-5483.2004.
- Vesaratchavest M, Tumapa S, Day NPJ, Wuthiekanun V, Chierakul W, Holden MTG, White NJ, Currie BJ, Spratt BG, Feil EJ, Peacock SJ: Nonrandom distribution of Burkholderia pseudomallei clones in relation to geographical location and virulence. J Clin Microbiol. 2006, 44: 2553-2557. 10.1128/JCM.00629-06.
- Robertson GA, Thiruvenkataswamy V, Shilling H, Price EP, Huygens F, Henskens FA, Giffard PM: Identification and interrogation of highly informative single nucleotide polymorphism sets defined by bacterial multilocus sequence typing databases. J Med Microbiol. 2004, 53: 35-45. 10.1099/jmm.0.05365-0.
- Rokas A, Williams BL, King N, Carroll SB: Genome-scale approaches to resolving incongruence in molecular phylogenies. Nature. 2003, 425: 798-804. 10.1038/nature02053.
- Ong C, Ooi CH, Wang D, Chong H, Ng KC, Rodrigues F, Lee MA, Tan P: Patterns of large-scale genomic variation in virulent and avirulent Burkholderia species. Genome Res. 2004, 14: 2295-2307. 10.1101/gr.1608904.
- Pearson T, Okinaka RT, Foster JT, Keim P: Phylogenetic understanding of clonal populations in an era of whole genome sequencing. Infect Genet Evol. 2009, 9: 1010-9. 10.1016/j.meegid.2009.05.014.
- Worobey M: Anthrax and the art of war (against ascertainment bias). Heredity. 2005, 94: 459-460. 10.1038/sj.hdy.6800636.
- Jakobsson M, Scholz SW, Scheet P, Gibbs JR, VanLiere JM, Fung H-C, Szpiech ZA, Degnan JH, Wang K, Guerreiro R, et al: Genotype, haplotype and copy-number variation in worldwide human populations. Nature. 2008, 451: 998. 10.1038/nature06742.
- Li JZ, Absher DM, Tang H, Southwick AM, Casto AM, Ramachandran S, Cann HM, Barsh GS, Feldman M, Cavalli-Sforza LL, Myers RM: Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation. Science. 2008, 319: 1100-1104. 10.1126/science.1153717.
- Gee J, Glass M, Novak R, Gal D, Mayo M, Steigerwalt A, Wilkins P, Currie B: Recovery of a Burkholderia thailandensis-like isolate from an Australian water source. BMC Microbiology. 2008, 8: 54. 10.1186/1471-2180-8-54.
- Felsenstein J: Cases in which parsimony and compatibility methods will be positively misleading. Systematic Zoology. 1978, 27: 401-410. 10.2307/2412923.
- Zwickl DJ, Hillis DM: Increased Taxon Sampling Greatly Reduces Phylogenetic Error. Systematic Biology. 2002, 51: 588. 10.1080/10635150290102339.
- Feil EJ, Li BC, Aanensen DM, Hanage WP, Spratt BG: eBURST: inferring patterns of evolutionary descent among clusters of related bacterial genotypes from multilocus sequence typing data. J Bacteriol. 2004, 186: 1518-1530. 10.1128/JB.186.5.1518-1530.2004.
- Turner KM, Hanage WP, Fraser C, Connor TR, Spratt BG: Assessing the reliability of eBURST using simulated populations with known ancestry. BMC Microbiol. 2007, 7: 30. 10.1186/1471-2180-7-30.
- Linz B, Balloux F, Moodley Y, Manica A, Liu H, Roumagnac P, Falush D, Stamer C, Prugnolle F, van der Merwe SW, et al: An African origin for the intimate association between humans and Helicobacter pylori. Nature. 2007, 445: 915-918. 10.1038/nature05562.
- Haubold B, Travisano M, Rainey PB, Hudson RR: Detecting linkage disequilibrium in bacterial populations. Genetics. 1998, 150: 1341-1348.
- Feil EJ, Maiden MC, Achtman M, Spratt BG: The relative contributions of recombination and mutation to the divergence of clones of Neisseria meningitidis. Mol Biol Evol. 1999, 16: 1496-1502.
- Pritchard JK, Stephens M, Donnelly P: Inference of Population Structure Using Multilocus Genotype Data. Genetics. 2000, 155: 945-959.
- Currie BJ, Thomas AD, Godoy D, Dance DA, Cheng AC, Ward L, Mayo M, Pitt TL, Spratt BG: Australian and Thai Isolates of Burkholderia pseudomallei Are Distinct by Multilocus Sequence Typing: Revision of a Case of Mistaken Identity. J Clin Microbiol. 2007, 45: 3828-3829. 10.1128/JCM.01590-07.
- Nei M: Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci USA. 1973, 70: 3321-3323. 10.1073/pnas.70.12.3321.
- Feil EJ, Holmes EC, Bessen DE, Chan MS, Day NP, Enright MC, Goldstein R, Hood DW, Kalia A, Moore CE, et al: Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. Proc Natl Acad Sci USA. 2001, 98: 182-187. 10.1073/pnas.98.1.182.
- Homan WL, Tribe D, Poznanski S, Li M, Hogg G, Spalburg E, van Embden JDA, Willems RJL: Multilocus Sequence Typing Scheme for Enterococcus faecium. J Clin Microbiol. 2002, 40: 1963-1971. 10.1128/JCM.40.6.1963-1971.2002.
- Miragaia M, Thomas JC, Couto I, Enright MC, de Lencastre H: Inferring a Population Structure for Staphylococcus epidermidis from Multilocus Sequence Typing Data. J Bacteriol. 2007, 189: 2540-2552. 10.1128/JB.01484-06.
- Feil EJ, Cooper JE, Grundmann H, Robinson DA, Enright MC, Berendt T, Peacock SJ, Smith JM, Murphy M, Spratt BG, et al: How clonal is Staphylococcus aureus?. J Bacteriol. 2003, 185: 3307-3316. 10.1128/JB.185.11.3307-3316.2003.
- Maynard Smith J, Smith NH, O'Rourke M, Spratt BG: How clonal are bacteria?. Proc Natl Acad Sci USA. 1993, 90: 4384-4388. 10.1073/pnas.90.10.4384.
- Hanage WP, Fraser C, Spratt BG: The impact of homologous recombination on the generation of diversity in bacteria. J Theor Biol. 2006, 239: 210-219. 10.1016/j.jtbi.2005.08.035.
- Keim P, Van Ert MN, Pearson T, Vogler AJ, Huynh LY, Wagner DM: Anthrax molecular epidemiology and forensics: using the appropriate marker for different evolutionary scales. Infection, Genetics and Evolution. 2004, 4: 205. 10.1016/j.meegid.2004.02.005.
- Nierman WC, DeShazer D, Kim HS, Tettelin H, Nelson KE, Feldblyum T, Ulrich RL, Ronning CM, Brinkac LM, Daugherty SC, et al: Structural flexibility in the Burkholderia mallei genome. Proceedings of the National Academy of Sciences of the United States of America. 2004, 101: 14246-14251. 10.1073/pnas.0403306101.
- Fitch WM: Distinguishing homologous from analogous proteins. Syst Zool. 1970, 19: 99-113. 10.2307/2412448.
- Li C, Orti G, Zhang G, Lu G: A practical approach to phylogenomics: the phylogeny of ray-finned fish (Actinopterygii) as a case study. BMC Evolutionary Biology. 2007, 7: 44. 10.1186/1471-2148-7-44.
- Doolittle WF: Phylogenetic Classification and the Universal Tree. Science. 1999, 284: 2124-2128. 10.1126/science.284.5423.2124.
- Puigbo P, Wolf YI, Koonin EV: Search for a 'Tree of Life' in the thicket of the phylogenetic forest. J Biol. 2009, 8: 59. 10.1186/jbiol159.
- Majewski J, Zawadzki P, Pickerill P, Cohan FM, Dowson CG: Barriers to Genetic Exchange between Bacterial Species: Streptococcus pneumoniae Transformation. J Bacteriol. 2000, 182: 1016-1023. 10.1128/JB.182.4.1016-1023.2000.
- Townsend JP, Nielsen KM, Fisher DS, Hartl DL: Horizontal Acquisition of Divergent Chromosomal DNA in Bacteria: Effects of Mutator Phenotypes. Genetics. 2003, 164: 13-21.
- Vulic M, Dionisio F, Taddei F, Radman M: Molecular keys to speciation: DNA polymorphism and the control of genetic exchange in enterobacteria. Proceedings of the National Academy of Sciences of the United States of America. 1997, 94: 9763-9767. 10.1073/pnas.94.18.9763.
- Watt VM, Ingles CJ, Urdea MS, Rutter WJ: Homology requirements for recombination in Escherichia coli. Proceedings of the National Academy of Sciences of the United States of America. 1985, 82: 4768-4772. 10.1073/pnas.82.14.4768.
- Kaestli M, Mayo M, Harrington G, Watt F, Hill J, Gal D, Currie BJ: Sensitive and Specific Molecular Detection of Burkholderia pseudomallei, the Causative Agent of Melioidosis, in the Soil of Tropical Northern Australia. Appl Environ Microbiol. 2007, 73: 6891-6897. 10.1128/AEM.01038-07.
- Smith MD, Wuthiekanun V, Walsh AL, White NJ: Quantitative recovery of Burkholderia pseudomallei from soil in Thailand. Trans R Soc Trop Med Hyg. 1995, 89: 488-490. 10.1016/0035-9203(95)90078-0.
- Cheng AC, Ward L, Godoy D, Norton R, Mayo M, Gal D, Spratt BG, Currie BJ: Genetic Diversity of Burkholderia pseudomallei Isolates in Australia. J Clin Microbiol. 2008, 46: 249-254. 10.1128/JCM.01725-07.
- Wallace AR: Letter from Mr. Wallace concerning the geographical distribution of birds. Ibis. 1859, 1: 449-454. 10.1111/j.1474-919X.1859.tb06226.x.
- Hall R: Cenozoic geological and plate tectonic evolution of SE Asia and the SW Pacific: computer-based reconstructions and animations. Journal of Asian Earth Sciences. 2002, 20: 353-434. 10.1016/S1367-9120(01)00069-4.
- DeCelles PG, Gehrels GE, Najman Y, Martin AJ, Carter A, Garzanti E: Detrital geochronology and geochemistry of Cretaceous-Early Miocene strata of Nepal: implications for timing and diachroneity of initial Himalayan orogenesis. Earth and Planetary Science Letters. 2004, 227: 313-330. 10.1016/j.epsl.2004.08.019.
- Cheng AC, Ward L, Godoy D, Norton R, Mayo M, Gal D, Spratt BG, Currie BJ: Genetic Diversity of Burkholderia pseudomallei Isolates in Australia. J Clin Microbiol. 2008, 46: 249-254. 10.1128/JCM.01725-07.
- Chen YS, Chen SC, Kao CM, Chen YL: Effects of Soil pH, Temperature and Water Content on the Growth of Burkholderia pseudomallei. Folia Microbiologica (Praha). 2003, 48: 253-256. 10.1007/BF02930965.
- Inglis TJJ, Sagripanti J-L: Environmental Factors That Affect the Survival and Persistence of Burkholderia pseudomallei. Appl Environ Microbiol. 2006, 72: 6865-6875. 10.1128/AEM.01036-06.
- Fuhrman JA, Steele JA, Hewson I, Schwalbach MS, Brown MV, Green JL, Brown JH: A latitudinal diversity gradient in planktonic marine bacteria. Proceedings of the National Academy of Sciences. 2008, 105: 7774-7778. 10.1073/pnas.0803070105.
- Hughes Martiny JB, Bohannan BJM, Brown JH, Colwell RK, Fuhrman JA, Green JL, Horner-Devine MC, Kane M, Krumins JA, Kuske CR, et al: Microbial biogeography: putting microorganisms on the map. Nat Rev Micro. 2006, 4: 102. 10.1038/nrmicro1341.
- Kurtz S, Phillippy A, Delcher A, Smoot M, Shumway M, Antonescu C, Salzberg S: Versatile and open software for comparing large genomes. Genome Biology. 2004, 5: R12. 10.1186/gb-2004-5-2-r12.
- Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.
- Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19: 1572-1574. 10.1093/bioinformatics/btg180.
- Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol. 2006, 23: 254-267. 10.1093/molbev/msj030.
- Peakall R, Smouse PE: genalex 6: genetic analysis in Excel. Population genetic software for teaching and research. Molecular Ecology Notes. 2006, 6: 288-295. 10.1111/j.1471-8286.2005.01155.x.
- Jolley KA, Feil EJ, Chan MS, Maiden MC: Sequence type analysis and recombinational tests (START). Bioinformatics. 2001, 17: 1230-1231. 10.1093/bioinformatics/17.12.1230.
- Schouls LM, Reulen S, Duim B, Wagenaar JA, Willems RJL, Dingle KE, Colles FM, Van Embden JDA: Comparative Genotyping of Campylobacter jejuni by Amplified Fragment Length Polymorphism, Multilocus Sequence Typing, and Short Repeat Sequencing: Strain Diversity, Host Range, and Recombination. J Clin Microbiol. 2003, 41: 15-26. 10.1128/JCM.41.1.15-26.2003.
- Kent WJ: BLAT---The BLAST-Like Alignment Tool. Genome Res. 2002, 12: 656-664.
- Lenski RE, Winkworth CL, Riley MA: Rates of DNA Sequence Evolution in Experimental Populations of Escherichia coli During 20,000 Generations. Journal of Molecular Evolution. 2003, 56: 498. 10.1007/s00239-002-2423-0.
- Vogler AJ, Busch JD, Percy-Fine S, Tipton-Hunton C, Smith KL, Keim P: Molecular analysis of rifampin resistance in Bacillus anthracis and Bacillus cereus. Antimicrob Agents Chemother. 2002, 46: 511-513. 10.1128/AAC.46.2.511-513.2002.
- Ochman H, Elwyn S, Moran NA: Calibrating bacterial evolution. Proc Natl Acad Sci USA. 1999, 96: 12638-12643. 10.1073/pnas.96.22.12638.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.