Skip to main content

Table 1 Pangenome profile in 1,324 E. coli identified based on CD-HIT and ProteinOrtho. The Jaccard index measures the similarity between the two methods. The softcore genome is defined as the set of clusters of homologous genes, which exist in at least 95% of the genomes

From: To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131

Methods

PanGenome

Core Genome

Softcore Genome

Singletons

CD-HIT

25,420

425

3057

5654

ProteinOrtho

24,889

427

3056

5568

Jaccard Index

87.97%

95.41%

95.49%

93.28%