Genome mining for methanobactins
BMC Biology volume 11, Article number: 17 (2013)
Methanobactins (Mbns) are a family of copper-binding natural products involved in copper uptake by methanotrophic bacteria. The few Mbns that have been structurally characterized feature copper coordination by two nitrogen-containing heterocycles next to thioamide groups embedded in a peptidic backbone of varying composition. Mbns are proposed to derive from post-translational modification of ribosomally synthesized peptides, but only a few genes encoding potential precursor peptides have been identified. Moreover, the relevance of neighboring genes in these genomes has been unclear.
The potential for Mbn production in a wider range of bacterial species was assessed by mining microbial genomes. Operons encoding Mbn-like precursor peptides, MbnAs, were identified in 16 new species, including both methanotrophs and, surprisingly, non-methanotrophs. Along with MbnA, the core of the operon is formed by two putative biosynthetic genes denoted MbnB and MbnC. The species can be divided into five groups on the basis of their MbnA and MbnB sequences and their operon compositions. Additional biosynthetic proteins, including aminotransferases, sulfotransferases and flavin adenine dinucleotide (FAD)-dependent oxidoreductases were also identified in some families. Beyond biosynthetic machinery, a conserved set of transporters was identified, including MATE multidrug exporters and TonB-dependent transporters. Additional proteins of interest include a di-heme cytochrome c peroxidase and a partner protein, the roles of which remain a mystery.
This study indicates that Mbn-like compounds may be more widespread than previously thought, but are not present in all methanotrophs. This distribution of species suggests a broader role in metal homeostasis. These data provide a link between precursor peptide sequence and Mbn structure, facilitating predictions of new Mbn structures and supporting a post-translational modification biosynthetic pathway. In addition, testable models for Mbn transport and for methanotrophic copper regulation have emerged. Given the unusual modifications observed in Mbns characterized thus far, understanding the roles of the putative biosynthetic proteins is likely to reveal novel pathways and chemistry.
Methanotrophs are Gram-negative bacteria that use methane, a potent greenhouse gas, as their sole source of carbon and energy . As the only biological methane sink, methanotrophs have attracted much attention as a means of mitigating methane emissions [2–4]. The first step in their metabolic pathway, the oxidation of methane to methanol, is catalyzed by methane monooxygenase (MMO) enzymes, which are of broad interest in the quest to exploit abundant natural gas reserves as fuel and chemical feedstocks. Most methanotrophs utilize particulate methane monooxygenase (pMMO), a copper-dependent integral membrane enzyme [5, 6]. Under copper-limiting growth conditions, some methanotrophs can also express an alternative, soluble form of MMO (sMMO) that utilizes iron . In these methanotroph strains, the switch between pMMO and sMMO is controlled by copper: copper represses transcription of the sMMO genes and causes formation of intracytoplasmic membranes that house pMMO [8–10]. The details of this "copper switch" regulatory mechanism are not understood and represent a major outstanding question in the field.
An important part of the copper switch puzzle is the discovery of methanobactins (Mbns), a family of copper-binding natural products initially detected in the methanotroph Methylosinus trichosporium OB3b [11–13], and potentially useful in applications ranging from wastewater copper removal in the semiconductor industry  to treatment of Wilson disease, a human disorder of copper metabolism . Mbns are believed to be secreted under copper limiting conditions in a copper-free (apo) form to acquire copper from the environment and then internalized in a copper-loaded form to provide essential copper to the methanotroph [16, 17]. In support of this model, methanobactin (Mbn) promotes the copper switch [18, 19] and can mediate release of copper from insoluble mineral sources [19, 20]. In addition, direct uptake of copper-loaded Mbn (CuMbn) by Methylosinus trichosporium OB3b has been demonstrated, and proceeds via an active transport process . Because this model for Mbn function as well as aspects of its structure (vide infra) are reminiscent of iron siderophores, Mbn has also been referred to as a chalkophore  (chalko- is derived from the Greek word for copper whereas sidero- is from the Greek word for iron).
Mbn molecules from Methylosinus trichosporium OB3b, Methylocystis strain SB2, Methylocystis hirsuta CSC-1, Methylocystis strain M and Methylocystis rosea SV97T have been characterized by mass spectrometry, nuclear magnetic resonance (NMR) and crystallography (Figure 1; Additional file 1, Figure S1). These data reveal a peptidic backbone and copper coordination by two nitrogen-containing heterocycles next to thioamide groups [13, 22–25]. The Methylosinus trichosporium OB3b Mbn backbone has the sequence 1-(N-(mercapto-(5-oxo-2-(3-methylbutanoyl)-oxazol-(Z)-4-ylidene)methyl)-Gly1-Ser2-Cys3-Tyr4)-pyrrolidin-2-yl-(mercapto-(5-oxo-oxazol-(Z)-4-ylidene)methyl)-Ser5-Cys6-Met7 and is thought to derive from the peptide backbone LCGSCYPCSCM (Figure 1A).
By comparison, Methylocystis Mbns are alanine-rich, and the first nitrogen-containing heterocycle is not an oxazolone . All the Methylocystis Mbns have a similar backbone. The N-terminal residue is either arginine- or methionine-derived (the latter only in Methylocystis hirsuta CSC-1), and immediately precedes the first heterocycle. The heterocycle/thioamide pair (pyrazinadione in all structures except the NMR structure of Methylocystis strain SB2) is followed by an alanine, a serine and a sulfonated threonine (Additional file 1, Figure S1). Next is an oxaozlone/thioamide pair and an alanine followed by a methionine (Methylocystis sp. M) or a second alanine (Additional file 1, Figure S1). Additional C-terminal residues are present in some forms of the molecule. The Methylocystis rosea SV97T Mbn contains a Thr-Asn sequence , and likely derives from a peptide backbone containing the sequence RCASTCAATN (Figure 1B). Despite these structural differences, these Mbns retain their strong and specific affinity for copper .
The Mbn biosynthetic pathway has not been elucidated, and was initially suggested to involve nonribosomal peptide synthetases [16, 26], similar to production of many siderophores . However, sequencing of the Methylosinus trichosporium OB3b genome  led to the identification of a 30-amino acid open-reading frame (ORF) with similarities to the peptidic Mbn backbone , supporting previous suggestions that Mbn is produced via post-translational modification of a ribosomally synthesized precursor peptide . A similar precursor peptide was identified in an unrelated species, Azospirillum sp. B510, along with several conserved neighboring genes [17, 22], but analogous ORFs were not detected in other available methanotroph genomes, and the relevance of many of the neighboring genes surrounding the precursors was unclear.
Genes encoding the precursors of small ribosomally-produced natural products can be difficult to detect and annotate, and the underdetection of biologically relevant small ORFs is a known problem [29–31]. However, the ever-increasing rate at which bacterial genomes are released has prompted the design of genome mining tools for widespread classes of ribosomally synthesized and post-translationally modified peptide natural products (RiPPs), such as lantibiotics [32–37]. With the aim of identifying the potential for Mbn production in a wider range of bacterial species, we mined the available microbial genomes in the National Center for Biotechnology Information (NCBI) and Joint Genome Institute (JGI)/Integrated Microbial Genomes (IMG) databases, identifying 18 new Mbn-like precursors and accompanying biosynthetic genes from 16 species, including unknown or provisionally identified species present in metagenomic samples. Surprisingly, many of these precursor peptides and their operons are from non-methanotrophic species and several well-studied methanotrophic species seem to lack Mbn operons similar to that of Methylosinus trichosporium OB3b. Beyond biosynthesis-related genes, we also identified a widely-conserved set of transporters and sigma factors, which has implications for Mbn export and import as well as its involvement in cellular copper homeostasis. Finally, this bioinformatics study provides new tools to better detect Mbn-like gene clusters in novel genomes.
Results and discussion
Using a variety of bioinformatics techniques, we were able to detect putative biosynthesis operons for Mbn-like natural products in 14 new species, as well as several unidentified or tentatively identified species present in metagenomic studies (Figure 2; Additional file 2, Table S1). While five of the identified species are Type II methanotrophs like the first identified Mbn-producer, Methylosinus trichosporium OB3b, the remaining species are not. Operons were detected in β- and γ-proteobacteria as well as α-proteobacteria, to which the Type II methanotrophs belong. Both the precursor peptides and the range of non-core biosynthesis genes present in the operon hint at a set of potential modifications that may define the Mbn family. Furthermore, genes likely to be related to export, import and copper regulation are found in almost every operon. Based on sequence analysis, the presence of specific Mbn-related genes and the overall operon structure, we have provisionally divided the operons into five groups (Figure 2).
Locating the precursor peptide MbnA
Automated detection of small peptide sequences in newly-sequenced genomes is problematic . Short sequences are poorly detected by Basic Local Alignment Search Tool (BLAST) and similar sequence analysis methods, and uncurated small ORF detection results in the annotation of many spurious small ORFs. For well-established classes of small ribosomally-produced natural products, such as bacteriocins, hidden Markov model (HMM)-based tools, such as BAGEL and BAGEL2, have been developed to better detect precursors in newly sequenced genomes [32, 33]. With only two published precursor peptide sequences (MbnAs), Mbn was not a good candidate for this detection method [17, 23]. A TIGRFAM group (TIGR04071) does exist for the precursor, and is a member of the GenProp0962 family (which also includes TIGRFAM groups for MbnC and half of MbnB) , but it is based on only the two previously published precursor peptide sequences and a third suggested MbnA homologue from Gluconacetobacter sp. SXCC-1 . Four possible MbnAs detected here are also mentioned in the 2013 TIGRFAM update , but are not included in the available HMM.
Because of the limitations of direct precursor peptide detection, we pursued an alternate genome mining strategy focusing on the detection of biosynthetic proteins, followed by manual identification of unannotated precursors. This method has been used with some success for a variety of natural products, including radical S-adenosyl methionine (SAM)-modified peptides, bacteriocins in cyanobacteria, and a new class of lantibiotic-like natural products stemming from nitrile hydratase or Nif11 leader peptides [34–36, 39]. We used the MbnB and MbnC sequences from Methylosinus trichosporium OB3b  as seeds in a tBLASTn search through the NCBI's Non-redundant (NR) and Whole Genome Shotgun (WGS) databases, as well as the microbial genomes available at JGI/IMG. For every MbnB homologue detected, a 2 kb region preceding and following that gene was manually examined for 45 to 150 bp ORFs coding for short peptides with at least one cysteine in the last 10 amino acids and an N-terminal region containing multiple arginine or lysine residues.
A total of 18 novel MbnA-like ORFs were identified using these methods, one preceding every close MbnB homologue excluding truncated homologues from metagenomic sequencing. Two Methylosinus species (Methylosinus sp. LW3 and Methylocystis parvus OBBP, which may be misclassified as Methylocystis ) have two distinct MbnA genes encoding unique Mbns. While it is not uncommon for bacteria to produce multiple siderophores to control iron acquisition in different environments , a similar phenomenon has not yet been observed for chalkophores. As shown in the Multiple Alignment with Fast Fourier Transform (MAFFT) alignment (Figure 3A), both the leader and core sequences exhibit some conservation over the 20 complete sequences. MbnA-like sequences range from 23 to 35 amino acids (aa), with predicted core sequences ranging from 7 to 15 aa. The leader peptides are better conserved than the core peptides, perhaps indicating the involvement of the leader peptide in interactions with biosynthesis proteins . The leader sequences are lysine/arginine rich, with at least two such residues occurring near the beginning and one present in a conserved area immediately prior to the core sequence (Figure 3B). The core sequences are more variable, but all contain at least one C (G|A|S) (S|T) motif. Of the complete MbnA sequences, 18 have a second core cysteine and 11 contain one or two additional cysteines.
One basis for the proposed five operon groups (Figure 3C) is the nature of the MbnA sequences, including the structurally, but not genomically, characterized Methylocystis Mbns. The Group I MbnA sequences, primarily from Methylosinus genera are long (11 to 15 aa), with four non-adjacent core peptide cysteines, and contain core prolines. It is unknown whether the presence of four cysteines allows for the formation of disulfide bonds as found in Methylosinus trichosporium OB3b Mbn or whether they lead to the production of additional oxazole/thioamide pairs, analogous to the multiple thiazoles and oxazoles present in many bacteriocins .
The primarily Methylocystis Group II MbnA sequences are shorter, contain only two or three cysteines, and many have a conserved threonine which, based on NMR and crystal structures [23, 24], is likely to be a sulfotransferase target. Interestingly, the sequences from Methylocystis strain SC2 and Methylocystis rosea SV97T appear to be merged with an extracytoplasmic function (ECF) sigma factor, at least based on the annotation . It is not clear whether the precursor peptide is cleaved from these sigma factors and whether sigma factor activity remains or is altered. Although there is no structure for Mbn from Methylocystis strain SC2, its MbnA sequence and the similarity of its operon structure to that of Methylocystis rosea SV97T suggest that its Mbn will resemble Methylocystis rosea SV97T Mbn and will be identical to Methylocysis hirsuta CSC-1 Mbn . Similarly, although there are no genomes for Methylocystis strain SB2, Methylocystis strain M and Methylocystis hirsuta CSC-1, we can predict that the core peptides for their structurally characterized Mbns will be RCASTCAA, RCASTCAMT and MCASTCAAT, respectively (likely followed by -TNG, -NG and -NG), and that their leader sequences will resemble those from Methylocystis rosea SV97T and Methylocystis strain SC2 [24, 43]. A subfamily of Group II MbnAs from Methylosinus or related species (Methylosinus sp. LW3, Methylocystis parvus OBBP and a bioreactor metagenome) do not have the CASTCA(A) motif. Instead, the second cysteine is followed by a tryptophan. If the core peptide sequence dictates cysteine modification, these residues lack the C(G|A|S) motif associated with cyclization and thioamide formation in existing Mbn structures.
The remaining families include MbnAs from a variety of non-methanotrophic species. The species that have Group III MbnA sequences include two Pseudomonas species, two Azospirillum species and single species each from the Cupriavidus, Tistrella and Methylobacterium genera. In this group, the two Pseudomonas sequences, the two Azospirillum sequences and the Methylobacterium sequence are most similar, with somewhat lengthy and near-identical MbnA core sequences containing two cysteine doublets. The less similar Cupriavidus basilensis B-8 MbnA preserves the cysteine doublets whereas the Tistrella mobilis KA081020-065 sequence contains only two non-adjacent cysteines.
The Group IV MbnA sequences are currently only found in the two Gluconacetobacter species. These two sequences are nearly identical, and feature only two core cysteines, with a leader sequence potentially extended by two amino acids. Finally, the Group V MbnA-like sequences are found in Vibrio caribbenthicus ATCC BAA-2122 and Phorhabdus luminescens subsp. laumondii TT01. These sequences are short and somewhat divergent, containing only a single cysteine, which may suggest that they represent a natural product with some structural similarities to Mbn that either does not chelate copper or does not chelate copper in the same way that other chalkophores do. This overall classification scheme extends to the MbnB and MbnC sequences (vide infra), and will be subject to future modification as more MbnA sequences are identified in new genomes.
The first unknown biosynthesis protein: MbnB
MbnB is the core protein in the Mbn biosynthesis operon, and was detected in 19 operons, including truncated forms in several metagenomic samples (Figures 2 and 4; Additional file 2, Table S1). However, the initial identification of this protein in Methylosinus trichosporium OB3b has been problematic. In Methylosinus trichosporium OB3b, and one other operon detected (Methylobacterium sp. B34), MbnB is split into two ORFs, formerly annotated as MettrDRAFT_3894 and MettrDRAFT_3895, but reannotated as one entity (MettrDRAFT_3422) in a recently assembled genome build available on IMG . A TIGRFAM HMM (TIGR04159) exists for the half of the protein that resembles MettrDRAFT_3894, but does not cover MettrDRAFT_3895 . Therefore, a conjugate with a glycine replacing the stop codon between the two ORFs was used for BLAST detection and annotation.
Despite the addition of new members to the MbnB family, no motifs or domains of known function have been identified beyond occasional classification as TIM-barrel proteins . MbnB homologues may be a subfamily in the larger DUF692 family (PFAM class PF05114). However, when conducting a BLAST search or a HMM-based search for homologues, MbnB-like proteins represent a distinct subgroup, with a sharp drop-off in expectation value between the last MbnB-like protein (E <1E-50, except for sequences truncated by the end of a contig/scaffold) and other DUF692-like proteins. Notably, in the Group V operons, the MbnB gene is separated from the MbnA-like precursor by a gene-sized ORF.
A comparison of MbnB sequences (Figure 4A) strongly supports the five operon families assigned on the basis of the MbnA sequences (Figure 4B). There are about six different regions that are strikingly well-preserved, even in the Group V homologues. Without knowledge of the structure or function of MbnB, it is difficult to interpret which of these conserved regions are important. However, given that MbnB and MbnC are the only proteins with unassigned functions that are preserved in both the Methylosinus trichosporium OB3b and Methylocystis rosea SV97T operons, it is possible that one or both are responsible for the nitrogen-containing heterocycles and the neighboring thioamides that have been present in every Mbn structure obtained thus far.
The second unknown biosynthesis protein: MbnC
MbnC is the second unknown Mbn biosynthesis protein, and as with MbnB, there is an existing, if limited, TIGRFAM class (TIGR04061) . We detected MbnC-like proteins in 17 novel operons, a number that includes two fragmentary hits in a bioreactor metagenome (Figure 5A). As with MbnB, there is a broader class of distantly related hits (with high Pseudomonas representation and a more divergent C-terminal region), visible after a sharp decline in expectation value quality. This set of more distant relatives appears to correspond to the TIGRFAM family TIGR04061.
MbnC homologues are present in Groups I to IV Mbn operons. For those operons, the phylogenetic tree constructed for MbnC resembles that for MbnB and MbnA, supporting the proposed classification scheme (Figure 5B). In families with a true MbnC homologue, the predicted MbnC ORF frequently overlaps MbnB by a significant number of residues, but it is not in frame with MbnB. As with MbnB, multiple alignment of MbnC homologues confirms the broad conservation of several regions of the gene, but the relationship between the conserved regions and MbnC's potential role in biosynthesis of thioamide and nitrogen-containing heterocycles remains unclear.
The Group V operons, which appear to be the most distantly related to the Methylosinus trichosporium OB3b operon, diverge with MbnC. There do not appear to be clear Group V homologues for MbnC as there are for MbnB. There is, however, an unidentified ORF immediately neighboring the precursor, conserved primarily in these two species. This ORF could possibly encode a core biosynthetic protein for the Group V operons (Additional file 1, Figure S2). These sequences appear to have no close homologues in other species, and have a weak N-terminal similarity to the DUF692-like domain (PF05114), which is more like MbnB than MbnC.
Other biosynthesis proteins: MbnN, MbnS and MbnF
The Mbns from Methylosinus and Methylocystis species exhibit post-translational modifications beyond the formation of nitrogen-containing heterocycles and neighboring thioamides. Mbn biosynthesis in Methylosinus trichosporium OB3b requires a transamination reaction on the N-terminal amine group of the core peptide following leader peptide removal, as well as the formation of a disulfide bond, and all four Methylocystis Mbns contain a sulfonated threonine group [22, 24]. Although specific proteases and disulfide-forming proteins are not evident, we have discovered proteins likely responsible for transamination and threonine sulfonation in the Mbn biosynthesis operons of several genomes. Transaminases are present in three operons only: Methylosinus trichosporium OB3b (annotated as "histidinol phosphate transaminase/cobyric acid decarboxylase" and with a PFAM classification of PF00155 or Class I/II aminotransferase), Methylosinus sp. LW4 (also PF00155 or class I/II aminotransferase), and Gluconacetobacter sp. SXCC-1 (classified as PF00202 or Class III aminotransferase) (Figure 2). The transaminase has tentatively been designated MbnN. The paucity of transaminases in Mbn operons suggests that the N-terminal transamination present in Methylosinus trichosporium OB3b Mbn may not be a common modification.
Like the N-terminal transamination, threonine sulfonation may only be present in a subset of Mbns. To date, it has only been observed in the four structures of Mbns produced by Methylocystis species [23, 24]. Sulfotransferases with domains corresponding to Pfam family PF00685 were detected only in the two Group II Methylocystis operons. Although no structure for Mbn from Methylocystis strain SC2 is available, the similarity of its MbnA to that of Methylocystis rosea SV97T combined with the presence of a sulfotransferase in its operon strongly suggests that its Mbn will also be sulfonated, presumably at the same threonine. This sulfotransferase has been designated MbnS.
Finally, the gene encoding MbnF, generally annotated as a flavin adenine dinucleotide (FAD)-dependent monooxygenase or an FAD-dependent oxidoreductase (Pfam PF01494), is also present in six Group I and II operons (including all known Methylocystis genomes and some Methylosinus genomes), always following MbnM (Figure 2). The function of MbnF is unclear, but given its presence in the Methylocystis rosea SV97T operon and absence in the Methylosinus trichosporium OB3b operon, it could play a role in pyrazinedione biosynthesis (Figure 1), possibly hydroxylating the heterocycle. Without structures of Mbn-like products from non-methanotrophs, it is difficult to connect other neighboring genes (annotated or not) to potential biosynthetic modifications and to determine the effective ending point of the operon and potentially the end of any multicistronic mRNA transcripts. In both Methylocystis species, MbnS is followed by a gene resembling MoaA, a protein responsible for the first step in molybdenum cofactor biosynthesis  (which involves the conversion of a guanosine derivative to precursor Z) and a gene generally annotated as a 3-hydroxyisobutyrate dehydrogenase. Hypothetical unknown proteins (including the MbnC replacement in Group V operons) are present in several operons, and a range of proteins of unknown relevance, including several varieties of known copper-related proteins, appear in a few operons only (Figure 2).
Exporting methanobactin via MbnM
A proton/sodium-dependent multidrug export pump (MATE), belonging to the PFAM class PF01554, is found in 13 of the identified operons (Figure 2). Of the remaining operons, several are on small contigs in more fragmented draft genomes making it difficult to rule out the presence of a similar exporter. Excluding Vibrio caribbenthicus and Photorhabdus luminescens, which appear to have dissimilar MATE transporters, perhaps reflecting a less similar final Mbn-like product, this exporter is well-conserved, even in the non-methanotrophs Pseudomonas fluorescens NZI7 and Azospirillum sp. B510 and B506 (Additional file 1, Figure S3.) In prokaryotes, MATE transporters primarily function as exporters of antibiotics and similar toxic compounds, simultaneously importing Na+ or H+ and exporting mostly cationic natural products [46–48]. Native natural products are primarily exported by non-MATE efflux pumps, such as the resistance-nodulation-cell division (RND) or major facilitator superfamily (MFS) exporters that are believed to transport some siderophores out of the cell [49–51]. However, many MATE transporters do not have known substrates, and MATE transporters are even found in antibiotic hypersensitive strains . Thus, the ability of a MATE transporter to secrete Mbn-like compounds is plausible, if unprecedented.
Importing copper-loaded methanobactin via MbnT
A family of small molecule importers, known as TonB-dependent transporters (TBDTs), are also commonly associated with the Mbn biosynthesis operons. The only genomes for which nearby TBDTs are not observed are Vibrio caribbenthicus and Photorhabdus luminescens, as well as the second Mbn operon in Methylosinus sp. LW3, which is small and surrounded by transposon elements; contig truncation of several other operons may be hiding additional potential transporters in other species. We have shown previously that CuMbn is imported via an active process [17, 21] and TBDTs are good candidates for importers since they play a similar role for siderophores [53–56]. TBDTs found in the vicinity of Mbn operons are generally annotated as siderophore receptors and classifiable under models including TIGR01783 (full siderophore-specific TBDT model), PF00593 (TBDT barrel only), PF07715 (TBDT plug domain only) and in some cases PF07660 (an extended N-terminal region, which appears to approximate the published N-terminal extension (NExT) domain ); they have provisionally have been designated the MbnT family. Conservation of these TBDTs is weaker than that of MbnB, MbnC or MbnM; even the plug domain displays less homology (Additional file 1 Figure S4A, B). However, differences in the core peptide backbone sequence may require markedly different binding approaches. While methanotroph Mbn-related genes are generally relatively similar, the plug domain sequences of Methylocystis Group II TBDTs and Methylosinus Group I TBDTs diverge markedly, perhaps reflecting the structural differences of the final compounds (Additional file 1, Figure S4A, B.)
MbnT may have a FecIRA-like regulation system in Methylosinus species
In four operons from Methylosinus species, the TBDT has an extra N-terminal domain (Additional file 1, Figure S4C.) These larger TBDTs are preceded by an ORF generally annotated as an "Fe(III) dicitrate membrane sensor" (PFAM PF04773) and an "ECF sigma factor" (with conserved σ-70-like regions 2 (PFAM PF04542) and 4 (PFAM PF08281)), designated MbnR and MbnI, respectively (Figure 2). This pairing is generally observed for FecIRA-like systems, in which the holo siderophore-bound TBDT interacts with the membrane sensor, which then interacts with the ECF sigma factor to regulate expression of siderophore biosynthesis and transport proteins [57–60]. The earliest example of this system is the eponymous FecIRA system, which controls the transcription of iron citrate transporters [57, 59, 61, 62]. Similar systems exist for siderophores, such as pseudobactins BN7 and BN8 (the PupBRI system) , pyoverdines (FpvARI/PvdS)  and a range of other siderophores. Not all of these systems have identical regulatory pathways. The pyoverdine transport system has two ECF sigma factors (FpvI and PvdS) which regulate different operons , and the HasISR system, which transports heme, has an unusual regulatory scheme in which the membrane-bound sigma factor HasS inhibits the activity of the ECF sigma factor HasI until heme binding to the TBDT HasR .
Strikingly, only the four Methylosinus MbnT TBDTs have the N-terminal extensions necessary for FecIRA signaling , suggesting a possible regulatory mechanism for Mbn production and transport in Group I operons (Figure 6). In this model, when CuMbn binds to MbnT, a periplasmic TonB-mediated interaction with MbnR results in an altered cytoplasm-side interaction with MbnI. The MbnI ECF sigma factor may then interact with RNA polymerase to either upregulate or inhibit Mbn biosynthesis and transport and may also regulate other operons that are highly expressed at low copper, such as the sMMO operon. If MbnIRT is a positive regulation system, a negative regulator that binds copper and represses Mbn biosynthesis and transport, among other systems, may also be present.
The TBDTs in other operons beyond the Methylosinus (Group I) species lack N-terminal extension domains and are not adjacent to FecIR homologues. Although a FecIRA-like system could still be present in these species in a distant small operon, it is less likely. It may be that models analogous to different siderophore regulatory systems are more relevant to these Mbn operons. For example, iron-loaded pyochelin is taken up into the cell and binds to the transcription factor PchR, which regulates its biosynthesis and transport [66–69]. If such a system exists for chalkophores (Additional file 1, Figure S5), the regulators do not appear to be consistently encoded near the biosynthesis operon. However, genes encoding periplasmic binding proteins, commonly associated with natural product import via adenosine triphosphate (ATP)-binding cassette (ABC) transporters, are located downstream of TBDTs in both complete Methylocystis and Azospirillum operons, and could be relevant to the need for cytoplasmic uptake in a PchR-like model (Figure 2).
MbnP and MbnH: mysterious partners
The genes encoding MbnP and MbnH are conserved as a pair far beyond the group of Mbn producers analyzed here and are defined by an existing set of TIGRFAM HMMs (TIGR04039 and TIGR04052) and an associated genome property (GenProp0940). The pair consists of the di-heme cytochrome c peroxidase MbnH, frequently annotated as resembling MauG, and its neighboring partner protein, MbnP. In two non-Group V genomes (Methylosinus sp. LW3 and LW4), there are cases where this pair is not immediately proximal to an Mbn operon, but is present elsewhere in the genome. Methylosinus trichosporium OB3b has two such additional pairs. Interestingly, these isolated pairs are located near MbnT-like TBDTs that also have adjacent MbnI and MbnR homologues.
A somewhat similar pair of proteins are found in some methanotroph species that lack Mbn operons. In Methylococcus capsulatus (Bath), the proteins are called SACCP (the di-heme cytochrome c peroxidase) and MopE (the partner protein). MopE is known to be the subject of a post-translational modification (possibly by SACCP, which is similar to MauG ) in which a tryptophan converted to kynurenine participates in a copper binding site . Additionally, while the intact MopE protein is surface-associated, a C-terminal region is fully secreted . In Methylomicrobium album BG8, these proteins are called CorB (the di-heme cytochrome c peroxidase) and CorA (the partner protein) [73, 74]. The genes encoding these proteins are downregulated in the presence of copper [75–77]. However, although there are several well-conserved tryptophans in the MbnP proteins, the sequence is not markedly similar to MopE or CorA (Additional file 1, Figure S6), and there are no data linking any close MbnP homologues or their di-heme cytochrome c peroxidase partners to copper. The relevance of this gene pair to Mbn biosynthesis, regulation or transport thus remains unclear.
Overall structure of the Mbn operon
The core of the Mbn operon (Figure 2) is the MbnB biosynthesis gene, located directly downstream of MbnA in all operons except for the two Group V operons, which have an unknown gene between MbnA and MbnB. MbnC encodes a secondary core protein, present immediately downstream of MbnB in all operons except Group V operons. All components beyond that core are more flexible. When present, MbnM follows the core biosynthesis peptides. Other biosynthesis-related genes, such as MbnN and MbnS follow MbnM. In some cases, the MbnP/MbnH pair appears after the biosynthesis proteins. In others, it is present before them on the same strand, or before them but on the complementary strand. MbnT, downstream of MbnI/R in Group I operons, primarily occurs prior to the biosynthesis cluster on the same strand and frequently neighbors the MbnP/MbnH pair as well.
In many of the operons, factors related to genetic mobility, such as insertion sequences, transposases, integrases, insertion sites, shufflons and conjugation-related proteins, occur on one or both sides of the Mbn operon or within several kilobases (Figure 2). These elements may suggest an explanation for the seemingly unrelated assortment of species in which these operons have been detected, and for the lack of operon detection in several well-studied methanotroph species, including Methylocystis str. Rockwell . Siderophores are sometimes transported between species on virulence or fitness cassettes . Similarly, it may be that chalkophores are transported in this fashion and adapted by species that have a special need for copper-binding compounds.
We have detected a total of 18 novel Mbn-like precursors located in full or partial biosynthesis/transport operons in 16 species or metagenomic samples. Of the methanotroph species, operons are present in both strains that undergo the copper switch from sMMO to pMMO (for example, Methylosinus trichosporium OB3b , Methylocystis str. M [80, 81], Methylocystis hirsuta CSC-1 ) and those that only express pMMO (for example, Methylocystis parvus OBBP , Methylocystis rosea SV97T ). The 16 species are not limited to methanotrophic bacteria, providing compelling evidence that Mbn-like compounds may play a broader role in proteobacterial metal homeostasis. This analysis reveals the precursor peptide for Methylocystis rosea SV97T Mbn  and identifies in the same operon genes encoding enzymes that would be necessary to produce the novel features of this Mbn, specifically the sulfonated threonine. Moreover, these data allow us to predict that the Mbn produced by Methylocystis strain SC2 will be very similar to that of Methylocystis rosea SV97T and likely identical to that of Methylocystis hirsuta CSC-1. Conversely, we can predict that the Mbn operons of Methylocystis str. SB2, Methylocystis str. M and Methylocystis hirsuta CSC-1 will have the same core components as the two Methylocystis operons presented here. Taken together, these findings provide strong new support for a post-translational modification biosynthetic pathway.
Beyond the four Methylocystis Mbns, the only other structurally characterized Mbn is the original compound from Methylosinus trichosporium OB3b, which has a Group I Mbn operon. As the related natural products from Group I, III, IV and V familes are characterized, the extent of structural diversity in the Mbn family should become more clear. The roles of MbnB and MbnC as well as the less universal MbnN, MbnS and MbnF proteins in biosynthesis are unknown or unconfirmed and need to be investigated biochemically. This is particularly important since Mbns contain uncommon post-translational modifications, such as thioamide groups, a modification rare enough that Mbns have doubled the number of compounds known to contain it . In addition, there are no other examples of RiPPs containing pyrazinediones [85, 86], and even oxazolone rings are uncommon, with oxazoles and thiazoles constituting the more common products of serine, threonine and cysteine cyclization. The combination of these motifs with the possibility of more unknown post-translational modifications in Mbns from Groups I and III to V suggests that novel biochemical mechanisms may be involved in Mbn biosynthesis.
The two identified Group V operons may represent a different natural product subfamily, albeit one that shares some similar biosynthesis proteins and modifications with the main Mbn family. Notably, their MbnA sequences contain only a single modifiable cysteine, suggesting that if the final products bind copper at all, they do not use the paired heterocycle/thioamide coordination scheme. Instead of MbnC homologues, these operons include a third unidentified putative protein which neighbors MbnA, and Vibrio caribbenthicus also has a second unknown protein following MbnB. Both have nearby exporters, but no TBDT-like importers.
The identification of MbnM and MbnT as common members of the Mbn operon provides candidate transporters for both Mbn import and export. The possible involvement of MATE-type exporters is somewhat surprising, but the ability of TBDTs to import metal-loaded siderophores is well documented, and the association of such transporters with Mbn operons supports experimental work showing that Mbn uptake is an active process [21, 53–55]. Furthermore, in the case of Group I operons, the N-terminal transduction element in MbnT combined with the presence of MbnI and MbnR is consistent with FecIRA-style regulation. This model, along with a hypothetical pyochelin-like route for non-Group I operons, provides testable mechanisms for CuMbn involvement in methanotrophic copper regulation, and may help unravel the mystery of the copper switch.
A final point of interest lies in what was not found in this analysis. There are a variety of methanotroph genomes, including but not limited to Methylococcus capsulatus (Bath) , Methylocella sylvesteris BL2 , Methylocystis str. Rockwell (ATCC 49242)  and Methylomicrobium album BG8, in which we detect no Mbn biosynthesis/transport operons. Based on their genomes, if these species produce a chalkophore as suggested , it is not similar to existing structurally characterized Mbns and its biosynthetic enzymes do not closely resemble MbnB and MbnC. While one of these species only produces sMMO, the rest produce pMMO and some, including Methylocystis str. Rockwell, produce only pMMO. If these methanotrophs do not produce their own chalkophores, they might scavenge chalkophores from other species, similar to what is observed for siderophores , and may still possess Mbn-transporting TBDTs. Alternatively, these strains may have other, yet to be unidentified, mechanisms of copper uptake. Taken together, these data provide new insight into Mbn and Mbn-like compounds and their biosynthesis, provide new tools for investigating these processes, and have implications for the broader question of bacterial heavy metal homeostasis.
The contigs, scaffolds or complete genomes containing Mbn operons discussed in this paper include: Azospirillum sp. B506 (GenBank: BADK01001132), Azospirillum sp. B510 (GenBank: NC_013854) , Bioreactor metagenome PBDCA2 (GenBank: AGTN01410593, AGTN01295401 and AGTN01527378), Cupriavidus basilensis B-8 (GenBank: AKXR01001597), Gluconacetobacter sp. SXCC-1 (GenBank: NZ_AFCH01000034) , Gluconacetobacter oboediens 174bp2 (GenBank: NZ_CADT01000094), Marine metagenome (GenBank: JCVI_SCAF_1096627660232), Methylocystis sp. SC2 (GenBank: NC_018485) , Methylobacterium sp. B34 (GenBank: BADE01000957), Methylocystis parvus OBBP (GenBank: AJTV01000003 and AJTV01000041) , Methylocystis rosea SV97T (IMG: A3OODRAFT_scaffold1.1), Methylosinus sp. LW3 (IMG: MetLW3DRAFT_contig2.2), Methylosinus sp. LW4 (IMG: MetLW4DRAFT_scaffold2.2), Methylosinus trichosporium OB3b (IMG: MettrDRAFT_Contig106; GenBank version is outdated) , Photorhabdus luminescens subsp. laumondii TT01 (GenBank: BX571860) , Pseudomonas extremaustralis 14-3 substr. 14-3b (GenBank: NZ_AHIP01000040) , Pseudomonas fluorescens NZI7 (GenBank: AJXF01000037) , Tistrella mobilis KA081020-065 (GenBank: NC_017956)  and Vibrio caribbenthicus ATCC BAA-2122 (GenBank: NZ_AEIU01000072).
Gene cluster identification and classification
Gene sequences from Methylosinus trichosporium OB3b and Azospirillum sp. B510 were used as seeds for searches against the NCBI WGS and NR databases  (National Center for Biotechnology Information, Bethseda, Maryland, USA), as well as the IMG database  (DOE Joint Genome Institute, Walnut Creek, California, USA), using the tBLASTn  (National Center for Biotechnology Information, Bethseda, Maryland, USA) algorithm to identify genes even in unannotated regions. Hits of E <1E-20 were manually examined for the presence of other related genes and potential precursors. In genes of interest, annotation was confirmed or potential roles for unannotated genes were identified via BLAST (National Center for Biotechnology Information, Bethseda, Maryland, USA) and via the Pfam (European Bioinformatics Institute, Hinxton, England, UK) and TIGRFAM (J. Craig Venter Institute, Rockville, Maryland, USA) databases [38, 100]. In the cases of Cupriavidus basilensis B-8, Methylocystis parvus OBBP, Pseudomonas fluorescens NZI7, Azospirillum sp. B506 and Methylobacterium sp. B34, the relevant contigs were manually examined for ORFs. ORFs of interest were provisionally identified using BLAST and PFAM. The IGS Annotation Engine (http://ae.igs.umaryland.edu/cgi/index.cgi)  (Institute for Genome Sciences, University of Maryland, Baltimore, Maryland, USA) was used for structural and functional annotation of the first three of these sequences (Additional files 3, 4, 5, 6). Manatee was used to view annotations (http://manatee.sourceforge.net/).
Alignment and phylogeny
Initial multiple sequence alignments were generated using ClustalOmega  (University College Dublin, Dublin, Ireland), MAFFT  (Computational Biology Research Center, Tokyo, Japan) or MUSCLE  (Drive5, Tiburon, California, USA), applying the default settings. Alignments were visualized using the Jalview 2 (University of Dundee, Dundee, Scotland, UK) package and were organized according to a simple tree based on the Neighbor-Join algorithm using the BLOSUM62 model. Phylogenetic and evolutionary analyses were conducted using the MEGA5 package  (Center for Evolutionary Medicine and Informatics, Arizona State University, Tempe, Arizona, USA). During phylogentic tree construction, the evolutionary history was inferred by using the Maximum Likelihood method based on the JTT matrix-based model . The bootstrap consensus tree inferred from 1,000 replicates  was taken to represent the evolutionary history of the taxa analyzed . Branches corresponding to partitions reproduced in less than 50% bootstrap replicates were collapsed. Initial tree(s) for the heuristic search were obtained automatically by applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (five categories (+G, parameter varied by alignment)).
Precursor gene identification
The Methylosinus trichosporium OB3b and Azospirillum sp. B510 MbnB sequences were used as described above to identify potential Mbn biosynthesis operons. When such a protein was identified, small ORFs (10 to 50 aa) in a 2 kb region up- and downstream of the gene were analyzed for the presence of cysteines in the last 10 residues, a lysine or arginine preceded by a hydrophobic region located a few residues prior to the cysteine, and multiple lysines and/or arginines within the first 10 residues. Motifs showing the amino acid frequency across the precursor peptide were generated using Jalview 2 .
HMM generation and analysis
HMMs were generated for MbnA (Additional file 7), MbnB (Additional file 8) and MbnC (Additional file 9) using HMMER3  (Howard Hughes Medical Institute Janelia Farm, Ashburn, Virginia, USA). Prior to HMM generation, curated seed alignments (featuring removal of truncated or otherwise unacceptable sequences and trimming of extra domains) were manually generated. HMMs were used to scan the existing protein databases for additional operons, but all extant Mbn operons appear to have been located via tBLASTn, prior to HMM construction. A revised NExT HMM (NexTNew) was used to confirm the identification of TBDTs possessing the N-terminal extension required for TonB-mediated signal transduction .
Basic Local Alignment Search Tool
flavin adenine dinucleotide
hidden Markov model
multidrug and toxic compound extrusion
major facilitator superfamily
messenger ribonucleic acid
nuclear magnetic resonance
open reading frame
particulate methane monooxygenase
ribosomally synthesized and post-translationally modified peptide natural product
surface-associated cytochrome c peroxidase
soluble methane monooxygenase
Semrau JD, Dispirito AA, Yoon S: Methanotrophs and copper. FEMS Microbiol Lett. 2010, 34: 496-531.
Scheutz C, Kjeldsen P, Bogner JE, De Visscher A, Gebert J, Hilger HA, Huber-Humer M, Spokas K: Microbial methane oxidation processes and technologies for mitigation of landfill gas emissions. Waste Manag Res. 2009, 27: 409-455. 10.1177/0734242X09339325.
Huber-Humer M, Gebert J, Hilger H: Biotic systems to mitigate landfill methane emissions. Waste Manag Res. 2008, 26: 33-46. 10.1177/0734242X07087977.
Jiang H, Chen Y, Jiang PX, Zhang C, Smith TJ, Murrell JC, Xing XH: Methanotrophs: Multifunctional bacteria with promising applications in environmental bioengineering. Biochem Eng J. 2010, 49: 277-288. 10.1016/j.bej.2010.01.003.
Culpepper MA, Rosenzweig AC: Architecture and active site of particulate methane monooxygenase. Crit Rev Biochem Mol Biol. 2012, 47: 483-492. 10.3109/10409238.2012.697865.
Balasubramanian R, Smith SM, Rawat S, Stemmler TL, Rosenzweig AC: Oxidation of methane by a biological dicopper centre. Nature. 2010, 465: 115-119. 10.1038/nature08992.
Rosenzweig AC, Frederick CA, Lippard SJ, Nordlund P: Crystal structure of a bacterial non-haem iron hydroxylase that catalyses the biological oxidation of methane. Nature. 1993, 366: 537-543. 10.1038/366537a0.
Prior SD, Dalton H: The effect of copper ions on membrane content and methane monooxygenase activity in methanol-grown cells of Methylococcus capsulatus (Bath). J Gen Microbiol. 1985, 131: 155-163.
Murrell JC, McDonald IR, Gilbert B: Regulation of expression of methane monooxygenases by copper ions. Trends Microbiol. 2000, 8: 221-225. 10.1016/S0966-842X(00)01739-X.
Hakemian AS, Rosenzweig AC: The biochemistry of methane oxidation. Ann Rev Biochem. 2007, 76: 223-241. 10.1146/annurev.biochem.76.061505.175355.
Téllez CM, Gaus KP, Graham DW, Arnold RG, Guzman RZ: Isolation of copper biochelates from Methylosinus trichosporium OB3b and soluble methane monooxygenase mutants. Appl Environ Microbiol. 1998, 64: 1115-1122.
DiSpirito AA, Zahn JA, Graham DW, Kim HJ, Larive CK, Derrick TS, Cox CD, Taylor AB: Copper-binding compounds from Methylosinus trichosporium OB3b. J Bacteriol. 1998, 180: 3606-3613.
Kim HJ, Graham DW, DiSpirito AA, Alterman MA, Galeva N, Larive CK, Asunskis D, Sherwood PMA: Methanobactin, a copper-aquisition compound from methane oxidizing bacteria. Science. 2004, 305: 1612-1615. 10.1126/science.1098322.
Ruiz A, Ogden KL: Biotreatment of copper and isopropyl alcohol in waste from semiconductor manufacturing. IEEE Trans Semicond Manuf. 2004, 17: 538-543. 10.1109/TSM.2004.835708.
Summer KH, Lichtmannegger J, Bandow N, Choi DW, DiSpirito AA, Michalke B: The biogenic methanobactin is an effective chelator for copper in a rat model for Wilson disease. J Trace El Med Biol. 2011, 25: 36-41. 10.1016/j.jtemb.2010.12.002.
Balasubramanian R, Rosenzweig AC: Copper methanobactin: a molecule whose time has come. Curr Op Chem Biol. 2008, 12: 245-249. 10.1016/j.cbpa.2008.01.043.
Kenney GE, Rosenzweig AC: Chemistry and biology of the copper chelator methanobactin. ACS Chem Biol. 2012, 7: 260-268. 10.1021/cb2003913.
El Ghazouani A, Basle A, Firbank SJ, Knapp CW, Gray J, Graham DW, Dennison C: Copper-binding properties and structures of methanobactins from Methylosinus trichosporium OB3b. Inorg Chem. 2011, 50: 1378-1391. 10.1021/ic101965j.
Knapp CW, Fowle DA, Kulczycki E, Roberts JA, Graham DW: Methane monooxygenase gene expression mediated by methanobactin in the presence of mineral copper sources. Proc Natl Acad Sci USA. 2007, 104: 12040-12045. 10.1073/pnas.0702879104.
Kulczycki E, Roberts JA: Methanobactin-promoted dissolution of Cu-substituted borosilicate glass. Geobiology. 2007, 5: 251-263. 10.1111/j.1472-4669.2007.00102.x.
Balasubramanian R, Kenney GE, Rosenzweig AC: Dual pathways for copper uptake by methanotrophic bacteria. J Biol Chem. 2011, 286: 37313-37319. 10.1074/jbc.M111.284984.
Behling LA, Hartsel SC, Lewis DE, Dispirito AA, Choi DW, Masterson LR, Veglia G, Gallagher WH: NMR, mass spectrometry and chemical evidence reveal a different chemical structure for methanobactin that contains oxazolone rings. J Am Chem Soc. 2008, 130: 12604-12605. 10.1021/ja804747d.
Krentz BD, Mulheron HJ, Semrau JD, DiSpirito AA, Bandow NL, Haft DH, Vuilleumier S, Murrell JC, McEllistrem MT, Hartsel SC, Gallagher WH: A comparison of methanobactins from Methylosinus trichosporium OB3b and Methylocystis strain SB2 predicts methanobactins are synthesized from diverse peptide precursors modified to create a common core for binding and reducing copper ions. Biochemistry. 2010, 49: 10117-10130. 10.1021/bi1014375.
El Ghazouani A, Basle A, Gray J, Graham DW, Firbank SJ, Dennison C: Variations in methanobactin structure influences copper utilization by methane-oxidizing bacteria. Proc Natl Acad Sci USA. 2012, 109: 8400-8404. 10.1073/pnas.1112921109.
Bandow N, Gilles VS, Freesmeier B, Semrau JD, Krentz B, Gallagher W, McEllistrem MT, Hartsel SC, Choi DW, Hargrove MS, Heard TM, Chesner LN, Braunreiter KM, Cao BV, Gavitt MM, Hoopes JZ, Johnson JM, Polster EM, Schoenick BD, Umlauf AM, DiSpirito AA: Spectral and copper binding properties of methanobactin from the facultative methanotroph Methylocystis strain SB2. J Inorg Biochem. 2012, 110: 72-82.
Kim HJ, Galeva N, Larive CK, Alterman M, Graham DW: Purification and physical-chemical properties of methanobactin: a chalkophore from Methylosinus trichosporium OB3b. Biochemistry. 2005, 44: 5140-5148. 10.1021/bi047367r.
Crosa JH, Walsh CT: Genetics and assembly line enzymology of siderophore biosynthesis in bacteria. Microbiol Molec Biol Rev. 2002, 66: 223-249. 10.1128/MMBR.66.2.223-249.2002.
Stein LY, Yoon S, Semrau JD, DiSpirito AA, Crombie A, Murrell JC, Vuilleumier S, Kalyuzhnaya MG, Op den Camp HJ, Bringel F, Bruce D, Cheng JF, Copeland A, Goodwin L, Han S, Hauser L, Jetten MS, Lajus A, Land ML, Lapidus A, Lucas S, Médigue C, Pitluck S, Woyke T, Zeytun A, Klotz MG: Genome sequence of the obligate methanotroph Methylosinus trichosporium strain OB3b. J Bacteriol. 2010, 192: 6497-6498. 10.1128/JB.01144-10.
Boekhorst J, Wilson G, Siezen RJ: Searching in microbial genomes for encoded small proteins. Microb Biotechnol. 2011, 4: 308-313. 10.1111/j.1751-7915.2011.00261.x.
Harrison PM, Carriero N, Liu Y, Gerstein M: A 'PolyORFomic' analysis of prokaryote genomes using disabled-homology filtering reveals conserved but undiscovered short ORFs. J Mol Biol. 2003, 333: 885-892. 10.1016/j.jmb.2003.09.016.
Heo HS, Lee S, Kim JM, Choi YJ, Chung HY, Oh SJ: tsORFdb: Theoretical small open reading frames (ORFs) database and massProphet: peptide mass fingerprinting (PMF) tool for unknown small functional ORFs. Biochem Biophys Res Commun. 2010, 397: 120-126. 10.1016/j.bbrc.2010.05.093.
de Jong A, van Hijum S, Bijlsma JJ, Kok J, Kuipers OP: BAGEL: a web-based bacteriocin genome mining tool. Nucleic Acids Res. 2006, 34: W273-W279. 10.1093/nar/gkl237.
de Jong A, van Heel AJ, Kok J, Kuipers OP: BAGEL2: mining for bacteriocins in genomic data. Nucleic Acids Res. 2010, 38: W647-W651. 10.1093/nar/gkq365.
Wang H, Fewer DP, Sivonen K: Genome mining demonstrates the widespread occurrence of gene clusters encoding bacteriocins in cyanobacteria. PLoS ONE. 2011, 6: e22384-10.1371/journal.pone.0022384.
Velásquez JE, van der Donk WA: Genome mining for ribosomally synthesized natural products. Curr Opin Chem Biol. 2011, 15: 11-21. 10.1016/j.cbpa.2010.10.027.
Murphy K, O'Sullivan O, Rea MC, Cotter PD, Ross RP, Hill C: Genome mining for radical SAM protein determinants reveals multiple sactibiotic-like gene clusters. PLoS One. 2011, 6: e20852-10.1371/journal.pone.0020852.
Arnison PG, Bibb MJ, Bierbaum G, Bowers AA, Bugni TS, Bulaj G, Camarero JA, Campopiano DJ, Challis GL, Clardy J, Cotter PD, Craik DJ, Dawson M, Dittmann E, Donadio S, Dorrestein PC, Entian K-D, Fischbach MA, Garavelli JS, Goransson U, Gruber CW, Haft DH, Hemscheidt TK, Hertweck C, Hill C, Horswill AR, Jaspars M, Kelly WL, Klinman JP, Kuipers OP, et al: Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. Nat Prod Rep. 2013, 30: 108-160. 10.1039/c2np20085f.
Haft DH, Selengut JD, Richter RA, Harkins D, Basu MK, Beck E: TIGRFAMs and Genome Properties in 2013. Nucleic Acids Res. 2013, 41: D387-D395. 10.1093/nar/gks1234.
Haft DH, Basu MK, Mitchell DA: Expansion of ribosomally produced natural products: a nitrile hydratase-and Nif11-related precursor family. BMC Biol. 2010, 8:
del Cerro C, García JM, Rojas A, Tortajada M, Ramón D, Galán B, Prieto MA, García JL: Genome sequence of the methanotrophic poly-β-hydroxybutyrate producer Methylocystis parvus OBBP. J Bacteriol. 2012, 194: 5709-5710. 10.1128/JB.01346-12.
Grass G: Iron transport in Escherichia coli: All has not been said and done. Biometals. 2006, 19: 159-172. 10.1007/s10534-005-4341-2.
Oman TJ, van der Donk WA: Follow the leader: the use of leader peptides to guide natural product biosynthesis. Nat Chem Biol. 2010, 6: 9-18. 10.1038/nchembio.286.
Dam B, Dam S, Kube M, Reinhardt R, Liesack W: Complete genome sequence of Methylocystis sp strain SC2, an aerobic methanotroph with high-affinity methane oxidation potential. J Bacteriol. 2012, 194: 6008-6009. 10.1128/JB.01446-12.
Copley RR, Bork P: Homology among (βα)8 barrels: implications for the evolution of metabolic pathways. J Mol Biol. 2000, 303: 627-640. 10.1006/jmbi.2000.4152.
Hanzelmann P, Schindelin H: Crystal structure of the S-adenosylmethionine-dependent enzyme MoaA and its implications for molybdenum cofactor deficiency in humans. Proc Natl Acad Sci USA. 2004, 101: 12870-12875. 10.1073/pnas.0404624101.
Omote H, Hiasa M, Matsumoto T, Otsuka M, Moriyama Y: The MATE proteins as fundamental transporters of metabolic and xenobiotic organic cations. Trends Pharmacol Sci. 2006, 27: 587-593. 10.1016/j.tips.2006.09.001.
Kuroda T, Tsuchiya T: Multidrug efflux transporters in the MATE family. Biochim Biophys Acta. 2009, 1794: 763-768. 10.1016/j.bbapap.2008.11.012.
He XA, Szewczyk P, Karyakin A, Evin M, Hong WX, Zhang QH, Chang G: Structure of a cation-bound multidrug and toxic compound extrusion transporter. Nature. 2010, 467: 991-994. 10.1038/nature09408.
Brickman TJ, Armstrong SK: Bordetella AlcS transporter functions in alcaligin siderophore export and is central to inducer sensing in positive regulation of alcaligin system gene expression. J Bacteriol. 2005, 187: 3650-3661. 10.1128/JB.187.11.3650-3661.2005.
Allard KA, Viswanathan VK, Cianciotto NP: lbtA and lbtB are required for production of the Legionella pneumophila siderophore legiobactin. J Bacteriol. 2006, 188: 1351-1363. 10.1128/JB.188.4.1351-1363.2006.
Hannauer M, Yeterian E, Martin LW, Lamont IL, Schalk IJ: An efflux pump is involved in secretion of newly synthesized siderophore by Pseudomonas aeruginosa. FEBS Lett. 2010, 584: 4751-4755. 10.1016/j.febslet.2010.10.051.
Li XZ, Nikaido H: Efflux-mediated drug resistance in bacteria: an update. Drugs. 2009, 69: 1555-1623. 10.2165/11317030-000000000-00000.
Noinaj N, Guillier M, Barnard TJ, Buchanan SK: TonB-dependent transporters: regulation, structure, and function. Ann Rev Microbiol. 2010, 64: 43-60. 10.1146/annurev.micro.112408.134247.
Wiener MC: TonB-dependent outer membrane transport: going for Baroque?. Curr Op Struct Biol. 2005, 15: 394-400. 10.1016/j.sbi.2005.07.001.
Krewulak KD, Vogel HJ: TonB or not TonB: is that the question?. Biochem Cell Biol. 2011, 89: 87-97. 10.1139/O10-141.
Schauer K, Rodionov DA, de Reuse H: New substrates for TonB-dependent transport: do we only see the 'tip of the iceberg'?. Trends Biochem Sci. 2008, 33: 330-338. 10.1016/j.tibs.2008.04.012.
Koebnik R: TonB-dependent trans-envelope signalling: the exception or the rule?. Trends Microbiol. 2005, 13: 343-347. 10.1016/j.tim.2005.06.005.
Postle K, Larsen RA: TonB-dependent energy transduction between outer and cytoplasmic membranes. Biometals. 2007, 20: 453-465. 10.1007/s10534-006-9071-6.
Ferguson AD, Amezcua CA, Halabi NM, Chelliah Y, Rosen MK, Ranganathan R, Deisenhofer J: Signal transduction pathway of TonB-dependent transporters. Proc Natl Acad Sci USA. 2007, 104: 513-518. 10.1073/pnas.0609887104.
Brooks BE, Buchanan SK: Signaling mechanisms for activation of extracytoplasmic function (ECF) sigma factors. Biochim Biophys Acta. 2008, 1778: 1930-1945. 10.1016/j.bbamem.2007.06.005.
Braun V, Mahren S, Ogierman M: Regulation of the Fecl-type ECF sigma factor by transmembrane signalling. Curr Opin Microbiol. 2003, 6: 173-180. 10.1016/S1369-5274(03)00022-5.
Braun V, Endriss F: Energy-coupled outer membrane transport proteins and regulatory proteins. Biometals. 2007, 20: 219-231. 10.1007/s10534-006-9072-5.
Koster M, Vanklompenburg W, Bitter W, Leong J, Weisbeek P: Role for the outer membrane ferric siderophore receptor PupB in signal transduction across the bacterial cell envelope. EMBO J. 1994, 13: 2805-2813.
Beare PA, For RJ, Martin LW, Lamont IL: Siderophore-mediated cell signalling in Pseudomonas aeruginosa: divergent pathways regulate virulence factor production and siderophore receptor synthesis. Mol Microbiol. 2003, 47: 195-207.
Lefevre J, Delepelaire P, Delepierre M, Izadi-Pruneyre N: Modulation by substrates of the interaction between the HasR outer membrane receptor and its specific TonB-like protein, HasB. J Mol Biol. 2008, 378: 840-851. 10.1016/j.jmb.2008.03.044.
Michel L, Bachelard A, Reimmann C: Ferripyochelin uptake genes are involved in pyochelin-mediated signalling in Pseudomonas aeruginosa. Microbiology. 2007, 153: 1508-1518. 10.1099/mic.0.2006/002915-0.
Heinrichs DE, Poole K: PchR, a regulator of ferripyochelin receptor gene (fptA) expression in Pseudomonas aeruginosa, functions both as an activator and as a repressor. J Bacteriol. 1996, 178: 2586-2592.
Michel L, Gonzalez N, Jagdeep S, Nguyen-Ngoc T, Reimmann C: PchR-box recognition by the AraC-type regulator PchR of Pseudomonas aeruginosa requires the siderophore pyochelin as an effector. Mol Microbiol. 2005, 58: 495-509. 10.1111/j.1365-2958.2005.04837.x.
Youard ZA, Wenner N, Reimmann C: Iron acquisition with the natural siderophore enantiomers pyochelin and enantio-pyochelin in Pseudomonas species. Biometals. 2011, 24: 513-522. 10.1007/s10534-010-9399-9.
Wilmot CM, Yukl ET: MauG: a di-heme enzyme required for methylamine dehydrogenase maturation. Dalton Trans. 2013, 42: 3127-3135. 10.1039/c2dt32059b.
Helland R, Fjellbirkeland A, Karlsen OA, Ve T, Lillehaug JR, Jensen HB: An oxidized tryptophan facilitates copper binding in Methylococcus capsulatus-secreted protein MopE. J Biol Chem. 2008, 283: 13897-13904. 10.1074/jbc.M800340200.
Karlsen OA, Berven FS, Stafford GP, Larsen Ø, Murrell JC, Jensen HB, Fjellbirkeland A: The surface-associated and secreted MopE protein of Methylococcus capsulatus (Bath) responds to changes in the concentration of copper in the growth medium. Appl Environ Microbiol. 2003, 69: 2386-2388. 10.1128/AEM.69.4.2386-2388.2003.
Berson O, Lidstrom ME: Cloning and characterization of corA, a gene encoding a copper-repressible polypeptide in the type I methanotroph, Methylomicrobium albus BG8. FEMS Microbiol Lett. 1997, 148: 169-174. 10.1111/j.1574-6968.1997.tb10284.x.
Karlsen OA, Larsen O, Jensen HB: Identification of a bacterial di-haem cytochrome c peroxidase from Methylomicrobium album BG8. Microbiology. 2010, 156: 2682-2690. 10.1099/mic.0.037119-0.
Karlsen OA, Lillehaug JR, Jensen HB: The presence of multiple c-type cytochromes at the surface of the methanotrophic bacterium Methylococcus capsulatus (Bath) is regulated by copper. Mol Microbiol. 2008, 70: 15-26. 10.1111/j.1365-2958.2008.06380.x.
Karlsen OA, Kindingstad L, Angelskår SM, Bruseth LJ, Straume D, Puntervoll P, Fjellbirkeland A, Lillehaug JR, Jensen HB: Identification of a copper-repressible C-type heme protein of Methylococcus capsulatus (Bath): a member of a novel group of the bacterial di-heme cytochrome c peroxidase family of proteins. FEBS J. 2005, 272: 6324-6335. 10.1111/j.1742-4658.2005.05020.x.
Karlsen OA, Larsen O, Jensen HB: The copper responding surfaceome of Methylococccus capsulatus Bath. FEMS Microbiol Lett. 2011, 323: 97-104. 10.1111/j.1574-6968.2011.02365.x.
Stein LY, Bringel F, DiSpirito AA, Han S, Jetten MS, Kalyuzhnaya MG, Kits KD, Klotz MG, Op den Camp HJ, Semrau JD, Vuilleumier S, Bruce DC, Cheng JF, Davenport KW, Goodwin L, Han S, Hauser L, Lajus A, Land ML, Lapidus A, Lucas S, Médigue C, Pitluck S, Woyke T: Genome sequence of the methanotrophic alphaproteobacterium Methylocystis sp strain Rockwell (ATCC 49242). J Bacteriol. 2011, 193: 2668-2669. 10.1128/JB.00278-11.
Carniel E: The Yersinia high-pathogenicity island: an iron-uptake island. Microbes Infect. 2001, 3: 561-569. 10.1016/S1286-4579(01)01412-5.
Gilbert B, McDonald IR, Finch R, Stafford GP, Nielsen AK, Murrell JC: Molecular analysis of pmo (particulate methane monooxygenase) operons from two type II methanotrophs. Appl Environ Microbiol. 2000, 66: 966-975. 10.1128/AEM.66.3.966-975.2000.
McDonald IR, Uchiyama H, Kambe S, Yagi O, Murrell JC: The soluble methane monooxygenase gene cluster of the trichloroethylene-degrading methanotroph Methylocystis sp. strain M. Appl Environ Microbiol. 1997, 63: 1898-1904.
Lindner AS, Pacheco A, Aldrich HC, Staniec AC, Uz I, Hodson DJ: Methylocystis hirsuta sp nov., a novel methanotroph isolated from a groundwater aquifer. Int J Syst Evol Microbiol. 2007, 57: 1891-1900. 10.1099/ijs.0.64541-0.
Wartiainen I, Hestnes AG, McDonald IR, Svenning MM: Methylocystis rosea sp nov., a novel methanotrophic bacterium from Arctic wetland soil, Svalbard, Norway (78°N). Int J Syst Evol Microbiol. 2006, 56: 541-547. 10.1099/ijs.0.63912-0.
Banala S, Sussmuth RD: Thioamides in nature: in search of secondary metabolites in anaerobic microorganisms. Chembiochem. 2010, 11: 1335-1337. 10.1002/cbic.201000266.
Chinworrungsee M, Kittakoop P, Saenboonrueng J, Kongsaeree P, Thebtaranonth Y: Bioactive compounds from the seed fungus Menisporopsis theobromae BCC 3975. J Nat Prod. 2006, 69: 1404-1410. 10.1021/np0601197.
Savard ME, Melzer MS, Boland GJ, Bensimon C, Blackwell BA: A new 1-hydroxy-2,6-pyrazinedione associated with hypovirulent isolates of Sclerotinia minor. J Nat Prod. 2003, 66: 306-309. 10.1021/np020445w.
Ward N, Larsen O, Sakwa J, Bruseth L, Khouri H, Durkin AS, Dimitrov G, Jiang L, Scanlan D, Kang KH, Lewis M, Nelson KE, Metheacute B, Wu M, Heidelberg JF, Paulsen IT, Fouts D, Ravel J, Tettelin H, Ren Q, Read T, DeBoy RT, Seshadri R, Salzberg SL, Jensen HB, Birkeland NK, Nelson WC, Dodson RJ, Grindhaug SH, Holt I, et al: Genomic insights into methanotrophy: the complete genome sequence of Methylococcus capsulatus (Bath). PLoS Biol. 2004, 2: e303-10.1371/journal.pbio.0020303.
Chen Y, Crombie A, Rahman MT, Dedysh SN, Liesack W, Stott MB, Alam M, Theisen AR, Murrell JC, Dunfield PF: Complete genome sequence of the aerobic facultative methanotroph Methylocella silvestris BL2. J Bacteriol. 2010, 192: 3840-3841. 10.1128/JB.00506-10.
Choi DW, Bandow NL, McEllistrem MT, Semrau JD, Antholine WE, Hartsel SC, Gallagher W, Zea CJ, Pohl NL, Zahn JA, DiSpirito AA: Spectral and thermodynamic properties of methanobactin from gamma-proteobacterial methane oxidizing bacteria: A case for copper competition on a molecular level. J Inorg Biochem. 2010, 104: 1240-1247. 10.1016/j.jinorgbio.2010.08.002.
Greenwald J, Nader M, Celia H, Gruffaz C, Geoffroy V, Meyer JM, Schalk IJ, Pattus F: FpvA bound to non-cognate pyoverdines: molecular basis of siderophore recognition by an iron transporter. Mol Microbiol. 2009, 72: 1246-1259. 10.1111/j.1365-2958.2009.06721.x.
Kaneko T, Minamisawa K, Isawa T, Nakatsukasa H, Mitsui H, Kawaharada Y, Nakamura Y, Watanabe A, Kawashima K, Ono A, Shimizu Y, Takahashi C, Minami C, Fujishiro T, Kohara M, Katoh M, Nakazaki N, Nakayama S, Yamada M, Tabata S, Sato S: Complete genomic structure of the cultivated rice endophyte Azospirillum sp B510. DNA Res. 2010, 17: 37-50. 10.1093/dnares/dsp026.
Du XJ, Jia SR, Yang Y, Wang S: Genome sequence of Gluconacetobacter sp strain SXCC-1, isolated from Chinese vinegar fermentation starter. J Bacteriol. 2011, 193: 3395-3396. 10.1128/JB.05147-11.
Duchaud E, Rusniok C, Frangeul L, Buchrieser C, Givaudan A, Taourit S, Bocs S, Boursaux-Eude C, Chandler M, Charles JF, Dassa E, Derose R, Derzelle S, Freyssinet G, Gaudriault S, Medigue C, Lanois A, Powell K, Siguier P, Vincent R, Wingate V, Zouine M, Glaser P, Boemare N, Danchin A, Kunst F: The genome sequence of the entomopathogenic bacterium Photorhabdus luminescens. Nat Biotechnol. 2003, 21: 1307-1313. 10.1038/nbt886.
Tribelli PM, Iustman LJR, Catone MV, Di Martino C, Revale S, Mendéz BS, López NI: Genome sequence of the polyhydroxybutyrate producer Pseudomonas extremaustralis, a highly stress-resistant Antarctic bacterium. J Bacteriol. 2012, 194: 2381-2382. 10.1128/JB.00172-12.
Godfrey SA, Marshall JW, Klena JD: Genetic characterization of Pseudomonas 'NZ17-a novel pathogen that results in a brown blotch disease of Agaricus bisporus. J Appl Microbiol. 2001, 91: 412-420. 10.1046/j.1365-2672.2001.01398.x.
Xu Y, Kersten RD, Nam SJ, Lu L, Al-Suwailem AM, Zheng HJ, Fenical W, Dorrestein PC, Moore BS, Qian PY: Bacterial biosynthesis and maturation of the didemnin anti-cancer agents. J Am Chem Soc. 2012, 134: 8625-8632. 10.1021/ja301735a.
Benson DA, Karsch-Mizrachi I, Clark K, Lipman DJ, Ostell J, Sayers EW: GenBank. Nucleic Acids Res. 2012, 40: D48-D53. 10.1093/nar/gkr1202.
Markowitz VM, Chen IM, Palaniappan K, Chu K, Szeto E, Grechkin Y, Ratner A, Jacob B, Huang JH, Williams P, Huntemann M, Anderson I, Mavromatis K, Ivanova NN, Kyrpides NC: IMG: the Integrated Microbial Genomes database and comparative analysis system. Nucleic Acids Res. 2012, 40: D115-D122. 10.1093/nar/gkr1044.
Gertz EM, Yu Y-K, Agarwala R, Schaffer AA, Altschul SF: Composition-based statistics and translated nucleotide searches: improving the TBLASTN module of BLAST. BMC Biol. 2006, 4: 41-10.1186/1741-7007-4-41.
Finn RD, Mistry J, Tate J, Coggill P, Heger A, Pollington JE, Gavin OL, Gunasekaran P, Ceric G, Forslund K, Holm L, Sonnhammer EL, Eddy SR, Bateman A: The Pfam protein families database. Nucleic Acids Res. 2010, 38: D211-D222. 10.1093/nar/gkp985.
Galens K, Orvis J, Daugherty S, Creasy HH, Angiuoli S, White O, Wortman J, Mahurkar A, Giglio MG: The IGS standard operating procedure for automated prokaryotic annotation. Stand Genomic Sci. 2011, 4: 244-251. 10.4056/sigs.1223234.
Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li WZ, Lopez R, McWilliam H, Remmert M, Soding J, Thompson JD, Higgins DG: Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011, 7: 539-
Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.
Katoh K, Standley DM: MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S: MEGA5: Molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011, 28: 2731-2739. 10.1093/molbev/msr121.
Jones DT, Taylor WR, Thornton JM: The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci. 1992, 8: 275-282.
Felsenstein J: Confidence limits on phylogenies-an approach using the bootstrap. Evolution. 1985, 39: 783-791. 10.2307/2408678.
Waterhouse AM, Procter JB, Martin DMA, Clamp M, Barton GJ: Jalview Version 2-a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009, 25: 1189-1191. 10.1093/bioinformatics/btp033.
Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011, 39: W29-W37. 10.1093/nar/gkr367.
Taylor WR: Residual colours: a proposal for aminochromography. Protein Eng. 1997, 10: 743-746. 10.1093/protein/10.7.743.
We thank Dr. Ralf Koebnik for providing us with the NExT HMM. We also thank the Institute for Genome Sciences Annotation Engine service at the University of Maryland School of Medicine for providing structural and functional annotation of the Cupriavidus basilensis B-8, Methylocystis parvus OBBP and Pseudomonas fluorescens NZI7 genomes. This work was supported by NSF grant MCB0842366. GEK was supported in part by National Institutes of Health Training Grant GM08061.
The authors declare that they have no competing interests.
GEK carried out the bioinformatics analyses. GEK and ACR conceived of the study and wrote the manuscript. Both authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Figures S1-S6. Figures of additional Mbn structures, alignment of potential biosynthesis protein in Group V operons, alignment of MbnM sequences, alignment of MbnT sequences and phylogenetic tree, figure of alternate Mbn regulation scheme, and alignment of MbnP sequences. (PDF 886 KB)
Additional file 2: Table S1. Table containing information regarding the GenBank or JGI contigs/scaffolds and locus IDs (where present) for the genes discussed in the paper. (XLSX 47 KB)
Additional file 3: Annotated C. basilensis B-8 scaffold 1267_1, GenBank format. A GenBank-formatted file consisting of C. basilensis B-8 scaffold 1267_1 (containing an Mbn operon), as annotated by the IGS. (GBF 21 KB)
Additional file 4: Annotated M. parvus OBBP contig 003, GenBank format. A GenBank-formatted file consisting of M. parvus OBBP contig003 (containing an Mbn operon), as annotated by the IGS. (GBF 348 KB)
Additional file 5: Annotated M. parvus OBBP contig 041, GenBank format. A GenBank-formatted file consisting of M. parvus OBBP contig041 (containing an Mbn operon), as annotated by the IGS. (GBF 73 KB)
Additional file 6: Annotated P. fluorescens NZI7 contig00040_contig01, GenBank format. A GenBank-formatted file consisting of P. fluorescens NZI7 contig00040_contig01 (containing an Mbn operon), as annotated by the IGS. (GBF 63 KB)
Additional file 7: HMMER3 hidden Markov model for MbnA. A HMMER3-compatible hidden Markov model constructed using a curated alignment of the MbnA sequences discussed in the paper. (HMM 13 KB)
Additional file 8: HMMER3 hidden Markov model for MbnB. A HMMER3-compatible hidden Markov model constructed using a curated alignment of the complete MbnB sequences discussed in the paper. (HMM 118 KB)
Additional file 9: HMMER3 hidden Markov model for MbnC. A HMMER3-compatible hidden Markov model constructed using a curated alignment of the MbnC sequences discussed in the paper. (HMM 86 KB)
Authors’ original submitted files for images
About this article
Cite this article
Kenney, G.E., Rosenzweig, A.C. Genome mining for methanobactins. BMC Biol 11, 17 (2013). https://doi.org/10.1186/1741-7007-11-17