Microarray and cDNA sequence analysis of transcription during nerve-dependent limb regeneration

Background Microarray analysis and 454 cDNA sequencing were used to investigate a centuries-old problem in regenerative biology: the basis of nerve-dependent limb regeneration in salamanders. Innervated (NR) and denervated (DL) forelimbs of Mexican axolotls were amputated and transcripts were sampled after 0, 5, and 14 days of regeneration. Results Considerable similarity was observed between NR and DL transcriptional programs at 5 and 14 days post amputation (dpa). Genes with extracellular functions that are critical to wound healing were upregulated while muscle-specific genes were downregulated. Thus, many processes that are regulated during early limb regeneration do not depend upon nerve-derived factors. The majority of the transcriptional differences between NR and DL limbs were correlated with blastema formation; cell numbers increased in NR limbs after 5 dpa and this yielded distinct transcriptional signatures of cell proliferation in NR limbs at 14 dpa. These transcriptional signatures were not observed in DL limbs. Instead, gene expression changes within DL limbs suggest more diverse and protracted wound-healing responses. 454 cDNA sequencing complemented the microarray analysis by providing deeper sampling of transcriptional programs and associated biological processes. Assembly of new 454 cDNA sequences with existing expressed sequence tag (EST) contigs from the Ambystoma EST database more than doubled (3935 to 9411) the number of non-redundant human-A. mexicanum orthologous sequences. Conclusion Many new candidate gene sequences were discovered for the first time and these will greatly enable future studies of wound healing, epigenetics, genome stability, and nerve-dependent blastema formation and outgrowth using the axolotl model.


Background
Salamanders are fascinating vertebrate organisms because they routinely regenerate complex tissues. Within only a few weeks of losing a piece of limb to a hungry predator or scalpel-wielding scientist, a salamander perfectly reforms the missing structure. In the early history of salamander regeneration research, scientists innovated elegant experimental designs to probe the anatomical basis of regeneration [1]. More recently and in parallel with the discovery of conserved regulatory genes and developmental pathways among metazoans, scientists have focused attention on candidate molecules and signaling pathways whose functions were deduced first from studies of model organisms. In particular, much research has been devoted to understanding aspects of limb regeneration associated with wound healing that recapitulate limb development; this strategy has yielded many useful insights and molecular probes [2][3][4][5][6][7][8][9][10][11][12][13][14]. Although it is clear that key regulatory molecules play important roles in the development of all organisms, it is not clear that a framework for understanding regeneration can be constructed using a generic and limited molecular toolkit. There is a need to go beyond candidate molecules and use unbiased approaches to characterize the molecular complexity underlying salamander regeneration.
Recent research resource development for the Mexican axolotl now allows classic regeneration experiments to be re-examined with powerful and unbiased genomic approaches. One particularly elegant experiment performed almost 200 years ago showed that salamander limb regeneration requires the presence of peripheral nerves. Todd [15] found in 1823 that limb regeneration does not occur if the sciatic nerve of the hindlimb is severed shortly before or immediately after a more distal limb amputation. Subsequent research showed that the brachial nerves entering the forelimb are required to promote limb outgrowth and patterning of a new limb [16,17]. Within a few days of limb amputation, cells proximal to the amputation plane dedifferentiate and accumulate to form a blastema. Blastema cells proliferate and progressively differentiate during regeneration to give rise to all mesodermal structures of a typical vertebrate limb [18]. Upon amputation, nerve fibers in the vicinity of the amputation plane extend into the blastema and play a supportive role in cell proliferation [11,19]. Transection of the spinal nerves that enter the limb in the axolotl leads to a decrease in cycling cells and there is resorption of distal tissues of the amputated limb. Histological and cell proliferation analyses suggest that early cellular events are similar between denervated and innervated limbs, but denervated limbs are incapable of blastema formation because they do not support significant cell proliferation and outgrowth [20][21][22][23][24][25].
Although limb regeneration is a complex developmental process, nerve dependency and other aspects of regeneration have often been conceptualized as having a simple molecular basis, involving relatively few regulatory factors. For example, Ferretti and Brockes [26] hypothesized that in the absence of nerves, Schwann cells produce an inhibitory factor that prevents blastema cell proliferation. This mechanism is supported by experimental results although the hypothetical factor has not been identified [27,28]. Alternative mechanisms for nerve dependency have also been proposed for several factors with growth promoting effects [29][30][31][32][33]. A recent study identified newt anterior gradient protein (nAG) as a blastema cell growthpromoting factor in vitro, whose over-expression was sufficient to rescue regeneration of denervated and amputated limbs in vivo [34]. Most recently, nerve-dependent expression of the transcription factor sp9 has been identified as an early marker of dedifferentiation of the wound epithelium and the initiation of limb regeneration [11]. While considerable progress has been made in investigating the functions of candidate regulatory factors and signaling pathways, a broader systems-level perspective is needed to understand why multiple aspects of limb regeneration are dependent upon the presence of a nerve.
Genomic tools are now available that allow global characterization of the regeneration process in salamanders. Expressed sequence tag (EST) information has facilitated the development of an Ambystoma salamander Affymetrix microarray platform [35][36][37][38]. This platform and a highthroughput 454 cDNA sequencing approach were used in this study to compare transcript abundance among uninjured limbs, regenerating limbs, and limbs denervated at the time of amputation. The results show that innervated (NR) and denervated (DL) limbs exhibited similar (but not identical) gene expression patterns at five days post amputation (dpa) but then diverged as a blastema formed under the influence of nerves. The results are discussed within the context of previous studies of nerve dependency, highlighting specific genes and biological processes that are associated with blastema formation and outgrowth, and more generally, the Mexican axolotl's unparalleled ability to regenerate limbs.

Morphology and histology of denervated and innervated limbs
Histological staining verified our experimental procedures for creating innervated and denervated limbs on the same individual. Previous studies found few morphological or histological differences between innervated and denervated limbs during the first few days of regeneration [16,39,40]. Consistent with these observations, denervated and innervated limbs at 5 dpa (DL5 and NR5) were histologically indistinguishable (Figure 1a and 1c). Histo-logical staining (hematoxylin and eosin) showed hemorrhaging directly beneath the epithelium containing Leydig cells, squamous cells, and basal keratinocytes (Figure 1b and 1d). All 5 dpa limbs resembled the wound-healing phase or early phase of dedifferentiation according to the staging of [41]. A blastema was visible on innervated limbs 14 dpa (NR14) and histological staining showed an accumulation of blastemal cells under the wound epithelium. These structures were not observed in denervated limbs at 14 dpa (DL14; Figure 1e to 1h). These results indicate that blastema formation and outgrowth only occurred in NR limbs. An apical thickening of the epithelium characterized the distal end of NR14 limbs and this layer consisted of keratinocytes and Leydig cells. These histological traits indicate that 14 dpa limbs in this experiment corresponded to the early to mid bud stage of limb regeneration [41]. Immunological staining using RT-97 was performed to detect the presence or absence of nerve axons at 5 and 14 dpa. At both time-points, neurofilament staining was positive in the NR limbs, but negative in DL limbs. In pilot experiments, we determined that > 20 days is required for nerves to re-innervate limbs after denervation surgery (data not shown). Thus, DL and NR limbs were created successfully and histology showed that 5 and 14 dpa time points correspond to wound-healing and early to mid bud phases of regeneration, respectively.

Transcription during normal limb regeneration: Deviations of NR limbs from baseline
At both 5 and 14 dpa, mRNA levels for hundreds of genes were significantly different from baseline levels of genes expressed in whole limbs at Day 0 ( Table 1). Many of the same genes (n = 215; up = 111; down = 104) were identified as significant in NR5 and NR14 limbs; the deviation from baseline was in the same direction for all but one of these genes (fabp2). Four matrix metalloproteinases (mmp1, mmp3/10a, mmp9, and mmp13) that are known to function during wound healing in many organisms were upregulated at both 5 and 14 dpa ( Figure 2; Additional file 1), while collagens (col4a1, col4a2, col8a1, col9a3, and col11a1; Figure 3) and muscle-specific genes ( Figure 4) were downregulated (Table 1; Additional file 2). Upregulation of collagen catabolism genes coupled with downregulation of collagen structural genes suggests that transcriptional activation and repression are integrated to efficiently remodel the extracellular environment of damaged tissues. Downregulation of muscle genes at both time points suggests that the differentiated muscle gene expression phenotype changes by 5 dpa, and changes more dramatically by 14 dpa.
Although many genes exhibited similar deviations from baseline at 5 and 14 dpa, unique gene expression changes were identified at each time point (Table 1; Additional files 1 and 2). The unique NR5 genes were associated with gene ontology (GO) terms that implicate extracellular protein changes and signal transduction pathways of the early wounding response. These terms included response to stimulus, signal transduction, extracellular region, and ion transport. The unique NR14 genes were associated with GO terms that implicate cell division and DNA metabolism including ccnd2, ccnb1, rrm1, rrm2, nasp, rrc1, and cdc20. The unique gene expression changes that were identified at 5 and 14 dpa support the idea of temporal progression from an early wound-healing phase to a blastema outgrowth phase during normal limb regeneration.

Transcription within denervated limbs: Deviations of DL limbs from baseline
As was observed in NR limbs, hundreds of genes were identified as significant when comparing DL5 and DL14 mRNA levels with baseline levels measured at Day 0 (Table 1; Additional files 1 and 2). Some of the same or similar GO terms that were associated with NR limbs were identified as significantly enriched in DL limbs. This was Histology of innervated and denervated limbs at 5 and 14 dpa not unexpected because both DL and NR limbs undergo tissue histolysis at the limb stump and carry out an early wound-healing response ( Figure 1). For example, extracellular region and MMP genes were upregulated at 5 and 14 dpa, as was seen in NR limbs ( Figure 2; Additional file 1). As in NR limbs, muscle contraction ( Figure 4), cytoplasmic, and collagen genes (Figure 3; col4a1, col4a2, col8a1) were downregulated in both DL5 and DL14 limbs (Additional file 2). Moreover, 83% of the genes that were iden-tified as significant in NR5 limbs were also significant (and in the same direction) in DL5 limbs. These results indicate that many transcriptional events are nerve-independent during early regeneration.
Denervated limbs are known to cease growth following limb amputation. Several groups of genes may explain this observation including the downregulation of genes associated with the M phase of cell division, mitochon-  Significant genes were identified by microarray analysis. Gene ontology terms are represented that were sampled more often than expected in NR5, NR14, DL5, and DL14 limbs. P-value is calculated by DAVID [45].
drial transcripts, and genes associated with glucose metabolism (Table 1; Additional file 2). Furthermore, DL14 limbs were morphologically similar to DL5 limbs because denervation prevented blastema formation ( Figure 1). Consistent with this observation, many of the genes that were identified as upregulated in DL5 limbs were also identified as significant in DL14 limbs (69%). These genes are associated with the following GO terms: lysosome, response to stress, hydrolyse activity, signal transducer activity, and ion transport. Overall, these terms suggest protraction and expansion of wound-healing responses in DL14 limbs, as well as changes in cellular metabolism and cell division.

Transcript abundance differences between NR and DL limbs
In the preceding two sections, transcriptional patterns of NR and DL limbs were described relative to baseline levels at Day 0. Here, we report genes found to be significant when comparing NR and DL limbs. In general, few significant changes in gene expression were observed when NR5 and DL5 transcripts were compared directly. Transcripts for 16 genes were more abundant in NR5 limbs and 17 were more abundant in DL5 limbs, and these differences were small in terms of fold-level change (< 3-fold difference between NR5 and DL5; Additional file 3). Eighteen of these genes exhibited significant sequence similarity to human or salamander presumptive gene sequences; the others are unknown. The genes with significantly more transcripts in NR5 limbs are associated with intracellular (for example dnase1l3, uap1, acy3, nans, myl4) and extracellular functions, including membrane proteins (for example psca, umod, and emp1) and collagen binding (serpinh1). The genes with significantly more transcripts in DL5 limbs are associated with extracellular functions or the immune response (for example igll1, CD74, hmox1, neil1, marco, sftpd, mmp9, mrc1). All but one of the signifi-cant 5 dpa genes showed the same directional deviation at 14 dpa; myl1was 1.5-fold higher in the NR limb at 5 dpa, but 2.2-fold lower at 14 dpa. Thus, as was observed when comparing DL5 and NR5 mRNA abundances with baseline levels, relatively few gene expression differences were identified between NR and DL limbs at 5 dpa, and the magnitude of these differences was small. These results further support the idea that transcription during limb regeneration is predominantly nerve-independent at 5 dpa.

Schematic of matrix metalloproteinase gene expression
Whereas only 33 transcriptional differences were observed between morphologically similar DL and NR limbs at 5 dpa, 282 differences were detected at 14 dpa Schematic of down-regulated muscle contraction genes Figure 4 Schematic of down-regulated muscle contraction genes. Striated muscle contraction genes that were downregulated during limb regeneration. Each gene is represented by two boxes that denote proportional expression among Day 0, NR5, DL5, NR14, and DL14 samples. The left box reports hybridization intensity from the microarray experiment and the right box reports normalized count data from the 454 cDNA sequencing experiment. Figure 4 was modified from a MAPP originally created by Joanna Fong and Nathan Salomonis.
(Additional file 4). K-means cluster analysis of these genes with significant human protein hits highlighted three clusters wherein genes exhibited similar patterns of expression ( Figure 5). Genes in Cluster 1 presented expression patterns with transcript abundances above baseline in NR14 limbs and abundances below baseline levels in DL14 limbs. Genes associated with cell cycling, a well-established characteristic of blastemal cells, were highly enriched in Cluster 1 (n = 26). Over 50% of the genes in Cluster 1 are predicted to localize to the nucleus, including several transcriptional regulators (msx2, id3, tmpo, atf5, rbm15, spen, parp1, and tardbp). Thus, many of the genes in Cluster 1 have functions that are consistent with blastema formation and outgrowth in NR limbs.
Genes from Clusters 2 and 3 were generally expressed in the same direction between NR14 and DL14 limbs; however, the magnitude of expression differed. Genes in Cluster 2 presented transcript abundances that generally exceeded baseline levels, with higher levels observed in DL14 limbs. This pattern suggests that most of these genes were activated in the same direction in the presence or absence of nerves, but denervation caused higher transcript abundances. Twenty-two percent of the genes in Cluster 2 are associated with the GO term cellular response to stimulus (n = 14) and a significant proportion localized to the lysosome (n = 6), including lgmn, ctsk, ctss, asah, atp6v0d1, and neu1. Genes in Cluster 3 presented transcript abundances that were generally lower than baseline levels, with much lower levels observed in NR14 limbs. With relatively few genes in this cluster, no biological process was identified as significantly enriched. However, inspection of the genes in Cluster 3 again supports the idea that some genes may function in muscle contraction (actc1, myh7, and fhl1) and tissue repair (hsp27 and hebp2). In summary, mRNA levels for genes from Clusters 2 and 3 were quantitatively affected by the presence or absence of a nerve.

cDNA sequence analysis of nerve dependency
To further explore nerve dependency and generate an unbiased collection of molecular probes for regenerating limbs, we sequenced cDNAs derived from the same RNA samples that were used in the microarray analysis. Over 1.7 × 10 6 reads were generated and this yielded approximately 90,000 to 230,000 high-quality sequence reads for each limb treatment, with an average of 215 base pairs in length ( Table 2). More than half of the sequence reads correspond to mitochondrial transcripts and ribosomal RNA. This frequency of mtDNA transcripts (30%) approximates the number sampled in an earlier EST screen of the Ambystoma genome [42]. The number of rRNA transcripts was higher than expected. Assembly of all high-quality cDNA reads yielded 429,086 unique sequences. These sequences were assembled with previous EST contigs to Clustering of genes identified as significant from the compari-son of NR14 and DL14 limbs Figure 5 Clustering of genes identified as significant from the comparison of NR14 and DL14 limbs. Fold change values are relative to baseline levels at Day 0. Blue-coded genes are cell cycle associated; orange-coded genes localize to the lysosome; green-coded genes are associated with inflammatory responses; red-coded genes are matrix metalloproteinases; purple-coded genes are associated with muscle. Genes coded * are associated with inflammation and localize to the lysosome.
produce 61,127 contigs each containing at least two overlapping sequences. The distribution of contig lengths is shown in Figure 6. All contigs and singletons were searched against NCBI databases to identify significant similarity matches that would suggest presumptive gene identities. Ambystoma contigs and singletons yielded 25,446 significant hits to sequences in the human RefSeq database (BLASTx, e < 1 × 10 -7 ), including 9411 unique human genes. Figure 7 shows the distribution of percent coverage to predicted human RefSeq proteins. Interestingly, 7130 Ambystoma queries that did not show significant amino acid sequence identity to a human reference sequence did show significant nucleotide identity to a Xenopus sequence. Assembly of new 454 cDNA sequences with existing EST contigs from the Ambystoma EST database more than doubled (3935 to 9411) the number of non-redundant human-A. mexicanum orthologous sequences. This increase in sequence content was even among 10 GO functional categories that are relevant to salamander wound healing and regeneration ( Table 3).
Assuming that many of the anonymous 454 contigs and singletons (> 300,000) that were generated correspond to functional genes, significantly more than 10,000 different transcripts are expressed during the first two weeks of axolotl limb regeneration.

mRNA abundance estimates and gene discovery from 454 cDNA sequence data
The number of times that a non-redundant transcript was sampled by 454 cDNA sequencing was used to estimate mRNA abundances. The transcript counts for 1150 Ambystoma EST contigs (genes) differed significantly among limb cDNA pools that were created for each of the limb types (Additional file 5). It was possible to assign a putative ortholog to 563 of these genes; the remaining genes were considered anonymous. This final list of genes was compared with the significant gene lists from the microarray analysis. It was determined that for 271 of the 1150 significant genes from the 454 cDNA sequencing analysis, a portion of the gene sequence was represented by a probe set on the Ambystoma GeneChip. Of these, 104 genes were identified as significant by both methodologies and mRNA abundances for these genes were highly positively correlated (Additional file 6; median Spearman's correlation = 0.87). The 167 genes found to be significant by 454 cDNA sequencing, but not by microarray analysis, were mostly characterized by low fold changes from baseline (median fold change as estimated from 454 cDNA sequencing counts = 1.75); or conversely, registered low hybridization intensities in the microarray analysis (median rank among 4844 probe sets = 343).
Gene functions that were identified as significantly enriched by microarray analysis were also identified as significant by 454 cDNA sequencing. For example, the muscle contraction GO term was identified as highly enriched and the underlying genes were similarly downregulated relative to baseline (Table 4; Figure 4; Additional file 6). Also, genes sampled most often from NR14 limbs were associated with DNA metabolism, a biological  The table shows the number of Ambystoma ESTdb contigs with gene ontology annotations before and after 454 DNA sequencing. The total is less than the sum of each category because several genes may belong to multiple categories. BP = biological process. MF = molecular function.
process associated with cell cycling (Table 4), and transcripts for genes associated with cell proliferation and cell cycle progression (for example pcna, smc1, ctps, umod, psca, smc1l1, rad21) were either sampled more often among NR limbs or were only sampled from NR limbs (Additional file 5). Thus, 454 cDNA sequencing also identified genes in NR limbs that are consistent with blastema formation. Several functional terms that were not identified by microarray analysis were identified as enriched by 454 cDNA sequencing. Transcripts for 40 genes associated with macromolecule metabolism were most abundant in NR5 limbs compared with other limbs. Transcripts for 12 genes associated with macromolecule catabolism were most abundant in DL5 limbs, including mmp1, mmp3/ 10a, mmp3/10b, mmp9, and mmp13. This suggests that the presence or absence of nerves differentially affects transcriptional responses and regulation at 5 dpa. Also, 454 cDNA sequencing identified additional psca-like genes that were not represented on the GeneChip, and these were also differentially expressed between NR5 and DL5 limbs. These and other examples suggest that 454 cDNA sequencing complemented the microarray analysis by providing deeper sampling of transcriptional programs and associated biological processes. This revealed more candidate nerve-dependent gene expression changes at the earlier 5 dpa time point than was revealed by microarray analysis.

5).
Overall, the 454 sequencing approach verified the primary results from the microarray analysis and identified many new candidate genes and functional pathways that are associated with limb regeneration.

Discussion
Microarray analysis and 454 cDNA sequencing were used to identify nerve-dependent and independent gene expression changes during limb regeneration in the Mexican axolotl. The results show that limb regeneration is associated with thousands of transcriptional changes.
Considerable similarity was observed between the DL and NR transcriptional programs at 5 and 14 dpa. For example, genes that are critical to wound healing were upregulated in both limb types ( Table 6) while genes that are associated with muscle structure and function were downregulated ( Figure 4). Many of the transcriptional changes that were observed at 5 dpa were also observed at 14 dpa. Thus, many aspects of early limb regeneration are accomplished in the absence of nerves. However, gene expression differences were identified between DL and NR limbs at 5 and 14 dpa. Many of the transcriptional differences correlated with blastema formation; cell numbers increased in NR limbs after 5 dpa and this yielded a distinct transcriptional signature of cell proliferation in NR14 limbs. Overall, this study identified genes that are associated with wound healing, early events of blastema formation, and subsequent blastema cell proliferation and outgrowth. Below, we discuss and expand upon these primary results and highlight genes whose functions appear to be important for understanding the basis of nerve dependency and limb regeneration.

Early wound-healing response during limb regeneration
Previous studies have documented anatomical similarities between innervated and denervated limbs at early stages of regeneration [20,43,44]. This study shows that there are also many transcriptional similarities. This suggests that many aspects of the early wound-healing phase of limb regeneration are not dependent upon post-amputation, nerve-derived factors (Table 6). Instead, humoral immune and local tissue responses appear to be key. Many genes that are associated with wound-healing and tissue-repair activities including stress, inflammation, cell survival, immunity, and extracellular matrix remodeling were upregulated from baseline in 5 dpa limbs ( Figure 2; Table  6). It is probably no coincidence that essentially all of the early stress-associated genes that were previously identified as significantly regulated (using the same Ambystoma GeneChip) during early spinal cord regeneration [37], and during the innate immune response of axolotls to a deadly viral pathogen [38], were also identified as significant in this study. Many of these genes appear to be expressed similarly in all vertebrates in response to stress, including junb, irf1, hmox1, apoE, mmps, ptx3, gal3, gadd45g, and tgfb. Additionally, significantly more 'extracellular' genes were identified at 5 dpa than expected by chance, including genes that code for matrix remodeling proteins and secreted molecules whose functions are associated with growth factor binding, cell signaling, survival, death, adhesion, migration, and proliferation. The early wound-healing response initiates local environmental changes of the injury site that are pivotal to subsequent phases of regeneration.
The 5 dpa time point was chosen in this study to identify critical nerve-dependent signaling events that are stimulated within the first few days of regeneration (see [11]). Comparison of DL5 and NR5 data revealed few overall gene expression differences. The 5 dpa time point captured many transcriptional responses that are induced by injury and/or amputation but relatively few that are associated with known or suspected neurotrophic signaling pathways. More comprehensive sampling and deeper sequencing is needed to detail early nerve-dependent transcriptional responses because several genes that are known to be nerve-responsive during the wound-healing phase were not identified in this study (for example sp9, [11]; prrx1, tbx5, [10]).

Downregulation of genes associated with differentiated muscle
This study documented dramatic decreases in the relative abundance of mRNAs coding for skeletal muscle contractile proteins, including myosins, actins, actinins, titin, tropomyosins, and troponins. Many of these changes were also detected by 454 cDNA sequence analysis (Figure 4). This strong, muscle-specific transcriptional signature was observed because approximately half of uninjured forelimb nuclei in 7 to 9 cm axolotls, and likely more than half the cross-sectional area, derive from muscle [45]. It is Genes identified as differentially expressed by 454 sequencing analysis that have high sequence similarity to retrovirus genes. unlikely that the downregulation of muscle genes is due to retraction of the muscle towards the shoulder because muscle transcripts are much more downregulated at 14 dpa than 5 dpa in both denervated and innervated limbs.
Considering that limb tissue samples in this study included ~1 mm of undamaged tissue proximal to the amputation plane, the results suggest that injury to skeletal muscle induces tissue-wide loss of muscle contractile transcripts. It is possible that the decrease in muscle transcripts is due to muscle wasting, caused by a lack of mechanical stress. It is interesting to speculate that this response maybe associated with the degeneration or cellularization of multinucleated muscle fibers into mononucleated cells, which occurs in both denervated and innervated amputated limbs [41,[46][47][48][49][50]. It is interesting to note that muscle-specific genes, including actc1, actn2, atp2a2, my1pf, tncc, tnni2, tnni3, are downregulated during early stages of mammalian skeletal muscle regeneration [51], and this is accomplished without blastema formation. Skeletal muscle regeneration in mammals and other vertebrates involves resident stem (satellite) cells [52,53]. It is possible that muscle-specific genes are downregulated as an integral step of a conserved, skeletal muscle regeneration program of vertebrates. Candidate transcriptional repressor genes were identified in this study including id3 [54], tardbp [55], cnot [56], msx1 [6,57] and msx2. It will be important in future studies to determine if the dramatic decrease in muscle transcripts is due to activation of muscle stem (satellite) cells, muscle loss, muscle dysfunction/ wasting, or whether these transcriptional patterns have an active role in regeneration.

Genes associated with epigenetic reprogramming and genomic stability
Blastema formation requires a large number of progenitor cells derived from quiescent stem cells or differentiated cell types. It is generally known that reprogramming of differentiated cells is accompanied by epigenetic changes such as histone and DNA modifications [58,59]. Few candidates have been identified previously as bringing about epigenetic changes necessary for cellular reprogramming during regeneration [60,61]. This study identified several genes whose functions are associated with epigenetic phenomena, including chromatin remodeling, DNA methylation, and transcriptional regulation. These include uhrf1 [62], lmnb2, parp1 [63], thymopoietin [64], and a gene with high sequence identity to SAM-dependent methyltransferases (Cluster_227434_Contig1; SRV_05867_a_at).
After cellular reprogramming and during limb outgrowth, blastemal cells undergo tremendous cell proliferation. During blastema cell proliferation, telomere lengths and overall genome stability must be maintained to prevent cell death. This study identified several candidate genes from NR14 limbs that are known to function in genome stability, telomere homeostasis, and DNA repair. These include parp1 [65], hmgb2 [66], fen1 [67], aurka [68], aurkb [69], and pif1, a DNA helicase that maintains genome stability and binds to telomerase from yeast to humans [70]. It is also important to note that many transcripts were identified from NR and DL limbs that code for retroelement components (Table 5; for example polyproteins, gag proteins, reverse transcriptases, and recombinases). Retrotransposons are normally transcriptionally silenced in differentiated somatic cells by epigenetic mechanisms, but become active upon changes in epigenetic status; these may also regulate nearby gene expression [71,72]. It is unclear if upregulation of retroelement transcripts affects genome stability and/or is necessary for regeneration.

Genes associated with nerve-dependent blastema outgrowth
In this study, blastemas were not observed in NR5 limbs or DL14 limbs, and only formed on NR14 limbs. Nervedependent limb outgrowth occurs as a result of blastemal cell proliferation. Two early cell proliferation biomarkers, umod and psca, were identified as significantly different between NR5 and DL5 limbs. umod probably locates to the wound epithelium as it is upregulated in apical skin cells during thyroid hormone-induced metamorphosis of Fold-level change of upregulated genes that have functions associated with a wounding response (identified by microarray analysis). axolotl epidermis (unpublished data). It is possible that psca is a membrane receptor of blastema cells because it shows structural similarity to prod1, a surface protein that is implicated in proximal-distal positional identity of blastemal cells during newt limb regeneration [73]. Most of the cell proliferation biomarkers were identified at 14 dpa, when a blastema was present in NR limbs but absent in DL limbs. Thus, between 5 and 14 dpa, blastemal cells underwent considerable cell proliferation in the presence of nerves. Because the limb blastema continues to expand after 14 dpa, the blastema-specific genes that were identified in this study are probably transcribed at much higher levels at later time points. A clear signature of cell proliferation, including genes that function in the cell cycle, mitosis, and nucleotide synthesis, was observed in NR14 limbs. In contrast, these genes were slightly downregulated in denervated limbs at this time (Table 1; Figure 5). Thus, transcripts associated with cell proliferation are maintained at steady state (no increased proliferation) in NR and DL limbs for at least five days. This early, nerveindependent portion of the limb regeneration program may allow time for re-innervation of the injury site and production of nerve-derived molecules in sufficient quantity to initiate and sustain blastemal cell proliferation.
Multiple gene products have been hypothesized to be neurotrophic factors provided by nerves to sustain blastema cell proliferation. These include growth-promoting factors such as fibroblast growth factors [11,31], substance P [74], neuregulin [33], and transferrin [32]. Transferrin and neuregulin were sampled by 454 sequencing, but were not identified as differentially expressed by our analysis. fgf8 and fgf10 were screened out of the microarray analysis due to low expression, but post-hoc analysis suggested that both are upregulated in NR14 limbs compared to control and DL14 limbs (Additional file 7). Expression of these molecules is known to be nervedependent in blastemas of Xenopus [75] and axolotls [3,4,76]. Other molecules that have previously been associated with the blastema during limb regeneration were not included in our statistical analyses due to low hybridization intensity, but were later found to be differentially expressed in NR14 limbs, including hoxd10, hoxa13, hoxa11, and msx1 (Additional file 7). Recently, Kumar et al [34] identified a growth-promoting extracellular ligand (nAG) that rescues aspects of nerve dependency of limb regeneration in newts. Probesets for nAG are represented on the Ambystoma GeneChip and transcripts for this gene were sampled by 454 cDNA sequencing. nAG mRNA was transcribed at a high level in all tissues, but did not differ significantly between NR, DL, or control limbs. It is possible that the effects of nAG and other neurotrophic candidates are associated with quantitative variation of mRNA transcript abundances over fine temporal and spatial scales. Such variation would have been missed in this analysis of three time points and sampling of mRNAs from heterogeneous tissues. Furthermore, it is possible that the increase in nAG immunoreactivity observed by Kumar et al [34] is regulated at the level of translation and would not be observed using microarray or sequencing approaches. Overall, we found that several, but not all, genes previously shown to be directly downstream of neurotrophic factors are expressed at higher mRNA levels in the blastema versus control and denervated samples.
Singer [17] emphasized that the neurotrophic factor underlying nerve dependency was growth promoting and quantitative in effect. He showed that the trophic effect of nerves could be titred by surgically manipulating the number of nerves innervating the limb [16]. Other lines of research showed that nerve-derived factors were not necessary for traversing cell cycle checkpoints, as an equivalent number of presumptive blastema cells are initially observed to enter S phase in denervated and innervated limbs [21,[77][78][79]. Instead, the neurotrophic factor appears to be necessary to complete the cell cycle and this may be non-trivial considering the cost of replicating a large salamander genome. These studies suggest that nerves either directly or indirectly provide limiting macromolecules that are needed to accomplish cell division. According to this reasoning and assuming a correlation between transcript and protein abundance for specific mRNA species, transcripts associated with nerve dependency would be expected to exceed baseline levels in NR limbs during regeneration, and decrease in abundance in DL limbs. This study identified many human-axolotl presumptive orthologs and anonymous axolotl transcripts that showed this pattern. For example, psat codes for a protein that regulates the second enzymatic step of the phosphorylated pathway in mammals, which produces L-serine [80]. PSAT expression (protein and mRNA) is high among cell types with high rates of proliferation, including cancer cell lines [81,82]. Strikingly, psat registered the largest mRNA abundance difference between NR14 and DL14 limbs among probesets on the Ambystoma GeneChip (8.2-fold change), and this result was verified by both 454 cDNA sequencing (NR14 = 20.1, DL14 = 3) and real-time PCR (8.37-fold change; data not shown). We also found that the third enzyme in the phosphorylated pathway, psph, was 3.58-fold higher in NR14 versus DL14 by real-time PCR (data not shown). PSAT and PSPH may have neurotrophic potential but more likely function to provide proliferating cells with a limiting and conditionally important substance (L-serine) that is required for synthesis of macromolecules, amino acids, and purines that are needed to accomplish mitosis [80]. The fact that macromolecular synthesis and cell proliferation are both depressed in denervated limbs [83][84][85][86] supports the idea that PSAT activity is associated with nerve-derived factors that contribute to blastema outgrowth.

Comparison of microarray and cDNA sequencing approaches
Microarray analysis and 454 cDNA sequencing offer different advantages for dissecting complex biological processes. The strength of microarray analysis is the precision that this approach provides for estimating transcript abundances for a specific panel of genes. Such gene panels provide standardized markers for identifying shared and unique patterns of transcription among experiments. In turn, identification of such patterns provides systemslevel insight. For example and as was discussed above, cross-experiment comparisons of axolotl transcription helped distinguish general stress responses from local regenerative responses. A relatively conservative fold level threshold (> 1.5-fold) was used in this study to identify significant genes in the microarray analysis and lowly expressed genes (in the lowest quartile) were removed from consideration. These conservative approaches potentially exclude important genes but likely discover 'real' transcriptional differences between samples. A lower threshold could be applied to extract more information from the dataset and future studies would likely benefit by increasing replication of GeneChips to detect significant differences for lowly expressed genes. With respect to 454 cDNA sequencing, two goals were accomplished: gene discovery and estimation of transcript abundance. However, accomplishment of both goals required a trade-off in the allocation of resources toward deep sequencing versus replication. Sequencing resources were used to deeply sequence Day 0, NR, and DL cDNA libraries instead of shallowly sequencing replicate libraries for these time points. This strategy identified thousands of new gene sequences that will greatly enrich future regeneration studies. Future studies will benefit from experimental designs that replicate deep sequencing of cDNA libraries and this will be possible as sequencing costs decrease. Still, it was encouraging to find in this study that 38% of genes that were identified as differentially expressed by 454 cDNA sequencing and found on the microarray GeneChip were identified as significant by both of these technologies, and transcript abundances for these genes were highly positively correlated.

Conclusion
This study addressed the nerve dependency of limb regeneration by characterizing downstream cellular events that are affected when an intact nerve supply is removed from an amputated salamander limb. Microarray analysis showed that the early wound response is largely nerveindependent, but transcriptional profiles diverge between denervated and innervated limbs when the innervated limb starts to regenerate. Pyrosequencing supported these microarray results while substantially increasing sequence information from the salamander transcriptome. This study shows the utility of next-generation sequencing platforms for gaining transcriptome information [87][88][89]. This new DNA sequence information will greatly enrich future regeneration studies using the axolotl.

RNA extraction and microarray analysis
DL and NR limbs were collected approximately 1 mm proximal to the amputation plane 5 dpa and 14 dpa. Nine animals were used for each time point and limbs were pooled into three groups of three. Left and right limbs were paired within animals when possible in making the pools for the 5 and 14 dpa time points. The day 0 pools were created using only the right limbs of nine different individuals that were collected within minutes following limb amputation. RNA was extracted using Trizol Reagent (Invitrogen, Carlsbad, CA, USA) followed by RNeasy minicolumns (Qiagen, Valencia, CA, USA). RNA quality was assessed by spectrophotometry using a Nanodrop ND-1000 (Nanodrop, Wilmington, DE, USA) and run on a Bioanalyzer 2100 (Agilent, Santa Clara, CA, USA). The Ambystoma microarray platform was produced by the Voss lab and Affymetrix and has been described elsewhere [35,37]. Total RNA was used to produce cRNA probes for GeneChip hybridizations (Affymetrix, Santa Clara, CA, USA) at the University of Kentucky Microarray Core Facility according to standard Affymetrix protocols. Probe level quality control analyses were performed as described in [37]. Data processing and statistical analysis was performed using the Affy Bioconductor package for the R statistical environment [90]. Background correction, normalization, and probe set summarization were performed via the robust multi-array average (RMA) algorithm of [91]. Correlation matrices (Pearson's r) for replicate GeneChips at the probe-set level were produced to assess correlation between GeneChips (minimum r = 0.9787, maximum r = 0.9952). Probe-sets were removed if mean signal intensity was less than the mean of the lowest quartile across all 15 GeneChips (mean ± standard deviation = 7.748 ± 0.031). Some microarray technologies may provide unreliable hybridization estimates for lowly expressed genes [92]. For this reason, a stringent cut-off was applied that removed the bottom quartile of genes for significance testing. Probe-set filtering yielded 3656 probe-sets for significance testing.

Microarray analysis
The limma package [93,94] available from Bioconductor was used to conduct three analyses to statistically identify differentially expressed genes. In the first analysis, linear models were fit to each gene. These models used coefficients to denote each of the five treatments by sampling time combinations. The following coefficients were contrasted: Day 0 versus NR 5 dpa (NR5), Day 0 versus DL 5 dpa (DL5), Day 0 versus NR 14 dpa (NR14), Day 0 versus DL 14 dpa (DL14), NR5 versus NR14, and DL5 versus DL14. The other two analyses were equivalent to paired ttests and compared NR5 versus DL5, and NR14 versus DL14. Multiple testing was corrected using an FDR cut-off of 0.05 and then a fold-change filter (≥ 1.5-fold change) was implemented to derive final gene lists. All microarray data are available at [95]. The identity of differentially expressed salamander transcripts was inferred from presumptive human orthologs. Orthology was assumed for all salamander transcripts that exhibited significant sequence similarity to protein coding sequences from human RefSeq and nr databases (BLASTx, e < 1 × 10 -7 ). Kmeans gene clustering was performed using the Genesis software package [96]. Presumptive human-salamander orthologs were further annotated using GO terms and tools provided by the Database for Annotation, Visualization, and Integrated Discovery [97]. Significantly overrepresented GO terms (EASE score p < 0.01) were identified for specific treatment/sampling time combinations. Certain GO terms were excluded from the results if similar information was represented by a similar GO term. The null expectation for GO term representation was obtained by assigning GO terms to 3271 EST contigs from Ambystoma ESTdb [95].

cDNA Sequence Analysis
The same total RNA samples that were used in the microarray analysis were used to produce cDNA templates for 454 pyrosequencing. cDNA libraries were generated for Day 0, NR5, NR14, DL5, and DL14 RNA samples using the Super SMART cDNA Synthesis protocol (Clontech, Mountain View, CA). Single-stranded cDNA template was amplified using the Advantage 2 PCR Kit (Clontech) and size selected according to manufacturer's instructions.
cDNAs were sequenced using the Genome Sequencer FLX System (Roche Applied Science, Indianapolis, IN). Seq-Clean was used for vector/poor quality trimming, bacterial contaminant screening, and identification of A. mexicanum mitochondrial DNA and rDNA sequences [98]. Retained sequences were pre-clustered using PaCE and then assembled using CAP3 with a 90% sequence similarity threshold [99,100]. Contigs (including singletons) were searched using BLAST algorithms against the Ambystoma ESTdb, human and nr RefSeq databases, and Xenopus laevis and X. tropicalis Unigene sets. Annotated queries that returned a significant BLAST hit were assigned the gene identifier of the best-matching subject sequence. All of these new 454 sequence reads have been submitted to the Short Read Archive (SRA) at the National Center for Biotechnology Information (NCBI), accession SRA004195.2. 454 Sequences were assembled with previous EST data and are available at Sal-Site [95].
The number of times each 454 DNA sequence read matched a unique EST contig from the Ambystoma ESTdb was recorded, and these count data were used to estimate mRNA abundances for presumptive axolotl genes. The following method was used to identify differentially expressed genes among 10,275 contigs that were sampled ≥ 5 times across all five cDNA libraries. First, 5000 random draws were taken from a multinomial distribution to derive expected count data for each gene (k = 5 cDNA libraries, n = the sum of counts across all libraries for a given gene, and p 1 , p 2 , p 3 , p 4 , and p 5 = expected proportion of counts per library given unequal sampling among cDNA libraries). Then, χ 2 statistics were calculated, on a gene-by-gene basis, for each of these random draws. P-values for each gene were estimated by calculating the proportion of randomized χ 2 statistics that were ≥ to the χ 2 statistic associated with the observed data. Contigs with Pvalues ≤ 0.001 were considered differentially expressed. Upon examination of this dataset, it was noticed that 589 of the significant EST contigs were uniquely derived from different cDNA libraries and these tended to form small (< 250 bp) contigs that did not match previous sequences in the Ambystoma ESTdb. These sequences were considered cloning/sequencing artifacts and removed from the dataset. This yielded a final dataset of 1150 significant genes. GO analyses were performed as described above with the exception that human default GO term frequencies were used to establish null expectations (EASE score p < 0.02).

Authors' contributions
JRM and LGE performed surgeries, tissue collections, and RNA isolations. SP gave bioinformatic support and assembly of 454 sequences. RBP performed statistical analysis of microarray and 454 data. JRM and JAW performed histology, immunohistochemistry, and real-time PCR. SRV performed 454 sequencing template preparations. SRV and JRM drafted the manuscript. CKB, WZ, GMP, IMV, TH, SVB, DMG TTH, SRV, and JRM participated in the design of the study and helped draft the manuscript. All authors read and approved the final manuscript.