- Research article
- Open Access
Epigenetic regulation of human-specific gene expression in the prefrontal cortex
BMC Biology volume 21, Article number: 123 (2023)
Changes in gene expression levels during brain development are thought to have played an important role in the evolution of human cognition. With the advent of high-throughput sequencing technologies, changes in brain developmental expression patterns, as well as human-specific brain gene expression, have been characterized. However, interpreting the origin of evolutionarily advanced cognition in human brains requires a deeper understanding of the regulation of gene expression, including the epigenomic context, along the primate genome. Here, we used chromatin immunoprecipitation sequencing (ChIP-seq) to measure the genome-wide profiles of histone H3 lysine 4 trimethylation (H3K4me3) and histone H3 lysine 27 acetylation (H3K27ac), both of which are associated with transcriptional activation, in the prefrontal cortex of humans, chimpanzees, and rhesus macaques.
We found a discrete functional association, in which H3K4me3HP gain was significantly associated with myelination assembly and signaling transmission, while H3K4me3HP loss played a vital role in synaptic activity. Moreover, H3K27acHP gain was enriched in interneuron and oligodendrocyte markers, and H3K27acHP loss was enriched in CA1 pyramidal neuron markers. Using strand-specific RNA sequencing (ssRNA-seq), we first demonstrated that approximately 7 and 2% of human-specific expressed genes were epigenetically marked by H3K4me3HP and H3K27acHP, respectively, providing robust support for causal involvement of histones in gene expression. We also revealed the co-activation role of epigenetic modification and transcription factors in human-specific transcriptome evolution. Mechanistically, histone-modifying enzymes at least partially contribute to an epigenetic disturbance among primates, especially for the H3K27ac epigenomic marker. In line with this, peaks enriched in the macaque lineage were found to be driven by upregulated acetyl enzymes.
Our results comprehensively elucidated a causal species-specific gene-histone-enzyme landscape in the prefrontal cortex and highlighted the regulatory interaction that drove transcriptional activation.
Compared with other primates, humans not only have a large brain relative to their weight, but also show a significant leap in functional complexity, including cognition, thinking, and communication [2, 3]. However, the DNA sequences of humans and their nearest relative, the chimpanzee, differ by only 1.2%, and the two species diverged approximately 6–8 million years ago [4,5,6]. Given this estimated divergence, a crucial question arises: how did human cognitive ability develop during such a short evolutionary time? Changes in gene expression levels have long been thought to play an important role in human evolution [7, 8]. Indeed, when adult humans are compared with other primates, such as chimpanzees and rhesus macaques, an excess of human-specific expression changes is observed in genes in the cerebral cortex [8, 9]. Given that this excess has not been found in other tissues, such as the blood and liver, gene expression changes in human brains are thought to underpin human cognitive ability. In addition, previous studies revealed that genes expressed in the brain accumulated more changes and displayed a 3–5 times faster divergence of developmental patterns on the human lineage than on the chimpanzee lineage [11, 12].
Human cognitive ability and brain development proceed in parallel during the course of development. Humans, especially infants and children, learn and absorb cultural knowledge within their own group as a fundamental basis for their cognitive ability. However, humans and other primates demonstrate considerable differences in cultural knowledge absorption as early as infancy. Accordingly, it is speculated that the developed cognitive ability of humans during an individual’s development is probably a manifestation of specific anatomical and tissue characteristics during brain development. Moreover, these specific anatomical and organizational features are probably reflected in changes at molecular levels specific to human brain development, such as gene expression and splicing, protein and metabolite concentrations, and even epigenetic modification states.
Previous studies have shown that > 15% of genes in the prefrontal cortex (PFC) of humans and rhesus macaques show different developmental expression patterns. By combining microarray and high-throughput sequencing techniques, most of the genes in the PFC and cerebellar cortex of humans, chimpanzees, and rhesus macaques were found to vary significantly with age. Although this change with age is conserved in primate brains, nearly a thousand genes have been identified to exhibit human-specific expression profiles, most of which are present in the early stage of the prefrontal cortex, with a gene count four-fold higher than that of chimpanzees [12, 16]. Additionally, many of these genes are closely linked to the nervous system. The difference is probably a reflection of the extreme delay in human synaptic development. This extreme developmental delay is thought to provide sufficient time for infants to build a more complex neural network than other primates, thus laying the foundation for human cognitive capacity. These studies show the importance of comparing differences in brain developmental expression patterns between humans and other primates to understand human cognitive evolution. Accordingly, elucidating the molecular regulation mechanism of brain-specific developmental expression patterns can not only provide a deeper understanding of human cognitive evolution but also has practical significance for society. This is because the healthy development of the human brain is essential for shaping cognitive ability, while its disturbed development leads to severe cognitive impairment and mental retardation, such as autism and schizophrenia (SZ). Currently, humans suffering from development-related cognitive disorders account for 5–10% of the population worldwide. Such individuals are likely to require medical care for most of their lives, which represents a considerable burden on both their families and society. 
Studying the molecular regulation mechanism of human brain-specific gene expression helps to reveal the origin of human cognitive ability and provides a valuable theoretical basis for targeted therapy for diseases associated with brain development.
Thus far, some preliminary findings have revealed the molecular regulation mechanism of human cognitive ability formation. In the search for cis-regulatory elements, scanning regulatory regions such as promoters and enhancers at the genome-wide level has led to the identification of numerous human-specific mutations in the vicinity of genes involved in nervous system development or brain-specific expression [19, 20]. In the search for trans-regulatory elements, although few microRNAs and transcription factors (TFs) have been found, the TFs MEF2A (myocyte-specific enhancer factor 2A) and EGR1–3 (early growth response protein 1–3) represent some examples [12, 16]. Consistent with functional analysis, these TFs regulate the nervous system, including neural survival and synaptic transmission [21,22,23], further suggesting their important regulatory role in the formation of human cognitive ability.
To date, hundreds of human brain-specific gene expression changes can be explained by a limited number of microRNAs, TFs, mutations in regulatory elements, or epigenetic changes [12, 16, 24,25,26,27]. It is worth noting that histone modification can directly regulate the condensed state of chromatin and thus affect gene expression [26, 28,29,30,31]. Histone modification has long been thought to be closely related to the healthy development of the brain, aging, and cognitive disorders [32,33,34,35]. Previous studies have found that histone modification status changes significantly in the brains of primates and patients with cognitive disorders. For example, histone H3K4me3 was modified in the forebrain of newborns. Moreover, the altered state of histone H3K4me2 showed differences in the PFC of rhesus macaques at different ages. Another study identified hundreds of brain-specific modification sites and dozens of brain-specific modification-loss sites by comparing differences in histone H3K4me3 levels in adult humans, chimpanzees, and rhesus macaques. However, human brain cognitive development is a systematic process, and these studies are often limited to a single modified state or a single species, and lack comparison with other molecular levels. Thus, the histone regulatory mechanism of specific gene expression in human brains still remains elusive.
In this study, we measured the genome-wide distribution of H3K4me3 and H3K27ac in the PFC of human, chimpanzee, and rhesus macaque brains, and identified species-specific histone modification sites. By integrating the transcriptome data of humans, chimpanzees, and rhesus macaques derived from the same biological samples, a histone modification regulatory network for the specific expression of genes in the human brain was constructed and its key regulatory factors were identified. Our results significantly contribute to the systematic understanding of molecular regulatory mechanisms of human cognitive ability.
H3K4me3 and H3K27ac landscapes of the PFC
In this study, we focused on the dorsolateral PFC, which is involved in cognitive operations that are important for informed choice and creativity, among other executive functions, and represents a highly associative cortex that underwent disproportionate morphological expansion during primate evolution. Prefrontal H3K4me3 and H3K27ac epigenomes from the gray matter of three adult humans, three adult chimpanzees, and three adult rhesus macaques were measured (Additional file 1: Table S1). The Illumina HiSeq 2000 sequencing platform was used to obtain 20,520,716 to 33,844,133 raw reads for each sample from the H3K4me3 data and 24,548,467 to 29,119,905 raw reads for each sample from the H3K27ac data (Additional file 1: Table S2). MACS (Model-based Analysis of ChIP-Seq) software was used to identify 21,864–37,423 raw peaks for the H3K4me3 epigenome and 8868–31,391 raw peaks for the H3K27ac epigenome (Fig. 1A, B). Consistently, H3K4me3 was mainly enriched around the transcriptional start site (TSS) of active genes and H3K27ac preferentially resided in the promoter and enhancer region. The vast majority (78.3–87.1%) of H3K4me3 peaks were located proximal to the TSSs (within 3 kb) of annotated genes (Fig. 1C), whereas more than 20% of H3K27ac peaks were positioned within 3 kb of known TSSs (Fig. 1D). Peaks with a shared locus were combined to obtain a total of 25,929 H3K4me3 peaks and 14,617 H3K27ac peaks (Additional file 1: Tables S3 and S4). Among them, approximately 60% of the H3K4me3 peaks (15,452 of 25,929) were shared between humans and chimpanzees, with more than 56% of H3K4me3 peaks (14,533 of 25,929) shared across the three species (Fig. 1A). A similar result was observed in the H3K27ac data, in which approximately 18% of H3K27ac peaks (2625 of 14,617) were shared between humans and chimpanzees, with more than 14% of H3K27ac peaks (2049 of 14,617) shared across the three species (Fig. 1B). Moreover, both hierarchical clustering (Fig. 1E, F), based on peak coordinates and peak intensity, and principal component analysis (Fig. 1G, H), based on normalized read counts for all samples, consistently revealed the highest similarity of PFC epigenomes within species and a trend toward separation between species. The Pearson correlation coefficients of the peak locations ranged from 0.846 to 0.948 for H3K4me3 peaks and 0.461–0.759 for H3K27ac peaks within species, while the Pearson correlation coefficients of the peak intensity ranged from 0.944 to 0.989 for H3K4me3 peaks and 0.893–0.977 for H3K27ac peaks within species (Fig. 1E, F).
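The step of combining peaks with a shared locus can be illustrated with a minimal Python sketch. It assumes that peaks are half-open (chromosome, start, end) intervals already mapped to a common coordinate system and that a "shared locus" means an overlap of at least one base; the text does not state the exact overlap rule or tool, so the function name `merge_peaks` and the toy coordinates are hypothetical.

```python
from collections import defaultdict

def merge_peaks(peaks_by_species):
    """Union overlapping peaks across species.

    Returns (chrom, start, end, species_set) tuples, where species_set
    lists every species contributing a peak to the merged interval.
    Overlap rule (an assumption): intervals sharing at least one base.
    """
    by_chrom = defaultdict(list)
    for species, peaks in peaks_by_species.items():
        for chrom, start, end in peaks:
            by_chrom[chrom].append((start, end, species))

    merged = []
    for chrom in sorted(by_chrom):
        cur_start = cur_end = None
        cur_species = set()
        for start, end, species in sorted(by_chrom[chrom]):
            if cur_start is not None and start < cur_end:
                # Overlaps the running interval: extend it.
                cur_end = max(cur_end, end)
                cur_species.add(species)
            else:
                if cur_start is not None:
                    merged.append((chrom, cur_start, cur_end, frozenset(cur_species)))
                cur_start, cur_end, cur_species = start, end, {species}
        if cur_start is not None:
            merged.append((chrom, cur_start, cur_end, frozenset(cur_species)))
    return merged

# Toy input (hypothetical coordinates, not data from the study).
toy = {
    "human": [("chr1", 100, 200), ("chr1", 500, 600)],
    "chimpanzee": [("chr1", 150, 250)],
    "macaque": [("chr1", 180, 300), ("chr2", 10, 50)],
}
merged = merge_peaks(toy)
shared_all = [p for p in merged if len(p[3]) == 3]
print(len(merged), "merged peaks;", len(shared_all), "shared by all three species")
```

Counting merged intervals by their species set gives the kind of shared versus species-specific tallies reported for Fig. 1A, B.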
Human-specific signatures of the H3K4me3 epigenome
To identify loci with human-specific H3K4me3 and H3K27ac signatures in PFC, we screened 25,929 H3K4me3 peaks and 14,617 H3K27ac peaks detected in primates. Using the “edgeR” package, we identified 2396 peaks significantly enriched in human samples as compared with the two non-human primates after correcting for false discovery rate (FDR). We further grouped the human-specific H3K4me3 peaks into upregulated and downregulated categories, with histone modification levels that were at least 1.2-fold increased or decreased in humans compared to those in chimpanzees and macaques. In total, we obtained 1175 upregulated H3K4me3 peaks and 775 downregulated H3K4me3 peaks in humans. In parallel, 487 upregulated H3K4me3 peaks and 219 downregulated H3K4me3 peaks in the chimpanzee were obtained. The H3K4me3 peaks were specifically upregulated or downregulated in humans approximately 2.4 and 3.5 times, respectively, more than those in the chimpanzee. Hereafter, we use the term “H3K4me3HP” to denote “human-specific H3K4me3 peaks” for further analysis (Fig. 2A; Additional file 1: Tables S5 and S6). Strikingly, the genes adjacent to H3K4me3HP gain regions showed highly significant enrichment for genes involved in myelination assembly, neuronal ensheathment, receptor clustering, and high overlap with oligodendrocyte markers (48/365 genes, BH-corrected P = 1.905 × 10−15), all of which were related to myelin membrane formation and signaling transmission (Fig. 2B; Additional file 1: Table S7). The hubs with H3K4me3HP gain regions, that is, the genes proximal to H3K4me3HP gain regions with the highest connectivity, were RUNX2, SOX2, FOXO3, CDK1, CDH1, PTK2, and HIST2H3PS2 (Fig. 2B). RUNX2, SOX2, and FOXO3 are well-recognized TFs required for the self-renewal of neural stem and progenitor cells, thereby promoting the cell cycle in proliferating progenitors [38,39,40,41,42]. 
CDK1, CDH1, PTK2, and HIST2H3PS2 are involved in cell cycle progression, cellular adhesion, and proliferation [43,44,45]. Moreover, the genes adjacent to the H3K4me3HP loss regions were enriched for CA1 pyramidal neuron markers (26/329 genes, BH-corrected P = 6.927 × 10−4) and markers for S1 pyramidal neurons (17/226 genes, BH-corrected P = 9.855 × 10−3), as well as for genes associated with synaptic transmission and axonogenesis, all of which are related to synaptic activity (Fig. 2C; Additional file 1: Table S8). Consistent with this functional annotation, two of the hubs were MAPK3 and GRIN1, which play a vital role in the plasticity of synapses, thereby contributing to memory and learning (Fig. 2C) [46, 47]. Another four of the hubs were CD44, RHOA, GNAI2, and PRKACA, all with active roles in transmembrane activities [48,49,50,51].
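The thresholding used to call human-specific gain and loss peaks can be sketched as follows. This is not a reimplementation of edgeR; it only illustrates Benjamini-Hochberg correction followed by the 1.2-fold filter against both non-human primates, using the BH-corrected P < 0.05 cutoff stated for the replication analysis. The function names and the toy p-values and fold changes are hypothetical.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

def classify_peak(fc_vs_chimp, fc_vs_macaque, fdr, min_fc=1.2, alpha=0.05):
    """Label a peak 'gain', 'loss', or None from human/other fold changes."""
    if fdr >= alpha:
        return None
    if fc_vs_chimp >= min_fc and fc_vs_macaque >= min_fc:
        return "gain"
    if fc_vs_chimp <= 1 / min_fc and fc_vs_macaque <= 1 / min_fc:
        return "loss"
    return None

# Toy values (hypothetical): one p-value and two fold changes per peak.
pvals = [0.001, 0.002, 0.03, 0.8]
fold_changes = [(1.5, 1.3), (0.5, 0.6), (1.1, 1.4), (2.0, 2.0)]
fdrs = bh_adjust(pvals)
labels = [classify_peak(c, m, q) for (c, m), q in zip(fold_changes, fdrs)]
print(labels)

# Ratios of human to chimpanzee peak counts reported in the text.
gain_ratio = 1175 / 487
loss_ratio = 775 / 219
print(round(gain_ratio, 1), round(loss_ratio, 1))
```

The third toy peak is significant but exceeds the fold-change threshold against only one of the two species, so it is not labeled human-specific.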
Human-specific signatures of the H3K27ac epigenome
We identified 799 H3K27ac peaks (gain: 483; loss: 316) and 242 H3K27ac peaks (gain: 131; loss: 111) that showed significant changes in human and chimpanzee PFCs, respectively. Slightly different from the observation for H3K4me3 modification, the H3K27ac peaks that were specifically upregulated or downregulated in humans were approximately 3.7 and 2.8 times, respectively, more numerous than those in the chimpanzee. Hereafter, we use the term “H3K27acHP” to denote “human-specific H3K27ac peaks” for further analysis (Fig. 3A; Additional file 1: Tables S9 and S10). Among the genes close to H3K27acHP gain regions, two hubs identified by a network approach (PDE4B and ARRB1) are known to mediate cellular responses to extracellular stimuli (Fig. 3B) [52, 53]. Additionally, these genes showed overrepresentation of interneuron markers (12/302 genes, BH-corrected P = 6.317 × 10−4) and oligodendrocyte markers (11/365 genes, BH-corrected P = 6.422 × 10−3) (Additional file 1: Table S11). The genes adjacent to H3K27acHP loss regions showed highly significant enrichment for many tightly related gene ontology categories, including synapse organization and activity, and were enriched for CA1 pyramidal neuron markers (13/329 genes, BH-corrected P = 3.133 × 10−4), suggesting a critical role in neuronal signal transduction (Fig. 3C; Additional file 1: Table S11). FOS, one of the hubs, is currently regarded as a marker of neuronal activity and has been associated with neural and behavioral responses to extracellular stimuli. KAT2B is a histone lysine acetyltransferase that is highly expressed in the brain, highlighting its importance for brain function and proper development.
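Marker-overlap statistics such as "48/365 genes" are typically assessed with a hypergeometric (one-sided Fisher) test. The sketch below computes the upper-tail probability with the standard library only. The marker set size (365) and overlap (48) are taken from the text, but the background size and the number of genes near gain regions are hypothetical placeholders, so the printed p-value is illustrative and is not expected to match the reported BH-corrected value.

```python
from math import comb

def hypergeom_sf(k, N, K, n):
    """P(X >= k) for a hypergeometric draw.

    N: background genes, K: marker genes in the background,
    n: genes in the test list, k: observed overlap.
    """
    upper = min(K, n)
    total = comb(N, n)
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total

# Small case that can be checked by hand: (6*6 + 4*1) / 120 = 1/3.
small = hypergeom_sf(2, 10, 4, 3)

# Illustrative call: 48 of 365 markers overlap; N and n are assumptions.
background_size = 12000   # hypothetical number of background genes
test_list_size = 1000     # hypothetical number of genes near gain regions
p = hypergeom_sf(48, background_size, 365, test_list_size)
print(small, p)
```

In a full analysis, one such p-value would be computed per cell-type marker set and the resulting values corrected for multiple testing, for example with the Benjamini-Hochberg procedure.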
Regulation of the human-specific gene pattern by H3K4me3HP and H3K27acHP
To determine the consequences of epigenetic divergence on gene expression divergence across the three primates, we first tested the fraction of human-specific expressed genes subjected to H3K4me3HP and H3K27acHP in PFC, respectively. We used transcript levels measured by RNA-seq in the PFC of ten humans, eight chimpanzees, and seven rhesus macaques published previously [56,57,58,59]. We re-analyzed the 12,464 genes expressed among all three adult primates, of which, 362 genes were differentially expressed between humans and the other two primates (Fig. 4A). It has been reported that cis-elements accumulate in the regulatory region over time and tend to affect each gene independently. Unsurprisingly, approximately 7% of human-specific expressed genes were correlated with H3K4me3HP, whereas approximately 2% were correlated with H3K27acHP (Fig. 4B). This observation may be attributed to the fact that approximately a 2.5-fold excess of H3K4me3 peaks showed human-specific histone modification compared to H3K27ac peaks, thereby shaping transcriptome divergence by epigenetic modification in H3K4me3.
To test whether these findings were replicable and to further validate the results in an independent dataset, we obtained 1062 H3K4me3HP (gain: 885; loss: 177) with at least 1.5-fold differential tag density in nine adult humans compared with three macaques and four chimpanzees from published epigenetic data, which were detected using an independent Poisson statistic. As a result, approximately 7% of human-specific expressed genes were enriched with H3K4me3HP, which agrees with our measurement. Moreover, for H3K27ac, we re-analyzed an additional published epigenome from the PFC region of three humans, two chimpanzees, and three macaques. Of the total 60,702 H3K27ac peaks, 933 H3K27acHP (gain: 428; loss: 505) were detected under the same filtering criteria (BH-corrected P < 0.05, fold change > 1.2) with the same statistical procedure, namely the quasi-likelihood methods implemented in the edgeR package. Consequently, approximately 3% of human-specific expressed genes were enriched with H3K27acHP. Because this dataset contained roughly four-fold more peaks than ours, we also applied a more stringent criterion (fold change > 2), under which the number of H3K27acHP decreased to 740 (gain: 284; loss: 456) and approximately 2% of human-specific expressed genes were found to co-localize with H3K27acHP. Therefore, independent analyses produced robust and reproducible results.
It has been previously reported that strand-specific RNA-seq (ssRNA-seq) provided more reliable resolution of transcriptome profiling and more accurate measurement of gene expression. Here, we collected the same cohort of three humans, three chimpanzees, and three macaques from PFC neurons measured using the ssRNA-seq technique. We obtained 40,680,796 to 59,816,568 raw reads for each sample from the ssRNA-seq data (Additional file 1: Table S12) and screened 28,850 genes sharing comparable expression patterns across three primates (Additional file 1: Table S13). A total of 767 genes (up: 542; down: 225) were determined to be differentially expressed between humans and the other two primates (Fig. 4C). Remarkably, the species-specific genes overlapped significantly with their counterparts reported in the RNA-seq dataset (Fisher’s exact test, P < 0.001; Additional file 2: Fig. S1A). Furthermore, the expression changes specific to each species observed both in the RNA-seq and ssRNA-seq datasets correlated very well (Pearson correlation coefficient, r = 0.77 ± 0.13), despite the only partial overlap of genes between studies (Additional file 2: Fig. S1B). Albeit marginally, approximately 8 and 3% of human-specific expressed genes were enriched with H3K4me3HP and H3K27acHP, respectively (Fig. 4D).
To further estimate whether species-specific histone modification affects the genes that showed changes in expression during development, we tested the overlap between genes related to a species-specific histone modification and three types of genes identified in a previous study: genes with constant expression across the lifespan (type I), genes showing variable expression across the lifespan but no developmental pattern differences among species (type II), and genes with developmental remodeling within species (type III). All three types were found to be profoundly enriched with H3K4me3HP, while H3K27acHP appeared to show a significant overrepresentation of type II and type III genes (Additional file 2: Fig. S2). Consequently, species-specific histone modification affected not only the genes with expression changes in primates but also those with developmental remodeling of their expression patterns in primates. These findings suggest that human-specific histone modification contributes significantly to gene expression evolution in primate PFC.
Histone-TF target regulatory network
We hypothesized that the human-specific gene expression in the PFC is possibly driven either by an orchestrated interplay of cis-elements and trans-factors or independently. To this end, we focused on 1950 H3K4me3HP and 799 H3K27acHP from our dataset, as well as 211 TFs from a previously published study. As expected, TF, H3K4me3, and H3K27ac significantly correlated with a combination of three species-specific genes detected in our ssRNA-seq dataset (Fig. 5A). Next, we investigated the relationship between gene expression differences and corresponding TF expression differences, as well as the relationship between gene expression differences and coupled epigenetic differences between humans and chimpanzees by fitting a linear regression model. To avoid bias, only those TFs that were strongly correlated with the expressed genes, i.e., absolute Pearson correlation coefficient > 0.6 and P < 0.05, were used in the model. In total, we identified 69 TFs with at least one target gene that was human- or chimpanzee-specific (Additional file 1: Table S14). Among them, the five most relevant TFs showed both positive and negative correlations with their regulatory target genes: SREBF1, which is downregulated in oligodendrocytes in Alzheimer’s disease (AD); POU2F1, a potential regulator of excitatory neuron development in a mouse model; PAX8, which increases survival and immortalization in glioma cells; POU5F1 (also known as OCT4), whose ectopic expression, combined with SOX2 and KLF4, promoted axon regeneration after injury; and MEIS1, which is involved in the development of the central nervous system and is considered to be the strongest genetic risk factor for restless legs syndrome. Consequently, we found a significant positive correlation between the three regulators and human-chimpanzee gene expression differences (permutation test, P < 0.001; Additional file 2: Fig. S3A), which was presumably due to the transcriptional activation function of TFs and histones. 
TFs explained approximately 4% of the gene expression differences between humans and chimpanzees; the H3K4me3 and H3K27ac peaks located close to human- and chimpanzee-specific genes explained as high as 17.8 and 18.6% of the gene expression differences, respectively, which were profoundly larger than those to be expected by chance (permutation test, P < 0.001; Fig. 5B and Additional file 2: Fig. S3B). This result raised the possibility that histone H3 lysine 4 trimethyl and lysine 27 acetyl modification contributed to a wider extent of gene expression variation between humans and chimpanzees.
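The idea of "variance explained, compared with chance" can be sketched with a single-predictor linear regression and a label-shuffling permutation test. This is a simplified illustration under stated assumptions: one regulator value per gene (for example, a human-chimpanzee difference in peak intensity), one expression difference per gene, and R² as the explained fraction. The exact model specification is not given in the text, and the data below are synthetic.

```python
import random

def r_squared(x, y):
    """Fraction of variance in y explained by a simple linear fit on x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return (sxy * sxy) / (sxx * syy)

def permutation_pvalue(x, y, n_perm=1000, seed=0):
    """Observed R^2 and the share of shuffled datasets that match or beat it."""
    rng = random.Random(seed)
    observed = r_squared(x, y)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)  # break the pairing between x and y
        if r_squared(x, y_perm) >= observed:
            hits += 1
    # Add-one correction keeps the estimate away from exactly zero.
    return observed, (hits + 1) / (n_perm + 1)

# Synthetic example: modification differences (x) and expression differences (y).
rng = random.Random(42)
x = [rng.gauss(0, 1) for _ in range(200)]
y = [0.5 * a + rng.gauss(0, 1) for a in x]
obs, p = permutation_pvalue(x, y)
print(f"R^2 = {obs:.3f}, permutation p = {p:.4f}")
```

With 1000 permutations, the smallest attainable p-value under the add-one correction is 1/1001, which is consistent with reporting P < 0.001.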
To assess the interplay of various regulatory mechanisms specific to the human lineage, we coupled TFs to human-specific expressed genes and found that the expression of six TFs was more strongly correlated, positively or negatively, with the expression of their corresponding target genes (one-sided Wilcoxon rank-sum test, BH-corrected P < 0.05; Pearson correlation coefficient, |r| > 0.6) than expected by chance (permutation test, P = 0.015; Additional file 2: Fig. S3C). Considering that a single TF can regulate multiple expressed genes simultaneously, one gene probably possesses multiple transcription factor binding sites (TFBS) and can recruit several TFs to control its expression. We then assessed the putative human-specific genes that were strongly correlated with TFs by comparing the absolute Pearson correlation coefficients between the expression profile of the gene and corresponding TFs to those calculated between the same gene and the same number of TFs that were randomly selected from all expressed genes using a one-sided Wilcoxon rank-sum test based on 1000 permutations. Under the criteria of BH-corrected P < 0.05 and absolute Pearson correlation coefficient > 0.6, four human-specific genes were estimated to strongly correlate with TFs (permutation test, P < 0.001; Additional file 2: Fig. S3C). Collectively, of 64 (8%) human-specific genes enriched with H3K4me3HP, eight expressed genes were also enriched with H3K27acHP, and only one gene was co-regulated by H3K4me3HP and TFs (Figs. 4D and 5C, D). Notably, our results indicated that H3K27acHP and TFs behaved in a mutually exclusive manner.
There is extensive evidence suggesting that one of the human-specific genes, CDCP1, which is collaboratively regulated by H3K4me3HP and a TF, may act as a novel regulator and promote the proliferation and migration of glioma, which is a primary, malignant, and aggressive brain tumor in adults. Interestingly, reduced expression of CDCP1, a pro-inflammatory cytokine, has been shown to be protective in autoimmune encephalomyelitis. The expression of transcription factor STAT2 was determined to be significantly correlated with the human-specific genes detected in our ssRNA-seq dataset, while the overexpression of its inhibitor PIAS2 has been reported to cause motor and cognitive impairments, predisposing one to sporadic Parkinson’s disease. Moreover, our results showed that nuclear transcription factor Y subunit alpha (NFYA), a key regulator of cell cycle progression, correlated strongly with human-specific genes. Interestingly, Yamanaka et al. proposed that the suppression of neuronal NFYA using gene deletion or knockdown strategies resulted in progressive neurodegeneration.
Taken together, these results support that the overwhelming majority of human-specific genes are predominantly regulated by human-specific cis-elements alone rather than the crosstalk of cis-elements and trans-factors.
Assessment of enzymatic intervention in histone modification changes
As histone is susceptible to control by a group of enzymes, we next explored whether histone-modifying divergence occurring in PFC across the three species was attributed to enzymatic activities. For this purpose, we collected 28 H3K4me3-modifying enzymes and two H3K27ac-modifying enzymes from HISTome2 together with the EpiFactors database (Additional file 1: Table S15). As a result, 22 and 27 enzymes were detected in the RNA-seq and the ssRNA-seq dataset, respectively (Fig. 6A). Specifically, CREBBP and EP300, which mediate the acetylation of histone H3 at “Lys-27,” were highly expressed in the macaque lineage compared to the other two primates, and further showed a significant overrepresentation in macaque-specific upregulated genes measured in either the RNA-seq dataset (Fisher’s exact test, P = 0.0127) or the ssRNA-seq dataset (Fisher’s exact test, P = 0.0083), which indicated that the enrichment of H3K27ac peaks unique to macaque presumably could be driven by upregulated H3K27ac-modifying enzymes. Notably, we observed that the expression profiles of enzymes showed a profoundly higher correlation with 5459 species-specific H3K4me3 peaks and 4404 H3K27ac peaks than that expected by chance (Figs. 6B, C; permutation test, P < 0.001). We also observed a significant excess of correlated H3K4me3- and H3K27ac-modifying enzymes than expected by chance (Fig. 6D; permutation test, P < 0.001). Cumulatively, these findings support that enzymatic expression, to some extent, facilitates the explanation of histone-modifying divergence among species.
The human brain and cognitive abilities develop in parallel throughout ontogenesis, resulting in a phenotype strikingly distinct from that of other primates. Previous studies have suggested that remodeling of the gene expression trajectory during brain development plays a crucial role in human cognitive evolution [10, 71,72,73]. However, the regulatory mechanisms that are responsible for gene expression changes during human brain development are unclear. It must be noted that histone modifications, which shape gene expression patterns, affect neuronal functions in both healthy and diseased brains. Thus, understanding the epigenetic regulatory mechanisms of gene expression changes during human brain development is essential to understanding the origin of cognitive evolution and development-related mental disorders. In this study, we mapped genome-wide H3K4me3 and H3K27ac in the PFC of humans and two non-human primates: chimpanzees, our closest living primate relatives, and rhesus macaques, a monkey species. In total, 25,929 H3K4me3 peaks and 14,617 H3K27ac peaks were consistently detected in at least one species. By grouping histone modification peaks into species-specific categories, approximately 2.8 and 3.3 times more species-specific H3K4me3 and H3K27ac peaks, respectively, were detected in humans than in the chimpanzee. Furthermore, both H3K4me3 and H3K27ac peaks with human-specific patterns are widely associated with genes related to cognitive function.
Functional analysis revealed that H3K4me3HP gain regions were enriched for genes involved in myelination assembly, neuronal ensheathment, and receptor clustering. They also showed a strong overlap with genes that were specifically expressed in oligodendrocytes. Oligodendrocytes, constituting 50–75% of the glial cells in the neocortex, provide metabolic and trophic support to axons and have been implicated in the cellular phase of AD [74, 75]. H3K4me3HP loss regions showed significant enrichment for CA1 and S1 pyramidal neurons, as well as gene categories of synaptic transmission and axonogenesis. Of note, the hippocampus, the origin of CA1 pyramidal neurons, is central to learning and memory functions and points to the acceleration of transcriptomic evolution in humans compared to other primates [76, 77]. Furthermore, a previous study demonstrated the simultaneous tight interconnection of CA1 and S1 pyramidal neurons by biclustering genes and cells, suggesting the orchestration of functionality. Interestingly, the H3K27acHP gain regions were revealed to be enriched in interneuron and oligodendrocyte markers. This result is notable because oligodendrocytes and axons have reciprocal communication, in which oligodendrocytes receive instructive signals from axons that direct their myelination, and subsequently shape axonal structure and conduction. Therefore, this finding highlights that oligodendrocytes provide indispensable support to neurons [79,80,81]. Interruption of GABAergic signaling to oligodendrocyte precursor cells has been shown to contribute to reduced myelination and hypoactivity of interneurons, as well as significant changes in cortical network activities and impaired social cognitive behavior. Moreover, a series of intensive early studies have reported disturbed oligodendrocyte-interneuron crosstalk in AD and SZ, suggesting a novel therapeutic target [83,84,85]. 
Parvalbumin interneuron hypomyelination, which is caused by impaired maturation of myelin-producing oligodendrocytes, is associated with cognitive inflexibility in these disorders. This notion is consistent with reduced levels of myelin- and oligodendrocyte-related proteins in AD and SZ brains, such as myelin basic protein (MBP), myelin proteolipid protein (PLP), cyclic nucleotide phosphohydrolase (CNP), myelin-associated glycoprotein (MAG), and myelin oligodendrocyte glycoprotein (MOG), indicating a loss of myelin. A pilot study showed a decreased level of contactin-associated protein, which mediates communication between oligodendrocytes and synapses, thus indicating defective oligodendrocyte-neuronal interactions in SZ. The H3K27acHP loss regions were also closely associated with synapse organization and activity, as well as with markers of hippocampal CA1 pyramidal neurons, which are central to both the acquisition and maintenance of memory. In the PFC of 48 participants with varying degrees of AD pathology, Mathys et al. identified 1115 differentially expressed genes (DEGs) in cells isolated from AD-pathology versus no-pathology individuals across six major brain cell types. This encouraged us to investigate how H3K4me3 and H3K27ac might be involved in transcriptional regulation in AD. We observed that H3K4me3HP loss regions were exclusively enriched in excitatory neurons (hypergeometric test, BH-corrected P = 3.658 × 10^−8), among which downregulated DEGs predominated, reaching up to 75% (Additional file 2: Fig. S4A). Both H3K4me3HP gain and H3K27acHP loss regions showed strong overrepresentation of oligodendrocyte markers (hypergeometric test, BH-corrected P = 5.64 × 10^−7 for H3K4me3HP gain and BH-corrected P = 2.698 × 10^−3 for H3K27acHP loss), which comprised 102 upregulated and 71 downregulated DEGs (Additional file 2: Figs. S4A and S4B). 
These results indicated that histone modification changes in human PFC might be involved in cognitive evolution in the human lineage and cognitive impairment in neurodegenerative disorders.
Based on both routine RNA-seq and ssRNA-seq, we estimated that approximately 7 and 2% of the genes with human-specific expression relative to non-human primates co-occurred with H3K4me3HP and H3K27acHP signals, respectively. As gene expression changes, especially remodeling of the gene expression trajectory during brain development, have been suggested to play an important role in human cognitive evolution, we speculate that histone modification changes in the human PFC could contribute to gene expression alterations and to the evolution of advanced cognition in humans.
Our work has several limitations. First, given that differences between the human and non-human H3K4me3 and H3K27ac landscapes are the major axis of epigenomic variation, future studies should include more samples; a larger cohort is expected to reveal more epigenomic loci unique to each primate and to avoid the loss of statistical power caused by a small sample size. Second, we used adult brains for cross-species comparisons, but human-specific signatures in the neuronal cortex are reported to be even more pronounced during pre- and perinatal development. Accordingly, it is reasonable to assume that younger brains may display changes at additional loci, or more pronounced alterations in the regulatory regions of some genes identified in this study. To address these limitations, we first re-analyzed histone modification data from published studies [24, 60] and confirmed that the proportions of human-specific expressed genes potentially attributable to changes in H3K4me3 and H3K27ac were generally robust. In addition, although only adult primates were used in this study, we used genes previously identified as changing their expression trajectory during development in the human PFC to test the potential influence of histone modification on developmentally remodeled genes. Genes with constitutive expression divergence, represented by type I and type II genes (showing constant expression across the lifespan or no developmental pattern changes among species), gave different results: both types were strongly enriched for H3K4me3HP, whereas H3K27acHP showed a significant overrepresentation of type II genes but no association with type I genes (Additional file 2: Fig. S2). By contrast, we found that genes that underwent developmental remodeling (type III) showed significant associations with H3K4me3HP and H3K27acHP, which accounted for approximately 12 and 5% of type III genes, respectively. 
This result suggests that histone modification changes in the human PFC affect not only genes whose expression levels differ among primates but also genes whose developmental expression patterns have been remodeled. Given the role played by developmentally remodeled genes in neurons and neuronal functions, it is conceivable that the observed human-specific histone modification changes in the PFC could underlie advanced cognitive abilities in the human brain.
Our findings consolidate the importance of evolutionary changes in cis-regulatory mechanisms that drive evolutionary changes in gene expression. The fact that many genes show human-specific expression signatures regulated by histone modification raises questions about the coordinated roles of cis-elements and trans-factors during primate genome evolution. To clarify this issue, we added 211 TFs and their predicted target genes to the analysis. Our results revealed that both TFs and epigenetic modifications correlated well with species-specific gene expression and could, to some extent, account for the gene expression differences between humans and chimpanzees, with explained variance ranging from 4 to 18.6%. These observations suggest that both TFs and epigenetic modifications associated with active transcription contribute to transcriptome evolution among primates. We also identified six TFs that showed a significantly high correlation with some of the human-specific expressed genes detected in the ssRNA-seq dataset: NFYA, SREBF1, HNF4A, HMGA1, MEIS1, and STAT2. Numerous studies have analyzed these TFs in the brain. SREBF1 has been reported to be an antipsychotic-activated TF controlling cholesterol biosynthesis, which is involved in the etiology of SZ. Morabito and colleagues found that variability in the SREBF1 motif was decreased in late-stage AD. They also revealed that SREBF1 expression was downregulated in AD oligodendrocytes, highlighting a new avenue for AD therapeutics. A network-based meta-analysis identified HNF4A, a TF associated with gluconeogenesis and diabetes, as a central longitudinally dynamic biomarker for Parkinson’s disease. Furthermore, a series of studies have proposed that the impaired expression of HMGA1 in glioma cells may be linked to gliogenesis and confer a survival benefit to mesenchymal glioblastoma stem-like cell (GSC) tumors [90,91,92]. 
Moreover, deficiency in the MEIS1 gene has been reported to explain some of the iron or dopamine changes associated with restless legs syndrome [93, 94]. Remarkably, our results suggest that human-specific genes tend to be regulated independently by TFs and epigenetic modification.
Regarding the drivers of this epigenetic evolution, we found that the expression profiles of histone-modifying enzymes correlated with species-specific H3K4me3 and H3K27ac epigenomes significantly better than expected by chance, suggesting a potential mechanism contributing to genome evolution. Two well-known enzymes associated with H3K27ac, CREBBP and EP300, were highly expressed in the macaque lineage compared to the other two primates and correlated remarkably well with macaque-specific upregulated genes. This result raises the possibility that the higher macaque-specific levels of H3K27ac may be driven by upregulated H3K27ac-modifying enzymes. Furthermore, several enzymes and enzyme families associated with H3K4me3 showed markedly higher levels in macaques than in the other two primates; these include, among others, the lysine methyltransferase (KMT, a.k.a. MLL) family and the lysine demethylase (KDM) family. The H3K4-specific methyltransferase MLL1 is essential for cortical and hippocampal development and may play an important role in the etiology of SZ [95,96,97]. It is therefore becoming evident that the epigenome could, to some extent, have evolved as a consequence of enzymatic divergence.
Our findings, based on a more comprehensive and detailed analysis of specific histone modification profiles, underscore the importance of epigenomic fine mapping for the human brain in determining genome regulation and provide important insights into the possible nature of molecular mechanisms underlying human cognitive evolution.
Collection of brain samples
We used the same cohort of PFC samples from the postmortem brains of three adult humans, three adult chimpanzees, and three rhesus macaques for ChIP-seq and strand-specific RNA-seq (Additional file 1: Table S1). Human samples were obtained from the NICHD Brain and Tissue Bank for Developmental Disorders at the University of Maryland (USA). All subjects were healthy, as defined by forensic pathologists at the tissue bank. Chimpanzee samples were obtained from the Anthropological Institute & Museum of the University of Zürich-Irchel (Switzerland) and the Biomedical Primate Research Centre (Netherlands). Rhesus macaque samples were obtained from the Suzhou Experimental Animal Center (China). All non-human primates used in this study died suddenly of causes unrelated to their participation in this study or to the tissues collected. The PFC samples were dissected from the anterior part of the superior frontal gyrus, corresponding to Brodmann area 10. All tissues were snap-frozen after dissection and stored at −80 °C.
Chromatin immunoprecipitation sequencing (ChIP-seq)
ChIP was carried out as described previously, with some modifications. Briefly, 0.15 g of PFC tissue was ground into powder on dry ice and crosslinked in 1% formaldehyde for 10 min at room temperature. The reaction was then quenched by adding glycine, and the tissue was further homogenized in a glass Dounce homogenizer. Cell pellets were lysed successively in Farnham lysis buffer (5 mM PIPES pH 8.0, 85 mM KCl, 0.5% NP-40) and lysis buffer (1% SDS, 10 mM EDTA, 50 mM Tris–HCl, pH 8.0) on ice. Nuclei were sonicated on a Misonix Q700 at 15% power with 90 pulses of 5 s each, separated by 10-s rest periods. The supernatant was diluted in dilution buffer (1.1% Triton X-100, 0.25% sodium deoxycholate, 1.2 mM EDTA, 167 mM NaCl, 16.7 mM Tris–HCl, pH 8.0) and transferred to Protein A/G magnetic beads that had been pre-incubated with H3K4me3 (Millipore, 07-473) or H3K27ac (Active Motif, 39133) antibody in PBS with 0.5% BSA at 4 °C for at least 4 h; the mixture was then rotated overnight at 4 °C. The beads were washed sequentially with low-salt wash buffer (0.1% SDS, 1% Triton X-100, 0.25% sodium deoxycholate, 1 mM EDTA, 150 mM NaCl, 50 mM Tris–HCl, pH 8.0), high-salt wash buffer (0.1% SDS, 1% Triton X-100, 0.25% sodium deoxycholate, 1 mM EDTA, 500 mM NaCl, 50 mM Tris–HCl, pH 8.0), LiCl wash buffer (500 mM LiCl, 1% NP-40, 1% sodium deoxycholate, 100 mM Tris–HCl, pH 8.0), and TE buffer, and then eluted with elution buffer (1% SDS, 0.1 M NaHCO3) at 65 °C. After NaCl was added to a final concentration of 200 mM, the chromatin in the supernatant was reverse crosslinked overnight at 65 °C and then treated with RNase A (final concentration of 0.1 μg/μl) for 0.5 h at 37 °C and proteinase K (final concentration of 0.4 μg/μl) for 1 h at 55 °C. DNA was extracted with the QIAquick MinElute PCR Purification Kit (Qiagen, 28104). Genomic DNA without immunoprecipitation was used as an input control. Sequencing libraries were prepared using an Illumina TruSeq ChIP Sample Prep Kit (Illumina, USA). 
The pooled libraries were sequenced on an Illumina HiSeq 2000 platform using the 100-bp single-end sequencing protocol.
Strand-specific RNA sequencing (ssRNA-seq)
Total RNA was isolated using TRIzol (Invitrogen, USA) according to the manufacturer’s instructions. RNA quality was assessed with an Agilent 2100 Bioanalyzer. Samples with RNA Integrity Number (RIN) values > 7.5 were selected (Additional file 1: Table S1). Total RNA (1 μg) from the same PFC sample used for ChIP-seq was used to construct the sequencing library following the TruSeq Stranded mRNA Sample Prep (Illumina, USA) protocol. The libraries were pooled and sequenced on an Illumina HiSeq 4000 platform in 150-bp single-end mode.
Consensus genome and gene annotation construction
To directly compare histone modifications and transcriptomes among humans, chimpanzees, and rhesus macaques, a consensus genome and a consensus gene annotation for the three species were constructed. The consensus genome was constructed as described in [59, 99]. In detail, the pairwise genome alignment files of the human (hg19) and chimpanzee (panTro4) genomes and of the human (hg19) and rhesus macaque (rheMac3) genomes, aligned by BLASTZ, were downloaded from the UCSC genome browser (https://genome.ucsc.edu). Based on these alignment files, a multiple genome alignment of the three species was constructed using the multi-alignment tool. A human–chimpanzee–macaque consensus genome was then constructed by replacing all discordant sites in the human genome, including mismatches and insertions/deletions, with “N” according to the three-species alignment. The consensus gene annotation was constructed as described previously. Human gene annotation was downloaded from GENCODE (v21; https://www.gencodegenes.org). Chimpanzee and rhesus macaque coordinates, based on the genome versions panTro4 and rheMac3, respectively, were derived from the human gene annotation using the LiftOver tool (http://www.genome.ucsc.edu/cgi-bin/hgLiftOver). Gene annotations that retained the same exon order and changed by no more than 50% during LiftOver were preserved. The intersection of regions with coordinates mapped on hg19 was subsequently merged and used for further analysis.
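The masking step described above can be illustrated with a minimal Python sketch. It assumes the three-species alignment has already been reduced to equal-length rows for one alignment block, with `-` marking gaps. The function name `mask_discordant_sites` and the decision to drop columns where the human row is a gap (so that the output stays in human coordinates) are illustrative assumptions, not details of the pipeline used in this study.

```python
def mask_discordant_sites(human, chimp, macaque, gap="-"):
    """Return the human row with every discordant alignment column set to 'N'.

    The three arguments are rows of one alignment block (equal length).
    Assumption: columns where the human row is a gap are dropped, because
    they have no human coordinate to carry an 'N'.
    """
    if not (len(human) == len(chimp) == len(macaque)):
        raise ValueError("alignment rows must have equal length")
    consensus = []
    for h, c, m in zip(human.upper(), chimp.upper(), macaque.upper()):
        if h == gap:
            # Insertion relative to human: nothing to keep in human coordinates.
            continue
        if h == c == m:
            consensus.append(h)
        else:
            # Mismatch, or a deletion in chimpanzee or macaque.
            consensus.append("N")
    return "".join(consensus)


# Column 3 is a mismatch/deletion and column 5 is an insertion relative to human.
print(mask_discordant_sites("ACGT-A", "ACCTTA", "AC-TTA"))
```

Masking rather than removing discordant sites keeps hg19 coordinates intact, so annotations on the human reference can still be applied to the consensus.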
ChIP-seq data preprocessing
The ChIP-seq raw reads were aligned to the consensus genome described above using Bowtie 2 (version 2.2.5) in the “--very-sensitive-local” mode. Next, duplicated reads were removed using Picard tools (version 1.117; https://broadinstitute.github.io/picard/). Only uniquely mapped reads were used for peak calling. Genome-wide significantly enriched regions with an FDR < 0.01 for H3K4me3 and H3K27ac (termed H3K4me3/H3K27ac raw peaks for short) were identified by comparing the ChIP-seq samples to the corresponding input samples using the MACS peak caller (version 1.4.2). The positional overlap of raw peaks between each pair of samples was estimated with BEDOPS. Raw peaks detected in all three samples of a species were first merged to define the peaks for humans, chimpanzees, and macaques (termed H3K4me3/H3K27ac peaks for short) and then compared among species to divide them into peaks shared by three species, peaks shared by two species, and peaks unique to one species. Peaks that overlapped in genomic location were merged for further analysis. The R package ChIPseeker was used to annotate peak locations with the closest genes and genomic regions based on the human reference genome (hg19). The region within 3000 bp of known TSSs was defined as the promoter region. The overlap between the H3K4me3/H3K27ac peaks and the promoter region was also estimated with BEDOPS. The read counts for each merged peak were calculated with coverageBed in bedtools. Peak intensities were calculated as read coverage normalized by peak length and by the total number of reads mapped to peaks in a sample.
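The peak intensity normalization can be sketched as follows. This is a minimal Python illustration, assuming an RPKM-like form: the text states only that coverage is normalized by peak length and by total reads in peaks, so the per-kilobase and per-million scaling constants and the function name `peak_intensity` are assumptions.

```python
def peak_intensity(read_counts, peak_lengths_bp):
    """Normalize per-peak read counts by peak length and by reads in peaks.

    Assumption: lengths are scaled per kilobase and library size per million
    reads; the source gives the normalizing quantities but not the constants.
    """
    if len(read_counts) != len(peak_lengths_bp):
        raise ValueError("one length is required per peak")
    total_reads_in_peaks = sum(read_counts)
    if total_reads_in_peaks == 0:
        raise ValueError("sample has no reads in peaks")
    per_million = total_reads_in_peaks / 1_000_000
    return [
        count / (length / 1_000) / per_million
        for count, length in zip(read_counts, peak_lengths_bp)
    ]


# Two peaks with the same read density get the same intensity,
# although the second peak is three times longer.
print(peak_intensity([500, 1500], [1000, 3000]))
```

Dividing by reads in peaks rather than by all mapped reads makes samples with different signal-to-noise ratios more comparable, at the cost of assuming that the total signal in peaks is similar across samples.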
Strand-specific RNA-seq data preprocessing
RNA-seq raw reads were first trimmed with the ea-utils tool (version 1.1.2) and then aligned to the consensus genome described above with STAR (version 2.4.2) using the default parameters. Uniquely mapped reads, with duplicates removed using Picard tools (version 1.117), were retained for further analysis. The read counts for each consensus annotation were calculated with coverageBed in bedtools. Gene expression levels were calculated as read coverage normalized by gene length and by the total number of reads mapped to genes in a sample. Only genes with read counts > 0 in at least one species were considered for further analysis.
Hierarchical clustering and PCA
Pairwise Pearson correlations based on the peak intensity between samples and the proportion of overlapping peaks calculated by BEDOPS as described above were used for hierarchical clustering. Heatmaps were created using the heatmap.2 function from the R package “gplots”. For PCA, the prcomp function in R based on peak intensity was applied with default parameters.
Identification of species-specific histone peaks and expressed genes
To identify species-specific histone peaks or expressed genes, differential analysis between every pair of species was performed separately on the ChIP-seq, RNA-seq, and ssRNA-seq data using edgeR v3.36.0. For each comparison, normalization factors were computed using the calcNormFactors function, which employs the trimmed mean of M-values (TMM) method; tagwise dispersions were then estimated, and the read count matrix was fitted with a quasi-likelihood negative binomial generalized log-linear model (glmQLFit) using species as the covariate. P values were obtained using glmQLFTest. Multiple-testing correction was performed by applying the Benjamini-Hochberg (BH) method to the P values to control the FDR. The average peak coverage or gene expression was calculated for each species, and the fold change was defined as the ratio of the average values between two species. Only peaks with FDR < 0.05 and fold change > 1.2 were considered significant. For expressed genes, the fold change threshold was set at 1.5. If a peak or gene showed no significant difference between chimpanzees and macaques but showed a significant difference between humans and each of the other two primate species, it was classified as a human-specific peak (referred to as H3K4me3HP and H3K27acHP) or gene. Chimpanzee- and macaque-specific peaks or genes were defined by analogous criteria.
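The BH adjustment and the classification rule can be sketched in plain Python. This is only an illustration of the decision logic, not of the edgeR model fitting. The function names are hypothetical, and treating the fold change symmetrically (a ratio below 1 counts by its reciprocal) without checking that both human comparisons point in the same direction is an assumption.

```python
def benjamini_hochberg(pvalues):
    """Return BH-adjusted p-values in the same order as the input."""
    n = len(pvalues)
    order = sorted(range(n), key=lambda i: pvalues[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def is_significant(fdr, fold_change, fdr_cutoff=0.05, fc_cutoff=1.2):
    """Apply the FDR and fold-change thresholds to one pairwise comparison."""
    if fold_change <= 0:
        raise ValueError("fold change must be a positive ratio")
    # Assumption: increases and decreases are treated symmetrically.
    ratio = max(fold_change, 1.0 / fold_change)
    return fdr < fdr_cutoff and ratio > fc_cutoff


def is_human_specific(human_vs_chimp, human_vs_macaque, chimp_vs_macaque,
                      fc_cutoff=1.2):
    """Each argument is an (fdr, fold_change) pair for one comparison.

    Use fc_cutoff=1.2 for peaks and fc_cutoff=1.5 for expressed genes.
    """
    return (
        is_significant(*human_vs_chimp, fc_cutoff=fc_cutoff)
        and is_significant(*human_vs_macaque, fc_cutoff=fc_cutoff)
        and not is_significant(*chimp_vs_macaque, fc_cutoff=fc_cutoff)
    )


print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
print(is_human_specific((0.01, 1.5), (0.02, 1.4), (0.5, 1.05)))
```

Chimpanzee- and macaque-specific features would follow from the same rule with the roles of the species exchanged.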
Functionality enrichment analysis and cell type specificity
Gene Ontology (GO) terms associated with H3K4me3HP- and H3K27acHP-enriched genes were determined with the Bioconductor package “clusterProfiler”, based on the hypergeometric distribution. The background gene set was downloaded from the Allen Brain Atlas (https://human.brain-map.org) data portal. GO biological process terms with P < 0.05 after BH correction were considered significantly enriched. Cell type specificity analysis was performed as described previously. Briefly, marker genes for nine major cell types were determined in the mouse brain using single-cell RNA-seq. The cell types comprised neuronal subtypes, including S1 pyramidal neurons, CA1 pyramidal neurons, and interneurons, as well as non-neuronal cell types, including astrocytes, oligodendrocytes, endothelial cells, ependymal cells, mural cells, and microglia. A total of 19,282 human-mouse one-to-one orthologs were downloaded from Ensembl (https://asia.ensembl.org). For each cell type, a hypergeometric test was applied to assess the overrepresentation of marker genes among H3K4me3HP- and H3K27acHP-enriched genes, using all mouse genes as the background. A BH-corrected P < 0.05 was used as the enrichment cutoff.
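The hypergeometric overrepresentation test used here can be written out directly. The sketch below is a minimal Python version using exact integer arithmetic from the standard library (`math.comb`, Python 3.8 or later); the function and parameter names are illustrative, and the gene counts in the example are invented.

```python
from math import comb


def hypergeom_enrichment_p(n_background, n_markers, n_selected, n_overlap):
    """Upper-tail hypergeometric P value, P(X >= n_overlap).

    n_background: genes in the background set
    n_markers:    marker genes of one cell type within the background
    n_selected:   genes in the tested list (e.g. peak-associated genes)
    n_overlap:    marker genes observed in the tested list
    """
    upper = min(n_markers, n_selected)
    total = comb(n_background, n_selected)
    tail = sum(
        comb(n_markers, k) * comb(n_background - n_markers, n_selected - k)
        for k in range(max(n_overlap, 0), upper + 1)
    )
    return tail / total


# Invented toy numbers: 10 background genes, 4 markers, 3 selected, 2 overlap.
print(hypergeom_enrichment_p(10, 4, 3, 2))
```

The resulting P values for all cell types would then be BH-corrected before applying the 0.05 cutoff.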
PPI network construction
The PPI networks of H3K4me3HP- and H3K27acHP-enriched genes were constructed using the STRING (Search Tool for the Retrieval of Interacting Genes/Proteins) database (https://cn.string-db.org), which provides a critical assessment and integration of protein–protein interactions, including physical or functional associations. The threshold of the PPI score was set as 0.7 to obtain interactions with higher confidence. As nodes with a higher degree of connectivity make larger contributions to the stability of the network, genes with connectivity degrees > 10 were defined as hub genes using the CentiScaPe plugin. The PPI network was visualized by Cytoscape v3.8.2 (https://cytoscape.org).
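The hub gene definition amounts to filtering edges by score and counting distinct neighbours. The Python sketch below shows that logic on an edge list of `(gene_a, gene_b, score)` tuples; it does not use STRING or CentiScaPe, the gene names are invented, and treating the 0.7 threshold as inclusive is an assumption.

```python
from collections import defaultdict


def find_hub_genes(edges, min_score=0.7, min_degree=10):
    """Return {gene: degree} for genes with more than min_degree partners.

    Assumption: an edge is kept when score >= min_score; self-loops and
    duplicate edges do not add to the degree.
    """
    neighbours = defaultdict(set)
    for gene_a, gene_b, score in edges:
        if gene_a != gene_b and score >= min_score:
            neighbours[gene_a].add(gene_b)
            neighbours[gene_b].add(gene_a)
    return {
        gene: len(partners)
        for gene, partners in neighbours.items()
        if len(partners) > min_degree
    }


# Invented network: HUB has 11 high-confidence partners and 1 weak edge.
toy_edges = [("HUB", f"P{i}", 0.9) for i in range(11)]
toy_edges += [("HUB", "LOW", 0.4), ("P0", "P1", 0.8)]
print(find_hub_genes(toy_edges))
```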
Association between regulatory mechanisms and species-specific genes
Pearson correlation coefficients were calculated between three regulatory mechanisms (TF, H3K4me3, and H3K27ac) and their corresponding target genes specific to the three species. For each regulatory mechanism, the correlation expected by chance was assessed by randomly sampling the same number of non-species-specific genes and their coupled regulators, a procedure that was repeated 100 times.
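The chance expectation described above can be sketched as repeated random sampling from a background set. In this minimal Python illustration each item is a `(regulator_profile, gene_profile)` pair of per-sample values. Summarizing each draw by its mean Pearson coefficient is an assumption, as are the function names and the fixed random seed.

```python
import random
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length profiles."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("profiles must have equal length of at least 2")
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant profile")
    return sxy / sqrt(sxx * syy)


def mean_correlation(pairs):
    """Average Pearson r over (regulator_profile, gene_profile) pairs."""
    return mean(pearson(regulator, gene) for regulator, gene in pairs)


def chance_expectation(background_pairs, sample_size, n_repeats=100, seed=0):
    """Mean correlation for repeated random draws of background pairs.

    background_pairs: pairs built from non-species-specific genes
    sample_size:      number of species-specific genes being matched
    """
    rng = random.Random(seed)
    return [
        mean_correlation(rng.sample(background_pairs, sample_size))
        for _ in range(n_repeats)
    ]
```

The observed mean correlation for species-specific genes can then be compared with the distribution returned by `chance_expectation`, for example by counting how many of the random draws reach or exceed it.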
Relationship between regulatory mechanisms and human-chimpanzee differences
The association between human-chimpanzee gene expression differences and the three regulatory mechanisms was estimated as described previously. In detail, log2 fold changes in gene expression and the coupled TF expression differences or coupled H3K4me3/H3K27ac coverage differences were fitted with a linear regression model, and the explained variance was inferred using the anova function in R. Only human- and chimpanzee-specific genes detected in the ssRNA-seq dataset were considered. In particular, the significance of the correlation between TFs and their target genes was assessed with the cor.test function in R. TFs with absolute Pearson correlation coefficients > 0.6 and P < 0.05 were used in the above analysis. Permutation was performed by randomly coupling genes to regulators 1000 times to estimate the significance of these associations.
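For a single regulator, the explained variance from such a regression is the fraction of the total sum of squares captured by the fitted line. The Python sketch below shows this for one predictor using ordinary least squares. It is a simplified stand-in for the R `lm`/`anova` workflow, with hypothetical function names and no pseudocount in the fold change.

```python
from math import log2


def log2_fold_change(value_a, value_b):
    """log2 ratio of two positive averages (no pseudocount is added)."""
    return log2(value_a / value_b)


def fit_and_explained_variance(x, y):
    """Least-squares fit of y on x; returns (slope, intercept, r_squared).

    x: log2 fold changes of the regulator (TF expression or peak coverage)
    y: log2 fold changes of the coupled target genes
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have equal length of at least 2")
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must both vary")
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    residual_ss = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    return slope, intercept, 1.0 - residual_ss / syy


print(fit_and_explained_variance([0, 1, 2, 3], [0, 1, 0, 1]))
```

With several predictors in one model, the per-term sums of squares reported by an ANOVA table play the same role as the single explained fraction here.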
Putative TFs regulating human-specific genes
TFs significantly correlated with their human-specific target genes were identified from the absolute Pearson correlation coefficients between the expression profiles of TFs and human-specific target genes detected in the ssRNA-seq dataset. Significance was assessed by comparing the absolute Pearson correlation coefficients calculated for human-specific genes to those calculated between the same TF and its non-human-specific target genes using a one-sided Wilcoxon rank-sum test. TFs with BH-corrected P < 0.05 and absolute Pearson correlation coefficients > 0.6 were considered significantly correlated. The number of correlated TFs expected by chance was estimated by shuffling the species labels of genes 1000 times.
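The one-sided rank-sum comparison can be sketched in standard-library Python. This version uses the normal approximation with tie correction and no continuity correction, which is an assumption: statistical packages may instead use exact P values for small samples or apply a continuity correction, so the numbers can differ slightly. The function name and example values are illustrative.

```python
from math import erfc, sqrt


def rank_sum_greater(x, y):
    """One-sided Wilcoxon rank-sum test, H1: values in x tend to exceed y.

    Returns (U, p_value), where U is the Mann-Whitney statistic for x.
    """
    n1, n2 = len(x), len(y)
    if n1 == 0 or n2 == 0:
        raise ValueError("both samples must be non-empty")
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n = n1 + n2
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        # Tied values occupy ranks i+1 .. j and share their average rank.
        average_rank = (i + 1 + j) / 2
        tied = j - i
        tie_term += tied ** 3 - tied
        rank_sum_x += average_rank * sum(
            1 for k in range(i, j) if pooled[k][1] == 0
        )
        i = j
    u = rank_sum_x - n1 * (n1 + 1) / 2
    mean_u = n1 * n2 / 2
    var_u = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u <= 0:
        return u, 1.0  # all values tied: no evidence either way
    z = (u - mean_u) / sqrt(var_u)
    return u, 0.5 * erfc(z / sqrt(2))


# Invented |r| values: human-specific targets versus other targets of one TF.
print(rank_sum_greater([0.8, 0.9, 0.7, 0.85], [0.1, 0.2, 0.3, 0.4]))
```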
Putative human-specific genes regulated by TFs
For each human-specific gene, correlation with its corresponding TFs was assessed using the absolute Pearson correlation coefficients between the expression profile of the gene and those of the TFs. To estimate the significance of the correlation, we randomly chose, from all expressed genes, the same number of genes as there were TFs and calculated the absolute Pearson correlation coefficients between the expression profile of the same gene and those of the randomly chosen genes. A one-sided Wilcoxon rank-sum test was used to determine whether the correlation was significantly higher for TFs. Genes with a BH-corrected P < 0.05 and absolute Pearson correlation coefficients > 0.6 were considered significantly correlated. The number of correlated human-specific genes expected by chance was assessed by shuffling the species labels of TFs 1000 times.
Histone-modifying enzymes were retrieved from the HISTome2 database (http://www.actrec.gov.in/histome2/Human/index.php), a knowledgebase of histone proteins, post-translational modifications, and histone-modifying enzymes for multiple organisms with epidrugs. A panel of epigenetic regulators, as well as their targets, were also characterized according to the manually curated EpiFactors database (http://epifactors.autosome.ru), which provides a wide range of information about human proteins and complexes involved in epigenetic regulation.
Association between enzymes and species-specific histone peaks
The Pearson correlation between the expression of enzymes in the ssRNA-seq dataset and the species-specific H3K4me3 and H3K27ac peaks was calculated using the profiles of all three species. The correlation coefficients expected by chance were estimated by randomly sampling the same number of non-species-specific histone peaks, a procedure that was repeated 100 times.
Putative enzymes mediating species-specific histone peaks
For every enzyme, the correlation between the enzyme and the species-specific histone peaks was compared to the correlation between the same enzyme and the same number of non-species-specific histone peaks using a one-sided Wilcoxon test. Enzymes with P < 0.01 were deemed to correlate well with species-specific histone peaks. The average value based on 1000 permutations was defined as the number of significantly correlated enzymes. Furthermore, the number of correlated enzymes expected by chance was calculated from 1000 random samplings of nonenzymatic genes and the same number of non-species-specific histone peaks. Note that the test was conducted using all three species together.
Availability of data and materials
All data generated or analyzed during this study are included in this published article, its supplementary information files (Additional file 1), and publicly available repositories. Raw data are deposited in the National Omics Data Encyclopedia (NODE; http://www.biosino.org/node) and are available through NODE accession number OEP003561 at https://www.biosino.org/node/project/detail/OEP003561. The ChIP-seq datasets have accession number OEX020247 (https://www.biosino.org/node/experiment/detail/OEX020247), and matched strand-specific RNA-seq datasets have accession number OEX020246 (https://www.biosino.org/node/experiment/detail/OEX020246).
Abbreviations
ChIP-seq: Chromatin immunoprecipitation sequencing
ssRNA-seq: Strand-specific RNA sequencing
H3K4me3: Histone H3 lysine 4 trimethylation
H3K27ac: Histone H3 lysine 27 acetylation
H3K4me3HP: Human-specific H3K4me3 peaks
H3K27acHP: Human-specific H3K27ac peaks
TSS: Transcription start site
TFBS: Transcription factor binding site
FDR: False discovery rate
RIN: RNA Integrity Number
Preuss TM. The human brain: rewired and running hot. Ann N Y Acad Sci. 2011;1225 Suppl 1(Suppl 1):E182-191.
Roth G, Dicke U. Evolution of the brain and intelligence in primates. Prog Brain Res. 2012;195:413–30.
Laland K, Seed A. Understanding human cognitive uniqueness. Annu Rev Psychol. 2021;72:689–716.
Waterson RH, Lander ES, Wilson RK, The Chimpanzee Sequencing and Analysis Consortium. Initial sequence of the chimpanzee genome and comparison with the human genome. Nature. 2005;437(7055):69–87.
Kronenberg ZN, Fiddes IT, Gordon D, Murali S, Cantsilieris S, Meyerson OS, Underwood JG, Nelson BJ, Chaisson MJP, Dougherty ML, et al. High-resolution comparative analysis of great ape genomes. Science. 2018;360(6393):eaar6343.
Langergraber KE, Prüfer K, Rowney C, Boesch C, Crockford C, Fawcett K, Inoue E, Inoue-Muruyama M, Mitani JC, Muller MN, et al. Generation times in wild chimpanzees and gorillas suggest earlier divergence times in great ape and human evolution. Proc Natl Acad Sci U S A. 2012;109(39):15716–21.
McLean CY, Reno PL, Pollen AA, Bassan AI, Capellini TD, Guenther C, Indjeian VB, Lim X, Menke DB, Schaar BT, et al. Human-specific loss of regulatory DNA and the evolution of human-specific traits. Nature. 2011;471(7337):216–9.
He Z, Han D, Efimova O, Guijarro P, Yu Q, Oleksiak A, Jiang S, Anokhin K, Velichkovsky B, Grünewald S, et al. Comprehensive transcriptome analysis of neocortical layers in humans, chimpanzees and macaques. Nat Neurosci. 2017;20(6):886–95.
Zhang X, Fang B, Huang Y-F. Transcription factor binding sites are frequently under accelerated evolution in primates. Nat Commun. 2023;14(1):783.
Cáceres M, Lachuer J, Zapala MA, Redmond JC, Kudo L, Geschwind DH, Lockhart DJ, Preuss TM, Barlow C. Elevated gene expression levels distinguish human from non-human primate brains. Proc Natl Acad Sci U S A. 2003;100(22):13030–5.
Khaitovich P, Hellmann I, Enard W, Nowick K, Leinweber M, Franz H, Weiss G, Lachmann M, Paabo S. Parallel patterns of evolution in the genomes and transcriptomes of humans and chimpanzees. Science. 2005;309(5742):1850–4.
Somel M, Liu X, Tang L, Yan Z, Hu H, Guo S, Jiang X, Zhang X, Xu G, Xie G, et al. MicroRNA-driven developmental remodeling in the brain distinguishes humans from other primates. PLoS Biol. 2011;9(12): e1001214.
Johnson MH. Functional brain development in humans. Nat Rev Neurosci. 2001;2(7):475–83.
Warneken F, Tomasello M. Extrinsic rewards undermine altruistic tendencies in 20-month-olds. Dev Psychol. 2008;44(6):1785–8.
Somel M, Guo S, Fu N, Yan Z, Hu HY, Xu Y, Yuan Y, Ning Z, Hu Y, Menzel C, et al. MicroRNA, mRNA, and protein expression link development and aging in human and macaque brain. Genome Res. 2010;20(9):1207–18.
Liu X, Somel M, Tang L, Yan Z, Jiang X, Guo S, Yuan Y, He L, Oleksiak A, Zhang Y, et al. Extension of cortical synaptic development distinguishes humans from chimpanzees and macaques. Genome Res. 2012;22(4):611–22.
Volpe JJ. Overview: normal and abnormal human brain development. Ment Retard Dev Disabil Res Rev. 2000;6(1):1–5.
Chelly J, Mandel J-L. Monogenic causes of X-linked mental retardation. Nat Rev Genet. 2001;2(9):669–80.
Haygood R, Fedrigo O, Hanson B, Yokoyama K-D, Wray GA. Promoter regions of many neural- and nutrition-related genes have experienced positive selection during human evolution. Nat Genet. 2007;39(9):1140–4.
Prabhakar S, Noonan JP, Pääbo S, Rubin EM. Accelerated evolution of conserved noncoding sequences in humans. Science. 2006;314(5800):786.
Davis KL, Panksepp J, Normansell L. The affective neuroscience personality scales: Normative data and implications. Neuropsychoanalysis. 2003;5(1):57–69.
Flavell SW, Cowan CW, Kim TK, Greer PL, Lin Y, Paradis S, Griffith EC, Hu LS, Chen C, Greenberg ME. Activity-dependent regulation of MEF2 transcription factors suppresses excitatory synapse number. Science. 2006;311(5763):1008–12.
Li B, Woo R-S, Mei L, Malinow R. The Neuregulin-1 Receptor ErbB4 Controls Glutamatergic Synapse Maturation and Plasticity. Neuron. 2007;54(4):583–97.
Shulha HP, Crisci JL, Reshetov D, Tushir JS, Cheung I, Bharadwaj R, Chou HJ, Houston IB, Peter CJ, Mitchell AC, et al. Human-specific histone methylation signatures at transcription start sites in prefrontal neurons. PLoS Biol. 2012;10(11):e1001427.
Berto S, Nowick K. Species-specific changes in a primate transcription factor network provide insights into the molecular evolution of the primate prefrontal cortex. Genome Biol Evol. 2018;10(8):2023–36.
Xu C, Li Q, Efimova O, He L, Tatsumoto S, Stepanova V, Oishi T, Udono T, Yamaguchi K, Shigenobu S, et al. Human-specific features of spatial gene expression and regulation in eight brain regions. Genome Res. 2018;28(8):1097–110.
Somel M, Liu X, Khaitovich P. Human brain evolution: transcripts, metabolites and their regulators. Nat Rev Neurosci. 2013;14(2):112–27.
Dong X, Weng Z. The correlation between histone modifications and gene expression. Epigenomics. 2013;5(2):113–6.
Pusarla RH, Bhargava P. Histones in functional diversification. Core histone variants. FEBS J. 2005;272(20):5149–68.
Talbert PB, Henikoff S. Histone variants–ancient wrap artists of the epigenome. Nat Rev Mol Cell Biol. 2010;11(4):264–75.
García-Pérez R, Esteller-Cucala P, Mas G, Lobón I, Di Carlo V, Riera M, Kuhlwilm M, Navarro A, Blancher A, Di Croce L, et al. Epigenomic profiling of primate lymphoblastoid cell lines reveals the evolutionary patterns of epigenetic activities in gene regulatory architectures. Nat Commun. 2021;12(1):3116.
Gräff J, Kim D, Dobbin MM, Tsai LH. Epigenetic regulation of gene expression in physiological and pathological brain processes. Physiol Rev. 2011;91(2):603–49.
Gräff J, Rei D, Guan JS, Wang WY, Seo J, Hennig KM, Nieland TJ, Fass DM, Kao PF, Kahn M, et al. An epigenetic blockade of cognitive functions in the neurodegenerating brain. Nature. 2012;483(7388):222–6.
Boyd-Kirkup JD, Green CD, Wu G, Wang D, Han JD. Epigenomics and the regulation of aging. Epigenomics. 2013;5(2):205–27.
Lister R, Mukamel EA, Nery JR, Urich M, Puddifoot CA, Johnson ND, Lucero J, Huang Y, Dwork AJ, Schultz MD, et al. Global epigenomic reconfiguration during mammalian brain development. Science. 2013;341(6146):1237905.
Cheung I, Shulha HP, Jiang Y, Matevossian A, Wang J, Weng Z, Akbarian S. Developmental regulation and individual differences of neuronal H3K4me3 epigenomes in the prefrontal cortex. Proc Natl Acad Sci U S A. 2010;107(19):8824–9.
Han Y, Han D, Yan Z, Boyd-Kirkup JD, Green CD, Khaitovich P, Han JDJ. Stress-associated H3K4 methylation accumulates during postnatal development and aging of rhesus macaque brain. Aging Cell. 2012;11(6):1055–64.
Liu K, Ma W, Li C, Li J, Zhang X, Liu J, Liu W, Wu Z, Zang C, Liang Y, et al. Advances in transcription factors related to neuroglial cell reprogramming. Transl Neurosci. 2020;11(1):17–27.
Amador-Arjona A, Cimadamore F, Huang C-T, Wright R, Lewis S, Gage FH, Terskikh AV. SOX2 primes the epigenetic landscape in neural precursors enabling proper gene activation during hippocampal neurogenesis. Proc Natl Acad Sci U S A. 2015;112(15):E1936–45.
Ahlenius H, Chanda S, Webb AE, Yousif I, Karmazin J, Prusiner SB, Brunet A, Südhof TC, Wernig M. FoxO3 regulates neuronal reprogramming of cells from postnatal and aging mice. Proc Natl Acad Sci U S A. 2016;113(30):8514–9.
Sock E, Wegner M. Transcriptional control of myelination and remyelination. Glia. 2019;67(11):2153–65.
Tiwari N, Pataskar A, Peron S, Thakurela S, Sahu SK, Figueres-Onate M, Marichal N, Lopez-Mascaraque L, Tiwari VK, Berninger B. Stage-Specific Transcription Factors Drive Astrogliogenesis by Remodeling Gene Regulatory Landscapes. Cell Stem Cell. 2018;23(4):557-571 e558.
André E, Becker-André M. Expression of an N-Terminally Truncated Form of Human Focal Adhesion Kinase in Brain. Biochem Biophys Res Commun. 1993;190(1):140–7.
Cheng R, Liang X, Zhao Q, Lian Z, Tang L, Qiu C, Chen H, Zhang P. APC(Cdh1) controls cell cycle entry during liver regeneration. Exp Cell Res. 2017;354(2):78–84.
Enserink JM, Kolodner RD. An overview of Cdk1-controlled targets and processes. Cell Div. 2010;5:11.
Thomas GM, Huganir RL. MAPK cascade signalling and synaptic plasticity. Nat Rev Neurosci. 2004;5(3):173–83.
Qin S, Zhao X, Pan Y, Liu J, Feng G, Fu J, Bao J, Zhang Z, He L. An association study of the N-methyl-D-aspartate receptor NR1 subunit gene (GRIN1) and NR2B subunit gene (GRIN2B) in schizophrenia with universal DNA microarray. Eur J Hum Genet. 2005;13(7):807–14.
Aruffo A, Stamenkovic I, Melnick M, Underhill CB, Seed B. CD44 is the principal cell surface receptor for hyaluronate. Cell. 1990;61(7):1303–13.
Asanuma K, Yanagida-Asanuma E, Faul C, Tomino Y, Kim K, Mundel P. Synaptopodin orchestrates actin organization and cell motility via regulation of RhoA signalling. Nat Cell Biol. 2006;8(5):485–91.
Jiang L, Dai Y, Liu X, Wang C, Wang A, Chen Z, Heidbreder CE, Kolokythas A, Zhou X. Identification and experimental validation of G protein alpha inhibiting activity polypeptide 2 (GNAI2) as a microRNA-138 target in tongue squamous cell carcinoma. Hum Genet. 2011;129(2):189–97.
Turnham RE, Scott JD. Protein kinase A catalytic subunit isoform PRKACA; History, function and physiology. Gene. 2016;577(2):101–8.
Millar JK, Pickard BS, Mackie S, James R, Christie S, Buchanan SR, Malloy MP, Chubb JE, Huston E, Baillie GS. DISC1 and PDE4B are interacting genetic factors in schizophrenia that regulate cAMP signaling. Science. 2005;310(5751):1187–91.
Wang P, Xu T-Y, Wei K, Guan Y-F, Wang X, Xu H, Su D-F, Pei G, Miao C-Y. ARRB1/β-arrestin-1 mediates neuroprotection through coordination of BECN1-dependent autophagy in cerebral ischemia. Autophagy. 2014;10(9):1535–48.
Velazquez FN, Caputto BL, Boussin FD. c-Fos importance for brain development. Aging. 2015;7(12):1028–9.
Tapias A, Wang Z-Q. Lysine acetylation and deacetylation in brain development and neuropathies. Genomics Proteomics Bioinformatics. 2017;15(1):19–36.
Liu X, Han D, Somel M, Jiang X, Hu H, Guijarro P, Zhang N, Mitchell A, Halene T, Ely JJ, et al. Disruption of an Evolutionarily Novel Synaptic Expression Pattern in Autism. PLoS Biol. 2016;14(9):e1002558.
He Z, Bammann H, Han D, Xie G, Khaitovich P. NCBI Gene Expression Omnibus (GEO). 2014. https://identifiers.org/bioproject:PRJNA222268.
Liu X, Han D, Somel M, Jiang X, Hu H, Guijarro P, Zhang N, Mitchell A, Halene T, Ely JJ, et al. NCBI Gene Expression Omnibus (GEO). 2016. https://identifiers.org/bioproject:PRJNA254971.
He Z, Bammann H, Han D, Xie G, Khaitovich P. Conserved expression of lincRNA during human and macaque prefrontal cortex development and maturation. RNA. 2014;20(7):1103–11.
Vermunt MW, Tan SC, Castelijns B, Geeven G, Reinink P, de Bruijn E, Kondova I, Persengiev S, Netherlands Brain Bank, Bontrop R, et al. Epigenomic annotation of gene regulatory alterations during evolution of the primate brain. Nat Neurosci. 2016;19(3):494–503.
Mills JD, Kawahara Y, Janitz M. Strand-Specific RNA-Seq Provides Greater Resolution of Transcriptome Profiling. Curr Genomics. 2013;14(3):173–81.
Morabito S, Miyoshi E, Michael N, Shahin S, Martini AC, Head E, Silva J, Leavy K, Perez-Rosendahl M, Swarup V. Single-nucleus chromatin accessibility and transcriptomic characterization of Alzheimer’s disease. Nat Genet. 2021;53(8):1143–55.
Domcke S, Hill AJ, Daza RM, Cao J, O’Day DR, Pliner HA, Aldinger KA, Pokholok D, Zhang F, Milbank JH, et al. A human cell atlas of fetal chromatin accessibility. Science. 2020;370(6518):eaba7612.
Chen YJ, Campbell HG, Wiles AK, Eccles MR, Reddel RR, Braithwaite AW, Royds JA. PAX8 regulates telomerase reverse transcriptase and telomerase RNA component in glioma. Cancer Res. 2008;68(14):5724–32.
Lu Y, Brommer B, Tian X, Krishnan A, Meer M, Wang C, Vera DL, Zeng Q, Yu D, Bonkowski MS, et al. Reprogramming to recover youthful epigenetic information and restore vision. Nature. 2020;588(7836):124–9.
Schormair B, Zhao C, Bell S, Tilch E, Salminen AV, Pütz B, Dauvilliers Y, Stefani A, Högl B, Poewe W, et al. Identification of novel risk loci for restless legs syndrome in genome-wide association studies in individuals of European ancestry: a meta-analysis. Lancet Neurol. 2017;16(11):898–907.
Geng F, Lu GF, Luo YJ, Dominguez S, Kong DY, Shen LH, Luo XM, Yang X, Hu M, Lai WS, et al. The emerging role of the MiR-1272-ADAM9-CDCP1 signaling pathway in the progression of glioma. Aging. 2020;13(1):894–909.
Levey AI, Qiu D, Zhao L, Hu WT, Duong DM, Higginbotham L, Dammer EB, Seyfried NT, Wingo TS, Hales CM, et al. A phase II study repurposing atomoxetine for neuroprotection in mild cognitive impairment. Brain. 2022;145(6):1924–38.
Magalhaes J, Tresse E, Ejlerskov P, Hu E, Liu Y, Marin A, Montalant A, Satriano L, Rundsten CF, Carlsen EMM, et al. PIAS2-mediated blockade of IFN-β signaling: a basis for sporadic Parkinson disease dementia. Mol Psychiatry. 2021;26(10):6083–99.
Yamanaka T, Tosaki A, Kurosawa M, Matsumoto G, Koike M, Uchiyama Y, Maity SN, Shimogori T, Hattori N, Nukina N. NF-Y inactivation causes atypical neurodegeneration characterized by ubiquitin and p62 accumulation and endoplasmic reticulum disorganization. Nat Commun. 2014;5(1):3354.
King MC, Wilson AC. Evolution at two levels in humans and chimpanzees. Science. 1975;188(4184):107–16.
Uddin M, Wildman DE, Liu G, Xu W, Johnson RM, Hof PR, Kapatos G, Grossman LI, Goodman M. Sister grouping of chimpanzees and humans as revealed by genome-wide phylogenetic analysis of brain gene expression profiles. Proc Natl Acad Sci U S A. 2004;101(9):2957–62.
Enard W, Przeworski M, Fisher SE, Lai CSL, Wiebe V, Kitano T, Monaco AP, Pääbo S. Molecular evolution of FOXP2, a gene involved in speech and language. Nature. 2002;418(6900):869–72.
Pelvig DP, Pakkenberg H, Stark AK, Pakkenberg B. Neocortical glial cell numbers in human brains. Neurobiol Aging. 2008;29(11):1754–62.
De Strooper B, Karran E. The Cellular Phase of Alzheimer’s Disease. Cell. 2016;164(4):603–15.
Rubin RD, Watson PD, Duff MC, Cohen NJ. The role of the hippocampus in flexible cognition and social behavior. Front Hum Neurosci. 2014;8:742.
Konopka G, Friedrich T, Davis-Turak J, Winden K, Oldham MC, Gao F, Chen L, Wang GZ, Luo R, Preuss TM, et al. Human-specific transcriptional networks in the brain. Neuron. 2012;75(4):601–17.
Zeisel A, Muñoz-Manchado AB, Codeluppi S, Lönnerberg P, La Manno G, Juréus A, Marques S, Munguba H, He L, Betsholtz C, et al. Cell types in the mouse cortex and hippocampus revealed by single-cell RNA-seq. Science. 2015;347(6226):1138–42.
Chong SY, Chan JR. Tapping into the glial reservoir: cells committed to remaining uncommitted. J Cell Biol. 2010;188(3):305–12.
Lin SC, Bergles DE. Synaptic signaling between GABAergic interneurons and oligodendrocyte precursor cells in the hippocampus. Nat Neurosci. 2004;7(1):24–32.
Voronova A, Yuzwa SA, Wang BS, Zahr S, Syal C, Wang J, Kaplan DR, Miller FD. Migrating interneurons secrete fractalkine to promote oligodendrocyte formation in the developing mammalian brain. Neuron. 2017;94(3):500-516.e509.
Fang LP, Zhao N, Caudal LC, Chang HF, Zhao R, Lin CH, Hainz N, Meier C, Bettler B, Huang W, et al. Impaired bidirectional communication between interneurons and oligodendrocyte precursor cells affects social cognitive behavior. Nat Commun. 2022;13(1):1394.
Nasrabady SE, Rizvi B, Goldman JE, Brickman AM. White matter changes in Alzheimer’s disease: a focus on myelin and oligodendrocytes. Acta Neuropathol Commun. 2018;6(1):22.
Raabe FJ, Slapakova L, Rossner MJ, Cantuti-Castelvetri L, Simons M, Falkai PG, Schmitt A. Oligodendrocytes as A New Therapeutic Target in Schizophrenia: From Histopathological Findings to Neuron-Oligodendrocyte Interaction. Cells. 2019;8(12):1496.
Maas DA, Eijsink VD, Spoelder M, van Hulten JA, De Weerd P, Homberg JR, Vallès A, Nait-Oumesmar B, Martens GJM. Interneuron hypomyelination is associated with cognitive inflexibility in a rat model of schizophrenia. Nat Commun. 2020;11(1):2329.
Schmitt A, Leonardi-Essmann F, Durrenberger PF, Wichert SP, Spanagel R, Arzberger T, Kretzschmar H, Zink M, Herrera-Marschitz M, Reynolds R, et al. Structural synaptic elements are differentially regulated in superior temporal cortex of schizophrenia patients. Eur Arch Psychiatry Clin Neurosci. 2012;262(7):565–77.
Mathys H, Davila-Velderrain J, Peng Z, Gao F, Mohammadi S, Young JZ, Menon M, He L, Abdurrob F, Jiang X, et al. Single-cell transcriptomic analysis of Alzheimer’s disease. Nature. 2019;570(7761):332–7.
Le Hellard S, Mühleisen TW, Djurovic S, Fernø J, Ouriaghi Z, Mattheisen M, Vasilescu C, Raeder MB, Hansen T, Strohmaier J, et al. Polymorphisms in SREBF1 and SREBF2, two antipsychotic-activated transcription factors controlling cellular lipogenesis, are associated with schizophrenia in German and Scandinavian samples. Mol Psychiatry. 2010;15(5):463–72.
Santiago JA, Potashkin JA. Network-based metaanalysis identifies HNF4A and PTBP1 as longitudinally dynamic biomarkers for Parkinson’s disease. Proc Natl Acad Sci U S A. 2015;112(7):2257–62.
Chung YH, Qian Q, Huang HY, Chiu WT, Yang CS, Tzeng SF. The Nuclear Function of IL-33 in Desensitization to DNA Damaging Agent and Change of Glioma Nuclear Structure. Front Cell Neurosci. 2021;15:713336.
Mineo M, Ricklefs F, Rooj AK, Lyons SM, Ivanov P, Ansari KI, Nakano I, Chiocca EA, Godlewski J, Bronisz A. The long non-coding RNA HIF1A-AS2 facilitates the maintenance of mesenchymal glioblastoma stem-like cells in hypoxic niches. Cell Rep. 2016;15(11):2500–9.
Bansod S, Kageyama R, Ohtsuka T. Hes5 regulates the transition timing of neurogenesis and gliogenesis in mammalian neocortical development. Development. 2017;144(17):3156–67.
Khan FH, Ahlberg CD, Chow CA, Shah DR, Koo BB. Iron, dopamine, genetics, and hormones in the pathophysiology of restless legs syndrome. J Neurol. 2017;264(8):1634–41.
Catoire H, Dion PA, Xiong L, Amari M, Gaudet R, Girard SL, Noreau A, Gaspar C, Turecki G, Montplaisir JY, et al. Restless legs syndrome-associated MEIS1 risk variant influences iron homeostasis. Ann Neurol. 2011;70(1):170–5.
Kim SY, Levenson JM, Korsmeyer S, Sweatt JD, Schumacher A. Developmental Regulation of Eed Complex Composition Governs a Switch in Global Histone Modification in Brain. J Biol Chem. 2007;282(13):9962–72.
Lim DA, Huang YC, Swigut T, Mirick AL, Garcia-Verdugo JM, Wysocka J, Ernst P, Alvarez-Buylla A. Chromatin remodelling factor Mll1 is essential for neurogenesis from postnatal neural stem cells. Nature. 2009;458(7237):529–33.
Huang HS, Matevossian A, Whittle C, Kim SY, Schumacher A, Baker SP, Akbarian S. Prefrontal dysfunction in schizophrenia involves mixed-lineage leukemia 1-regulated histone methylation at GABAergic gene promoters. J Neurosci. 2007;27(42):11254–62.
Liu Y, Han D, Han Y, Yan Z, Xie B, Li J, Qiao N, Hu H, Khaitovich P, Gao Y, et al. Ab initio identification of transcription start sites in the Rhesus macaque genome by histone modification and RNA-Seq. Nucleic Acids Res. 2011;39(4):1408–18.
He Z, Han D, Efimova O, Guijarro P, Yu Q, Oleksiak A, Jiang S, Anokhin K, Velichkovsky B, Grunewald S, et al. Comprehensive transcriptome analysis of neocortical layers in humans, chimpanzees and macaques. Nat Neurosci. 2017;20(6):886–95.
Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AF, Roskin KM, Baertsch R, Rosenbloom K, Clawson H, Green ED, et al. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Res. 2004;14(4):708–15.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9.
Neph S, Kuehn MS, Reynolds AP, Haugen E, Thurman RE, Johnson AK, Rynes E, Maurano MT, Vierstra J, Thomas S, et al. BEDOPS: high-performance genomic feature operations. Bioinformatics. 2012;28(14):1919–20.
Yu G, Wang L-G, He Q-Y. ChIPseeker: an R/Bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics. 2015;31(14):2382–3.
Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26(6):841–2.
Aronesty E. ea-utils: Command-line tools for processing biological sequencing data. 2011. https://github.com/ExpressionAnalysis/ea-utils.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21.
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–7.
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47(D1):D607–D613.
Scardoni G, Petterlini M, Laudanna C. Analyzing biological network parameters with CentiScaPe. Bioinformatics. 2009;25(21):2857–9.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
Shah SG, Mandloi T, Kunte P, Natu A, Rashid M, Reddy D, Gadewal N, Gupta S. HISTome2: a database of histone proteins, modifiers for multiple organisms and epidrugs. Epigenetics Chromatin. 2020;13(1):31.
Medvedeva YA, Lennartsson A, Ehsani R, Kulakovskiy IV, Vorontsov IE, Panahandeh P, Khimulya G, Kasukawa T, FANTOM Consortium, Drabløs F. EpiFactors: a comprehensive database of human epigenetic factors and complexes. Database (Oxford). 2015;2015:bav067.
We thank the donors and their families for the tissue samples used in these studies. We also thank the NICHD Brain and Tissue Bank for Developmental Disorders at the University of Maryland (USA), as well as the Anthropological Institute & Museum of the University of Zürich-Irchel (Switzerland), the Biomedical Primate Research Centre (Netherlands), and the Suzhou Experimental Animal Center (China) for the non-human primate samples.
We are also grateful to Prof. Gang Wei of the CAS-MPG Partner Institute for Computational Biology for helpful discussions and suggestions.
This work was supported by grants from the National Natural Science Foundation of China (grant numbers 31501012 and 81871534), and the Ministry of Finance of China (grant number GY2021G-2).
Ethics approval and consent to participate
This study was reviewed and approved by the Institutional Animal Care and Use Ethics Committee of the Shanghai Institute for Biological Sciences (approval ID: ER-SIBS-260802P). For human samples in this study, informed consent was obtained from all donors or their next of kin.
Consent for publication
Competing interests
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Additional file 1:
- Table S1. Sample information.
- Table S2. ChIP-seq mapping statistics.
- Table S3. Normalized read counts of H3K4me3 peaks in human, chimpanzee and macaque PFC used in this study.
- Table S4. Normalized read counts of H3K27ac peaks in human, chimpanzee and macaque PFC used in this study.
- Table S5. List of 1175 H3K4me3 peaks with human-specific gain in PFC.
- Table S6. List of 775 H3K4me3 peaks with human-specific loss in PFC.
- Table S7. Functional enrichment of genes adjacent to H3K4me3HP gain.
- Table S8. Functional enrichment of genes adjacent to H3K4me3HP loss.
- Table S9. List of 483 H3K27ac peaks with human-specific gain in PFC.
- Table S10. List of 316 H3K27ac peaks with human-specific loss in PFC.
- Table S11. Functional enrichment of genes adjacent to H3K27acHP gain and loss.
- Table S12. Strand-specific RNA-seq mapping statistics.
- Table S13. Raw and normalized read counts in human, chimpanzee and macaque PFC used in this study.
- Table S14. List of 69 TFs and their target genes with absolute Pearson correlation coefficients > 0.6 and P values < 0.05.
- Table S15. List of 28 H3K4me3- and two H3K27ac-modifying enzymes.
Additional file 2:
Fig. S1. Species-specific expressed genes detected in RNA-seq and ssRNA-seq datasets. Overlap of human-, chimpanzee-, and macaque-specific genes detected in RNA-seq and ssRNA-seq. Pearson correlation coefficients of log2-transformed fold changes between RNA-seq and ssRNA-seq for each species-specific gene set. All correlation coefficients were calculated using the common species-specific expressed genes identified in both datasets. Each symbol represents an individual gene; the lines show linear model fits. Different colors represent each pairwise comparison. ***P < 0.001.
Fig. S2. Proportion of type I, II and III genes regulated by species-specific histone peaks. Different colors denote each primate. The dark color denotes H3K4me3 modification, and the light color denotes H3K27ac modification. Significant overlap between the three types and expressed genes regulated by species-specific histone modification is marked by an asterisk in each bar.
Fig. S3. Histone-TF target regulatory network. The association expected by chance between human-chimpanzee gene expression differences and coupled TFs’ expression differences or coupled H3K4me3/H3K27ac coverage differences was estimated by randomly coupling genes to those regulators 1000 times. Green, TF; orange, H3K4me3; light orange, H3K27ac. Pearson correlation coefficients between log2 fold changes of human-chimpanzee expression differences and coupled regulators. The red triangle denotes the true value corresponding to each regulatory mechanism. The percentage of gene expression variance between human and chimpanzee explained by coupled regulators. Left bar: number of TFs significantly correlated with human-specific genes detected in the ssRNA-seq dataset, determined by comparing the absolute Pearson correlation coefficients calculated within human-specific genes to those calculated between the same TF and its corresponding non-human-specific target genes using a one-sided Wilcoxon rank-sum test. The streaked bar represents the average number of correlated TFs expected by chance, estimated by shuffling species’ labels 1000 times for genes. Right bar: number of human-specific genes significantly correlated with their corresponding TFs, determined by comparing the absolute Pearson correlation coefficients between the expression profiles of each human-specific gene and its corresponding TFs to those calculated between the same gene and the same number of TFs randomly selected from all expressed genes using a one-sided Wilcoxon rank-sum test. The streaked bar represents the average number of correlated human-specific genes expected by chance, estimated by shuffling species’ labels 1000 times for TFs.
Fig. S4. Six major brain cell types in Alzheimer’s disease enriched with genes adjacent to H3K4me3HP and H3K27acHP. Orange represents H3K4me3 and H3K27ac peaks with significant enrichment; green represents H3K4me3 and H3K27ac peaks with significant depletion. For each cell type, a hypergeometric test was applied to assess the overrepresentation of marker genes among H3K4me3HP- and H3K27acHP-enriched genes, using all of the brain genes as the background. A BH-corrected P < 0.05 was used as the enrichment cutoff.
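The enrichment procedure described in the Fig. S4 legend (a hypergeometric test per cell type, followed by Benjamini-Hochberg correction) can be illustrated with a minimal sketch. This is not the authors' code: the function names and the toy counts below are assumptions chosen for illustration, and only the Python standard library is used.

```python
from math import comb


def hypergeom_upper_tail(k, background, markers, selected):
    """P(X >= k): chance of seeing at least k marker genes among `selected`
    genes drawn without replacement from `background` genes, of which
    `markers` are cell-type markers."""
    upper = min(markers, selected)
    total = comb(background, selected)
    hits = sum(
        comb(markers, i) * comb(background - markers, selected - i)
        for i in range(k, upper + 1)
    )
    return hits / total


def benjamini_hochberg(pvals):
    """Return BH-adjusted P values in the same order as the input."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value to the smallest so that adjusted
    # values never decrease as the raw P values increase.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical toy counts (not taken from the study): for each cell type,
# (overlap, background genes, marker genes, peak-adjacent genes).
toy_counts = {
    "cell_type_A": (2, 20, 5, 4),
    "cell_type_B": (4, 20, 5, 4),
}
raw_p = {name: hypergeom_upper_tail(*c) for name, c in toy_counts.items()}
adj_p = dict(zip(raw_p, benjamini_hochberg(list(raw_p.values()))))
# Cell types passing the cutoff named in the legend (BH-corrected P < 0.05).
enriched = [name for name, p in adj_p.items() if p < 0.05]
```

In this sketch the upper tail of the hypergeometric distribution measures enrichment; testing for depletion would use the lower tail, P(X <= k), instead.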
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Sun, W., Xie, G., Jiang, X. et al. Epigenetic regulation of human-specific gene expression in the prefrontal cortex. BMC Biol 21, 123 (2023). https://doi.org/10.1186/s12915-023-01612-3
- Strand-specific RNA-seq (ssRNA-seq)
- Prefrontal cortex (PFC)
- Transcription factor (TF)
- Histone-modifying enzyme