Efficient prioritization of CRISPR screen hits by accounting for targeting efficiency of guide RNA

Park, Byung-Sun; Jeon, Heeju; Chi, Sung-Gil; Kim, Tackhoon

doi:10.1186/s12915-023-01536-y

Methodology article
Open access
Published: 24 February 2023

Efficient prioritization of CRISPR screen hits by accounting for targeting efficiency of guide RNA

Byung-Sun Park^1,2,
Heeju Jeon^1,2,
Sung-Gil Chi² &
…
Tackhoon Kim ORCID: orcid.org/0000-0001-7911-6273^1,2,3

BMC Biology volume 21, Article number: 45 (2023) Cite this article

3116 Accesses
2 Citations
1 Altmetric
Metrics details

Abstract

Background

CRISPR-based screens are revolutionizing drug discovery as tools to identify genes whose ablation induces a phenotype of interest. For instance, CRISPR-Cas9 screening has been successfully used to identify novel therapeutic targets in cancer where disruption of genes leads to decreased viability of malignant cells. However, low-activity guide RNAs may give rise to variable changes in phenotype, preventing easy identification of hits and leading to false negative results. Therefore, correcting the effects of bias due to differences in guide RNA efficiency in CRISPR screening data can improve the efficiency of prioritizing hits for further validation. Here, we developed an approach to identify hits from negative CRISPR screens by correcting the fold changes (FC) in gRNA frequency by the actual, observed frequency of indel mutations generated by gRNA.

Results

Each gRNA was coupled with the “reporter sequence” that can be targeted by the same gRNA so that the frequency of mutations in the reporter sequence can be used as a proxy for the endogenous target gene. The measured gRNA activity was used to correct the FC. We identified indel generation efficiency as the dominant factor contributing significant bias to screening results, and our method significantly removed such bias and was better at identifying essential genes when compared to conventional fold change analysis. We successfully applied our gRNA activity data to previously published gRNA screening data, and identified novel genes whose ablation could synergize with vemurafenib in the A375 melanoma cell line. Our method identified nicotinamide N-methyltransferase, lactate dehydrogenase B, and polypyrimidine tract-binding protein 1 as synergistic targets whose ablation sensitized A375 cells to vemurafenib.

Conclusions

We identified the variations in target cleavage efficiency, even in optimized sgRNA libraries, that pose a strong bias in phenotype and developed an analysis method that corrects phenotype score by the measured differences in the targeting efficiency among sgRNAs. Collectively, we expect that our new analysis method will more accurately identify genes that confer the phenotype of interest.

Background

CRISPR-Cas9 screens [1,2,3,4] have been widely adopted to discover novel therapeutic targets, whose disruption leads to a favorable phenotype in relevant disease models. For instance, crucial factors that modulate cancer immunotherapy [5], as well as other diseases, including ferroptosis-associated lipid peroxidation [6] and hemoglobinopathies [7] have been discovered. CRISPR-Cas9 screens are widely used as viability screens for cancer research to search for genes whose ablation can decrease cancer cell fitness.

CRISPR-Cas9 screens, especially dropout screens that identify the sgRNAs that are depleted in the population, typically yield tens to hundreds of candidate “hits.” Therefore, efficient prioritization of the hits among those candidates for further validation can enhance efficiency in finding genes whose ablation reduces cellular fitness. However, low-activity sgRNAs inevitably add bias to the screening results, leading to false negative hits. Different sgRNAs targeting the same gene often result in varying changes in the phenotype despite the assumption underlying CRISPR-Cas9 screens that the change in phenotype by sgRNA expression is purely a consequence of the ablation of the sgRNA target gene. Several attributes of sgRNAs have been proposed to contribute to the noise and bias in CRISPR screens, such as (i) sgRNAs can induce off-target effects by targeting other unintended gene sites [8,9,10,11]; (ii) the copy number of targeted genes varies in cells, especially in the cancer cells [12, 13]; and (iii) efficiencies of insertions and deletions (indels) differ among gRNAs [14]. Therefore, one of the most crucial tasks in the discovery of target genes using CRISPR screens is minimizing the noise and bias in CRISPR-Cas9 screens to maximize the likelihood of finding true hits within the candidate hits.

Several approaches have been developed to refine CRISPR screening results. Doench and colleagues have used an increasing number of sgRNAs targeting the same gene [15]. Elling and colleagues and other groups used a unique molecular identifier (UMI) to lineage trace single cell in CRISPR screens to remove outlier cells with aberrant behaviors [16,17,18]. Also, Garnett and colleagues and others made extensive characterization of the standard sgRNAs widely used to generate minimized sgRNA library consisting of the most effective guides [13, 19]. Many groups have developed rules that govern the on-target and off-target activities of sgRNAs to successfully predict them [15]. However, considering that sgRNA sequence design is restrained by the presence of PAM and the sequence of the target gene, it is practically impossible to expect that thousands of sgRNAs used in CRISPR screens are all perfectly optimized. In addition, sgRNA optimization rules must be established for each Cas nuclease, including the less characterized Cas nucleases [20] and engineered SpCas9 nucleases [21,22,23], limiting generalizability.

Parts and colleagues and Tsherniak and colleagues developed methods to infer sgRNA activity from published CRISPR screen data to correct the screen results [24, 25]. The inference requires an established set of CRISPR screen results across multiple cell lines for best performance. Therefore, while this approach may be useful for correcting CRISPR screen data with established sgRNA library with records of well-validated results, such data are again limited for CRISPR screens with custom sgRNA libraries or with less characterized Cas nucleases. We reasoned that the above limitations in optimization and correction of CRISPR screens can be overcome if the frequency and DNA cleavage efficiency of each sgRNA can be measured simultaneously (Fig. 1). This approach enables efficient correction of CRISPR screen results without existing knowledge in sgRNA activity or requiring multiple replicates across multiple cell lines because each replicate contains both the changes in viability and frequency of indel mutations in cells carrying a specific sgRNA.

By taking a library architecture that enables such a task [26], we present a novel analysis method for adjusting changes in viability with the actual, observed differences in indel frequencies. By performing a “tiling array” screen that involves the use of every possible sgRNA that targets seven different genes, we confirmed that the variation in sgRNA activity is by far the dominant factor contributing to bias in the results. We further developed and validated an analysis method to correct the viability score based on differences in sgRNA activity. We applied our new analysis method to previously published CRISPR screens to improve the receiver operating characteristic area under the curve (ROC-AUC) value for the true positive discovery of essential genes exceeding 0.983. We expect our method to provide a framework for removing bias in CRISPR screens for more efficient prioritization of screen hits for further validation where pre-existing evidences of sgRNA activities are limiting.

Results

Indel generation efficiency of sgRNA is the dominant factor influencing the changes in phenotype

We first identified the contribution of each sgRNA attribute to the bias in the CRISPR screen results. To this end, we generated a tiling array library that contained all possible sgRNAs targeting seven selected genes. The selected genes included two essential genes (RPL8 and RPL15), two dispensable genes (CCR5 and CD4), and three genes that were expected to give an intermediate phenotype (FNTA, WWTR1, GSK3B). As previously described [26], each sgRNA within the library was paired with a sequence (hereafter called the “reporter sequence”) that can be targeted by the same sgRNA (Fig. 2A). If the sgRNA is active and can cleave the endogenous target gene, the sgRNA will similarly generate an indel mutation in the reporter sequence. The library construction was done with fold coverage of at least 500 to preserve even representation of sgRNAs. Also, PCR amplification of the oligonucleotide pool was optimized to minimize the decoupling of sgRNA and reporter sequence (see the “Methods” section) [27]. We confirmed that > 70% of reporter sequences were correctly paired with their corresponding sgRNAs using deep sequencing (Additional file 1: Fig. S1A). We delivered this library into an A375 melanoma cell line stably expressing Cas9. The activity of Cas9 was confirmed in at least 87% of cells using flow cytometry (Additional file 1: Fig. S1B). The indel mutations in the reporter sequence and endogenous target were well correlated (Additional file 1: Fig. S1C). A375-Cas9 cells were infected with the lentiviral library and were collected at multiple time points for 3 weeks for genomic DNA extraction and deep sequencing (Fig. 2B).

Paired-end sequencing of the PCR amplicon derived from the lentiviral genome could simultaneously identify sgRNA and the presence of indel mutations in the reporter sequence (Fig. 2A). NGS analysis revealed that most sgRNAs are well represented with 89% (962/1086) and 97% (1045/1086) of sgRNA frequencies within library within twofold and fivefold difference from median frequency (0.93 reads per thousand), respectively (Additional file 1: Fig. S1D). 2.6% (28/1086) of sgRNAs with reads per thousand of less than 0.1 were excluded from analysis as these sgRNAs showed large variation in sgRNA frequency and indel frequency between biological replicates, indicating genetic drift (Additional file 1: Fig. S1E, F). Fold changes (FC) in the frequency of each sgRNA and the fraction of reporter sequences that had been mutated had a very good correlation (Pearson’s r = 0.98, p < 0.0001) between biological replicates, highlighting the reproducibility of the screening approaches (Fig. 2C, D). FC and mutation frequency of the reporter sequence were correlated with viability scores in previously published screen results (Additional file 1: Fig. S1G) and the predicted on-target cleavage efficiency score (Additional file 1: Fig. S1H), respectively. We observed a gradual increase in indel frequency over time (Fig. 2E), indicating that the saturation of the indel mutation was minimal during the experiment. The indel frequencies varied between 0 and 94%, reflecting that the library consisted of an unoptimized set of sgRNAs.

We first assessed the effect of several attributes of the sgRNA, including indel frequency, off-target effect, and positions of the sgRNA target sequences within the gene sequence, on the FC values for each sgRNA. The FC showed a negative correlation with the indel frequency of the reporter sequence (Pearson’s r = − 0.30, p < 0.0001, Fig. 3A, Additional file 1: Fig. S2). In contrast, the predicted number of off-targets, position of targeted sites within the coding sequence, predicted frequency of in-frame mutation, and sgRNA specificity scores had little, if any, influence on FC values (Pearson’s r = − 0.14–0.13, p > 0.05, Fig. 3B–D). sgRNAs whose target sites are spanning the exon–intron junctions [28] did not show any significant differences in fold change compared to those targeting within an exon (Fig. 3E, F). These results suggest that indel frequencies of the gRNAs are by far the most dominant factor in determining the FC of sgRNAs. This prompted us to develop a new analysis method to correct the bias.

Mutations in the reporter sequence are coupled with those in the endogenous target gene in the same cell

We recognized that there are sgRNAs with a very low FC and indel frequency. Cells expressing inactive or low-activity sgRNA should have viability comparable to those expressing non-targeting sgRNA. Therefore, the theoretical minimum FC value for each sgRNA (assuming zero viability for cells that have indel mutation in the target sgRNA gene) is 1-indel frequency. However, the FC values of sgRNAs, especially those targeting essential genes (RPL8 and RPL15), fell far below the theoretical minimum, leading to an unacceptable conclusion that the viability of cells with target gene cleaved is below zero (Fig. 4A–C, Additional file 1: Fig. S3A-B). Hence, we hypothesized that the indel frequency is particularly underestimated for sgRNAs targeting genes whose disruption causes significant decreases in viability. To this end, we hypothesized that the mutation of the reporter sequence is coupled with that of endogenous target genes. Therefore, cells expressing sgRNA against essential genes with the reporter sequence and sgRNA target gene mutated will quickly be eliminated from the population, leading to the underrepresentation of indel mutation in the reporter sequence (Fig. 4D).

To test our hypothesis, we performed a 6-thioguanine (6-TG) selection assay [15]. 6-TG is metabolized by HPRT1 to thioguanosine monophosphate, which blocks purine biosynthesis and ultimately causes DNA damage and toxicity [29]. Therefore, ablation of HPRT1 prevents cell death after 6-TG treatment. If the mutation of the reporter sequence is independent of that of endogenous genes, the frequency of the mutated reporter sequence will be the same regardless of the 6-TG selection. Alternatively, if the mutation is a reporter sequence coupled with that of endogenous genes, the cells that survived 6-TG selection and those with endogenous HPRT1 mutations will have the reporter sequence mutations as well. We constructed vectors containing an HPRT1-targeting sgRNA coupled with a reporter sequence that could be targeted by the same sgRNA. Surprisingly, we found that up to 94% of cells that survived the 6-TG treatment had a mutated reporter sequence, suggesting that the mutation of the reporter sequence and that of endogenous sgRNA target genes were coupled (Fig. 4E). This is remarkable as the reporter sequence is located at random location independent of endogenous sgRNA target in HPRT1 gene. Any changes in the frequency of sgRNA, which is a consequence of the change in viability upon mutation of the endogenous target gene, will also change the indel frequency by the same magnitude (Fig. 4D). Therefore, the observed indel is underrepresented by FC-1 when compared with the actual indel frequency. This led us to define an adjusted, actual indel frequency denoted as “X,” as a function of the FC and observed indel. The equation for “X” is given below:

$$X=1-\mathrm{FC}+\mathrm{FC}\times \mathrm{indel}$$

Establishing a model for inferring viability upon gene perturbation from the changes in sgRNA frequency

We then established a model scenario that can be used to infer the actual viability v upon ablation of the sgRNA target gene from the changes in sgRNA frequency within the library. Any changes in the frequency of each sgRNA were a consequence of the change in viability v − 1 in the X fraction of the total population carrying the given sgRNA. Therefore, the FC, the observed fold change in sgRNA frequency in CRISPR screens, can be described as a function of v and X as follows:

$$\mathrm{FC}=\left({\varvec{v}}-1\right)\times X+1$$

In this model, FC is equal to 1 and 1 − X when v is 1 and 0, respectively. The deviation of FC from 1 − X is proportional to v. Therefore, v can be calculated as

$${\varvec{v}}=\frac{\mathrm{FC}-\left(1-X\right)}{X}$$

We first tested whether using the new viability metric v as the actual viability could attenuate bias. Contrary to our expectation, v still showed a clear negative correlation with X, even for genes whose ablations are not expected to affect the viability (e.g., CD4 and CCR5) (Additional file 1: Fig. S3C, D). In search of the cause of this correlation, we found that even v values for the negative control sgRNAs showed a negative correlation with X. This correlation may be a consequence of the DNA damage response and subsequent proliferation arrest upon indel mutations [30, 31]. The sgRNAs with higher cleavage efficiencies may also have higher off-target cleavage frequencies.

To adjust the changes in FC by simple negative correlation with X, we used the linear regression of FC and X values of the negative control sgRNAs as the hypothetical FC values when v = 1. With FC_v=1 as the expected FC when v = 1 inferred from the negative control sgRNAs (Fig. 4F), we newly defined the viability v for any given sgRNA as follows:

$${\varvec{v}}=\frac{\mathrm{FC}-{\mathrm{FC}}_{{\varvec{v}}=0}}{{\mathrm{FC}}_{{\varvec{v}}=1}-{\mathrm{FC}}_{{\varvec{v}}=0}}=\frac{\mathrm{FC}-\left(1-X\right)}{{\mathrm{FC}}_{v=1}-\left(1-X\right)}$$

The new analysis method reduces bias in the phenotype score of sgRNAs in screens with custom sgRNA library

We examined whether the use of our new viability metric v can attenuate the bias in the viability score by indel frequency. Actual viability v values had a much weaker correlation with X values than with FC within the range of moderate to high X values (X > 0.44) (Pearson’s r_FC = − 0.94 to − 0.52, r_v = − 0.32–0.13; Fig. 4G). Notably, the v values tended to increase with decreasing X values for the X values less than 0.44. This may be due to noise in the FC values generated by the off-target effect, which can be amplified with very low X values (see the “Discussion” section). We also tested whether sgRNAs targeting the essential genes RPL8 and RPL15 were more efficiently identified with v than with FC in our tiling array screens. Remarkably, the v score more efficiently identified RPL8 and RPL15 as essential genes with an increase in the ROC-AUC value of 6.27% and an increase in delta AUC value, which quantifies the ability in distinguishing essential and non-essential genes within library [32] by 21.7% (Fig. 4H). The AUC values were lower than those previously reported in genomewide screens, largely because the library mainly consisted of unoptimized sgRNAs.

Simultaneous quantification of sgRNA frequency and activity can be used for correcting bias in screens with custom sgRNA library without the existing database of sgRNA activity or screening results across multiple cell lines (Fig. 1). As a proof of concept, we investigated whether our new analysis method can more accurately identify essential genes in a customized sgRNA library containing 6932 sgRNAs chosen from the Brunello library [15] targeting 2305 genes that are predicted to be druggable by small molecules [33] (Fig. 5A). NGS analysis revealed that most sgRNAs are well represented with 76% (5236/6932) and 98% (6793/6932) of sgRNA frequencies within library within twofold and fivefold difference from median frequency (0.12 reads per thousand), respectively (Additional file 1: Fig. S4A). Up to 80% of sgRNAs were correctly paired with their reporter sequences (Additional file 1: Fig. S4B). Variations between replicates were consistent regardless of sgRNA frequency in the library, indicating minimal genetic drift (Additional file 1: Fig. S4C, D). The fold change and indel frequency between biological replicates were well correlated (Pearson’s r = 0.81–0.83, p < 0.0001 for fold change and Pearson’s r = 0.88–0.90, p < 0.0001 for indel frequency; Additional file 1: Fig. S4E, F). The median indel frequency reached 77.9% 21 days after infection (Additional file 1: Fig. S4G), suggesting that a significant portion of the optimized sgRNAs did not edit 100% of their target genes. Low-specificity sgRNAs predicted by GuideScan [34] did not have lower fold change compared to high-specificity sgRNAs (Additional file: Fig. S4H). Remarkably, the v value obtained with the same method as Fig. 4 was much less dependent of indel frequency calculated as X (Fig. 5B). Similar to what was performed for tiling array screens, we first compared the effectiveness of conventional FC and v in identifying previously identified essential genes [35]. Similar to the results shown in Fig. 4H, the ROC-AUC value and dAUC were enhanced by 5.24% and 15.75%, respectively, with the v metric as the viability score (Fig. 5C). Our screening results analyzed with the v metric were more consistent with previous screening results in the DepMap project [36] compared to those with FC metric (Additional file 1: Fig. S5A). In contrast, JACKS analysis failed to increase ROC-AUC in our screens (Fig. 5C), possibly because of the difference in our library design containing reporter sequence.

Finally, we tested whether our analysis method and the indel frequencies analyzed in Fig. 4B, C can be applied to screening results generated by an independent group using the same library [32]. The v viability metric more efficiently identified essential genes with ROC-AUC reaching 0.9830, outperforming conventional FC analysis (0.9716) and JACKS analysis [25] (0.9771) (Fig. 5D, E, Additional file 1: Fig. S5B). Our analysis did not compromise accuracy (98.4%, 96.8%, and 96.8% at 20% false discovery rate with v, FC, and JACKS, respectively) (Fig. 5F) while it was particularly effective in decreasing false negative data, with false discovery rate at 95% and 98% recall (true discovery) decreased by 2.0%p and 60%p compared to using FC values (Fig. 5G). This is expected as the essential genes represented in the library with low-activity sgRNAs, such as critical cell cycle regulator CDC25A [37, 38] in the present sgRNA library (median indel frequency of 37.7% 21 days after transduction), are classified as not essential (Additional file 1: Fig. S5C).

Our method also increased ROC-AUC in CRISPR screening results in 75% (12/16) of cell lines analyzed with an independent whitehead library [39, 40] (Additional file 1: Fig. S5D). The improvements were less dramatic compared to those in Fig. 5D–G because experimental conditions of our CRISPR screening would likely differ from those of previously published studies and therefore the sgRNA activity values are not accurate. Also, our analysis was limited to sgRNAs that are common in the libraries used in published results and those used in Fig. 5A (Additional file 1: Fig. S5D) [25, 39,40,41]. In fact, the whitehead library, which benefitted by using the v metric, had the largest number of common sgRNAs (1206 sgRNAs), whereas TKO (436 sgRNAs) and Yusa (1002 sgRNAs) libraries with fewer number of common sgRNAs failed to achieve a significant increase in ROC-AUC across cell lines. Our method showed comparable performance as JACKS across all libraries tested (Additional file 1: Fig. S5D).

The new analysis method identified NNMT, LDHB, and PTBP1 as vemurafenib resistance genes

sgRNA activity-corrected viability profile could be used for more efficient identification of genes with the desired phenotype. Hence, we analyzed CRISPR screens in Fig. 5A using conventional FC or v as metrics of viability in the presence and absence of the BRAF inhibitor, vemurafenib [42], to identify genes whose ablation could sensitize melanoma cells to vemurafenib. Both FC and v values were used for MAGeCK analysis [43] to calculate the gene-level significance of essentiality in the presence and absence of the drug (Fig. 6A–F). The PANTHER overrepresentation test [44, 45] revealed that the Notch signaling and PI3 kinase pathways (Fig. 6E, F), which are known to be involved in BRAF or MEK inhibitor resistance in melanoma [46,47,48,49,50,51], were exclusively identified in the list of hits obtained by analyzing v. In addition, the MaGECK [43] analysis using v identified NNMT, LDHB, PTBP1, receptor-interacting serine/threonine-protein kinase 1 (RIPK1), insulin-like growth factor 1 receptor (IGF1R), and presenilin-2 (PSEN2) as among the top hits that were not identified by conventional analysis with FC (Fig. 6A–C). IGF1R [52], RIPK1 [53], and PSEN2 [54] are already known to contribute to vemurafenib resistance, suggesting that our new analysis method using v reliably identified hits whose ablation can sensitize melanoma to vemurafenib. We similarly used FC and v values as input for the DrugZ analysis [55] (Fig. 6A). LDHB, RIPK1, and NNMT were similarly more effectively identified as hits using v values for the DrugZ analysis. TANK-binding kinase 1 (TBK1) [56, 57] and fibroblast growth factor 2 (FGF2) [58,59,60], which were previously reported to be involved in vemurafenib resistance, were also identified as top hits by using DrugZ with v input (Additional file 1: Fig. S6).

NNMT [61,62,63], LDHB [64, 65], and PTBP1 [66,67,68] have been studied for promoting cancer development, but their synergistic effects with vemurafenib are not clear. To validate the effect of NNMT, LDHB, and PTBP1 in A375 cells with vemurafenib treatment, we performed a GFP competition assay, where the sgRNA-expressing cells labeled with GFP were cocultured with unlabeled, wild-type cells, and the relative viability of sgRNA-expressing cells was analyzed by changes in the frequency of GFP-positive cells (Fig. 6G, H). When each of the three genes was knocked out by CRISPR-Cas9, the GFP-guide RNA-expressing cells selectively lost their fitness in the presence of vemurafenib (Fig. 6I). We further confirmed that the NNMT or LDHB knockout (Additional file 1: Fig. S7) reduced the survival rate when compared to wild-type cells under vemurafenib treatment (Fig. 6H). Finally, JBSFN000088, (R)-GNE-140, and NVP-ADW742 which are inhibitors of NNMT, LDHB, and IGF1R respectively, showed a synergistic effect with vemurafenib compared to the DMSO-treated control group (Fig. 6J–L, Additional file 1: Fig. S8). In addition, these inhibitors overcame resistance in vemurafenib-resistant A375-VR cells and Hs294T cells, which are intrinsic BRAF inhibitor-resistant cell lines [69] (Additional file 1: Fig. S9, 10). Collectively, the new analysis using v effectively identified potential therapeutic target genes whose inhibition can overcome resistance to targeted therapy.

Discussion

Pooled CRISPR screens enable a massive parallel inquiry of phenotypic changes upon ablation of thousands of genes in one experiment. Although this method is robust and economical, inherent noise and bias is stemming from the fact that the behaviors of individual cells ablated with each gene cannot be observed. Our work provides a widely applicable method to perform genetic screening and identify hits with a substantially reduced risk of bias due to variations in sgRNA activity. Especially, we successfully implemented our method to previous CRISPR screening results for better identification of essential genes. This highlights the generalizability of our approach.

Our method is unique as we used actual, measured sgRNA efficiency estimate to correct the screening results whereas other approaches were intended to use sgRNA efficiency estimates inferred from screening results generated in previous studies. Therefore, our method is best suited for screens using custom sgRNA libraries with little to no available published screening results to infer sgRNA activities. Our method is also easily applicable to screens using other Cas nucleases such as Cas12a and Campylobacter jejuni Cas9 (CjCas9) and Streptococcus canis Cas9 (ScCas9) [20, 70], with much less options in sgRNA activity optimizations. The increasing number of sgRNAs per gene in the library can improve the quality of the CRISPR screens by minimizing false negative results. However, especially for genomewide screens, the scales of the sgRNA library and subsequent screens grow too large with this approach. Our method can be used to downsize the sgRNA library by adjusting the viability phenotype of low-activity sgRNAs and minimizing false negatives with less number of sgRNAs. This could particularly be useful for CRISPR screens performed with complex models such as primary cultures with limited scalability.

Intriguingly, the presence of a mutation in the reporter sequence was tightly coupled to that of the endogenous sgRNA target gene at the single-cell level. This is likely a consequence of the variation in the expression of the lentivirally delivered Cas9 gene and sgRNAs. Cells with edited reporter sequences are likely to have a higher expression of Cas9 and sgRNAs than those with intact reporter sequences. Thus, cells with edited reporter sequences are also more likely to have an endogenous target gene edited.

Finally, our new analysis method identified NNMT and LDHB, which would otherwise be missed in conventional analysis, as novel target genes whose ablation sensitizes melanoma cells to vemurafenib. One of the mechanisms of BRAF inhibition is the induction of apoptosis [71]. NNMT1 reinforces chemoresistance by stabilizing the SIRT1 protein [63], which plays a crucial role in cancer drug resistance, including apoptosis inactivation [72, 73]. In addition, inhibiting LDHB increases apoptosis in cancer cells [64], in combination with chemotherapy [74]. Further studies are needed to identify the detailed molecular mechanisms underlying the role of the novel target genes in promoting resistance to therapy.

There are several sources of inaccuracies in measuring sgRNA activity with reporter sequence edit. The editing efficiency by sgRNA does not only depend on the sequence of sgRNA but also by the chromatin landscape of the target sequence [75]. Our reporter sequence as part of a lentiviral transgene integrated at random sites within genome cannot have the same chromatin landscape as the endogenous target gene. In fact, while the indel frequencies of the reporter sequence and the endogenous target gene are correlated, the values were not identical (Fig. S1E). Also, our reporter sequence cannot detect moderate to large size (> 10nt towards the 5′ end of the sgRNA sequence) deletion. This was inevitable as the distance between the sgRNA and the reporter sequence should be minimal for maximal efficiency in PCR amplification of the sgRNA-reporter cassette for NGS, and to prevent decoupling of the sgRNA and the reporter sequence. However, recent studies showed that the size of the indel mutation is non-random and can vary with sgRNA [76]. Therefore, varying fraction of indel mutations in the reporter sequence for each sgRNA may have been lost in the analysis, contributing to bias in the analysis. Finally, the sgRNA and reporter sequence can be coupled at a higher rate than achieved in our screens (Figs. S1A, S4A) by minimizing the physical distance between sgRNA and the reporter sequence, reducing PCR cycles and extending elongation time during library construction and NGS amplicon generation, and decreasing transfection rates [27]. Further modifications for maximizing the accuracy of reporter sequence as a proxy for target gene edit in our analysis may significantly improve the CRISPR screening results.

Although our analysis revealed that the differences in sgRNA activity are the major determinant of variations in viability in CRISPR screens, this does not exclude the possibility that the other attributes of sgRNAs contribute to the bias in screens. It is likely that the effects of other attributes are masked by the sgRNA activity effect. For example, the off-target effect of sgRNA can still be a contributing factor to the noise in the screens. The use of high-fidelity variant Cas9 [21, 23, 77] and specificity-optimizing tools such as GuideScan [34] can be used in combination with our library design for optimal performance. Also, the functional importance of the sgRNA target site within a gene can influence the viability phenotype. Xu and colleagues reported that in frame mutations, which are generally considered neutral, targeting essential protein domain can abolish protein function thereby forming CRISPR knockout hypersensitive regions [78]. As previous studies showed that a significant fraction of indel mutations are in frame [76], designing sgRNAs to maximize the essentiality of the sgRNA targeting site within the gene will significantly improve the robustness of the phenotype.

Other sources of noise can originate from population drift or off-target effects. Our method for determining v relies on identifying the relative FC values within the window of FC values expected when v is equal to 0 and 1. However, the expected values of FC_v=1 and FC_v=0 converge with low X values, so a small noise in FC can be overrepresented as a large fluctuation in v. Therefore, we believe that our analysis will be particularly useful in combination with on-target activity-optimized sgRNA libraries with high X values.

Conclusions

We develop and validate the method to adjust the phenotype scores with measured sgRNA activity in CRISPR viability screens. We expect our method can be used for broad applications where options for sgRNA optimization are limited.

Methods

Cell culture

HEK-293 T cells were maintained in Dulbecco’s modified Eagle’s medium (DMEM) supplemented with 10% fetal bovine serum (Welgene) and antibiotics. A375, Hs294T cells were maintained in RPMI1640 supplemented with FBS and antibiotics. All cell lines were obtained from Korean Cell Line Bank (https://cellbank.snu.ac.kr).

Library construction

Oligonucleotide pools synthesized by Twist Biosciences were cloned into the hU6 destination vector with Gibson assembly. The number of PCR cycles was minimized (~ 10 cycles) and extension time lengthened (5 min per kilobase) to maximize the sgRNA-reporter coupling rate [79]. The resulting hU6-sgRNA cassette was subcloned to FUW-EFS-BlastR lentiviral plasmid. The tiling-array library contained 1126 gRNAs for seven genes (RPL8, RPL15, WWTR1, FNTA, GSK3B, CCR5, and CD4) and the druggable gene library contained 6935 gRNAs targeting 2305 genes. All library vectors included the reporter target sequences that could be cleaved by the gRNA expressed in the same vector. All library constructions were done at a fold coverage of at least 500 to preserve the diversity of the sgRNAs within the library.

Viral vector transfection and virus production

The lentiviral sgRNA library plasmid was co-transfected with packaging vectors psPAX2 and pVSV-G (kind gifts from the lab of Timothy Lu) into HEK-293 T cells using PEI Max (Polyscience, Inc.). The lentiviral supernatant was collected 48 h after transfection, cleared of contaminating HEK-293 T cells, and stored at − 80 °C. To obtain appropriate lentiviral titer, A375-Cas9 cells were infected with twofold serial dilutions of lentivirus, and cells were grown in 96-well plates in the absence and presence of blasticidin. The fraction of infected cells were calculated by the relative viability of cells treated with blasticidin compared to those not treated.

Viral transduction into A375 cells

The lentiviral sgRNA library was delivered into A375-Cas9 cells stably expressing Cas9 at a multiplicity of infection (MOI) of 0.3–0.5, using 8 µg/mL polybrene. Two days later, the A375 cells were treated with blasticidin for 3 weeks until A375 cells were harvested. For the druggable gene library screens, A375 cells were treated with vemurafenib (MedChemExpress) or DMSO. Cells were passaged with fold coverage of at least 500 every 2 or 3 days.

Genomic DNA extraction and next-generation sequencing (NGS)

Genomic DNA was extracted using Accuprep genomic DNA extraction kit (Bioneer), according to the manufacturer’s instructions. The PCR amplicon spanning the sgRNA and reporter sequence was generated using primers.

F: 5′-CAAGCAGAAGACGGCATACGAGATNNNNNNGGACTATCATATGCTTACCGTAACTTG-3′ R: 5′-AATGATACGGCGACCACCGAGATCTACAC AAGCAGCGTATCCACATAGC-3′. Deep sequencing was performed using HiSeq2500 at a 100 nucleotide read length paired end. The primers used for sequencing were as follows (5′-3′): Read1 (reading the reporter sequence): CGTCAGGAATTATCCGGTGCCTAGAGAAGGTCC, Read2 (reading the sgRNA): CCGTAACTTGAAAGTATTTCGATTTCTTGGCTTTATATATCTTGTGGAAAGGACGAAACACCG, and Index: CGTCCTTTCCACAAGATATATAAAGCCAAGAAATCGAAATACTTTCAAGTTACGGTAAGCATATGATAGTCC.

6-TG treatment in HPRT1 knockout cells

Four guide RNAs targeting HPRT1 were cloned into lentiviral vectors and used to construct the lentiviruses. Each constructed vector contained HPRT1 reporter sequences targeted by each HPRT1 guide RNA to check the indel frequency rates of the reporter sequences. Each virus was transduced into A375 cells and 10 µg/mL blasticidin (Invivogen) for 1 week for selection. Then, 6-TG or DMSO was added for 2 weeks until the cells were harvested for genomic DNA extraction. The genomic locus flanking the sgRNA target sequence and the reporter sequence were PCR amplified with Q5 High-Fidelity DNA polymerase (New England Biolabs). The adapters for deep sequencing were appended to the PCR amplicons using xGen DNA library prep kits (Integrated DNA Technologies) for NGS analysis. The presence of indel mutations in the HPRT1 locus and the reporter sequence were quantified using CRISPRESSO2 [80].

Calculating actual indel frequency X and new phenotype score v

The frequencies of guide RNAs and their respective indel frequencies were counted using custom code available in github (https://github.com/tackhoonkim). The resulting read counts were subject to analysis as described below with Microsoft Excel.

As noted in Fig. 4F, the non-targeting control sgRNA frequency changed with its target cleavage efficiency. Therefore, the normalized fold change of each sgRNA frequency, FC_norm, was calculated as:

$$FC_{norm}=FC_{raw}\div FC_0$$

where FC₀ is the fold change of a theoretical control sgRNA of zero target DNA cleavage activity. FC₀ was obtained by linear regression of fold change and indel frequency of control sgRNAs (Fig. 4F).

Subsequent calculation of adjusted indel frequency X and viability metric v was done as described in the “Results” section. Briefly, the adjusted, “actual” indel frequency X was calculated as:

$$X=1-{\mathrm{FC}}_{\mathrm{norm}}+{\mathrm{FC}}_{\mathrm{norm}}\times \mathrm{indel}$$

where indel is the “observed” indel frequency of obtained from the NGS analysis.

Next, adjusted, actual viability v of cells with sgRNA target gene disruption was calculated as:

$${\varvec{v}}=\frac{{\mathrm{FC}}_{\mathrm{norm}}-{\mathrm{FC}}_{{\varvec{v}}=0}}{{\mathrm{FC}}_{{\varvec{v}}=1}-{\mathrm{FC}}_{{\varvec{v}}=0}}=\frac{{\mathrm{FC}}_{\mathrm{norm}}-\left(1-X\right)}{{\mathrm{FC}}_{{\varvec{v}}=1}-\left(1-X\right)}$$

where FC_v=0 is a theoretical FC value given zero viability upon gene disruption and indel frequency X and is equal to 1 − X. FC_v=1 is a theoretical FC value given no change in viability (v = 1) upon gene disruption and was obtained as an equation for linear regression of FC_norm and X for non-targeting control sgRNAs (Fig. 4F).

MAGEcK analysis was done as previously described [43]. The sgRNA count data for the day 21 sample for input data were generated by multiplying either FC or v to the initial sgRNA count in the plasmid. Subsequent procedures were followed by the user instruction (https://sourceforge.net/p/mageck/wiki/Home/). DrugZ [81] was performed following user instruction (https://github.com/hart-lab/drugz).

To apply our approach to previous works, CRISPR screening results as raw sgRNA counts were obtained from previous studies by Doench and colleagues [15] and Parts and colleagues [25], for identical analysis to obtain X and v. The sgRNA activities obtained in Fig. 5A are directly applied to the data. The fold changes of replicates were averaged. JACKS analysis was performed as instructed in https://github.com/felicityallen/JACKS.

GFP competition assay

The gRNAs for each hit gene were cloned into GFP-expressing lentiviral vectors FUW-EFS-GFP. Five days after virus transduction into A375 cells stably expressing Cas9, the fraction of GFP-positive cells was measured using BD Accuri C6Plus as the initial fraction of knockout cells. The same quantification of GFP-positive cell fraction was done 7 and 14 days after the initial flow cytometry experiment. The relative viability of GFP-positive knockout cells at “day x” relative to GFP-negative wild-type cells was calculated as below:

$$\mathrm{Relative\;viability}=\frac{{}^{{\mathrm{GFP}}_{\mathrm{day }x}^{+}}\!\left/ \!{}_{{\mathrm{GFP}}_{\mathrm{day }x}^{-}}\right.}{{}^{{\mathrm{GFP}}_{\mathrm{initial}}^{+}}\!\left/ \!{}_{{\mathrm{GFP}}_{\mathrm{initial}}^{-}}\right.}$$

Pathway enrichment analysis

After discovering hits using MAGEcK, we performed pathway analysis using the PANTHER overrepresentation test with selected genes (genes from each method, score < 0.09).

Cell viability assay with chemical inhibitors

A375 and A375-VR cells (1000 cells/well) and Hs294T cells (2000 cells/well) were seeded in 96-well plates. The next day, inhibitors and vemurafenib were treated at indicated concentrations. The WST assay was performed using EZ-cytox reagent (DogenBio) diluted in RPMI1640. The absorbance at the 450-nm wavelength was measured using Wallac EnVision (Perkin Elmer) 3 days after drug treatment. Drug synergies were evaluated by performing SynergyFinder (https://synergyfinder.org) [82].

Statistical analysis

Data are presented as mean ± s.d. Correlations were evaluated using Pearson’s correlation coefficient r analyzed by GraphPad Prism. One-sample Wilcoxon test and two-tailed test were used to calculate the p value.

Availability of data and materials

All data generated or analyzed during this study are included in this published article, its supplementary information files, and publicly available repositories. The NGS data for CRISPR screens in Figs. 2 and 5 are available in NCBI SRA (PRJNA856494). The numerical data for all the plots in the main and supplementary figures are in FigShare (https://doi.org/10.6084/m9.figshare.21971246). The codes used for the NGS or sgRNA count analysis are provided in the links below:

sgRNA counter and indel detector (custom code): https://github.com/tackhoonkim (Zenodo repository DOI: https://doi.org/10.5281/zenodo.7575490).

MAGeCK: https://sourceforge.net/p/mageck/wiki/Home/

JACKS: https://github.com/felicityallen/JACKS

DrugZ: https://github.com/hart-lab/drugz

Synergy Finder: https://synergyfinder.org

Abbreviations

NNMT:: Nicotinamide N-methyltransferase
LDHB:: Lactate dehydrogenase B
UMI:: Unique molecular identifier
6-TG:: 6-Thioguanine
PTBP1:: Polypyrimidine tract-binding protein 1
RIPK1:: Receptor-interacting serine/threonine-protein kinase 1
IGF1R:: Insulin-like growth factor 1 receptor
PSEN2:: Presenilin-2
TBK1:: TANK-binding kinase 1
FGF2:: Fibroblast growth factor 2
CjCas9:: Campylobacter jejuni Cas9
ScCas9:: Streptococcus canis Cas9
NCBI SRA:: National Center for Biotechnology Information Sequence Read Archive

References

Shalem O, Sanjana NE, Hartenian E, Shi X, Scott DA, Mikkelsen TS, et al. Genome-scale CRISPR-Cas9 knockout screening in human cells. Science. 2014;343(6166):84–7.
Article CAS PubMed Google Scholar
Manguso RT, Pope HW, Zimmer MD, Brown FD, Yates KB, Miller BC, et al. In vivo CRISPR screening identifies Ptpn2 as a cancer immunotherapy target. Nature. 2017;547(7664):413–8.
Article CAS PubMed PubMed Central Google Scholar
Wang D, Prager BC, Gimple RC, Aguilar B, Alizadeh D, Tang H, et al. CRISPR screening of CAR T cells and cancer stem cells reveals critical dependencies for cell-based therapies. Cancer Discov. 2021;11(5):1192–211.
Article CAS PubMed Google Scholar
Wang X, Tokheim C, Gu SS, Wang B, Tang Q, Li Y, et al. In vivo CRISPR screens identify the E3 ligase Cop1 as a modulator of macrophage infiltration and cancer immunotherapy target. Cell. 2021;184(21):5357-74.e22.
Article CAS PubMed PubMed Central Google Scholar
Dong MB, Wang G, Chow RD, Ye L, Zhu L, Dai X, et al. Systematic immunotherapy target discovery using genome-scale in vivo CRISPR screens in CD8 T cells. Cell. 2019;178(5):1189-204.e23.
Article CAS PubMed PubMed Central Google Scholar
Zhang HL, Hu BX, Li ZL, Du T, Shan JL, Ye ZP, et al. PKCβII phosphorylates ACSL4 to amplify lipid peroxidation to induce ferroptosis. Nat Cell Biol. 2022;24(1):88–98.
Article CAS PubMed Google Scholar
Grevet JD, Lan X, Hamagami N, Edwards CR, Sankaranarayanan L, Ji X, et al. Domain-focused CRISPR screen identifies HRI as a fetal hemoglobin regulator in human erythroid cells. Science. 2018;361(6399):285–90.
Article CAS PubMed PubMed Central Google Scholar
Hsu PD, Scott DA, Weinstein JA, Ran FA, Konermann S, Agarwala V, et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat Biotechnol. 2013;31(9):827–32.
Article CAS PubMed PubMed Central Google Scholar
Kim E, Hart T. Improved analysis of CRISPR fitness screens and reduced off-target effects with the BAGEL2 gene essentiality classifier. Genome Medicine. 2021;13(1):2.
Article CAS PubMed PubMed Central Google Scholar
Liang S-Q, Liu P, Smith JL, Mintzer E, Maitland S, Dong X, et al. Genome-wide detection of CRISPR editing in vivo using GUIDE-tag. Nat Commun. 2022;13(1):437.
Article CAS PubMed PubMed Central Google Scholar
Tycko J, Wainberg M, Marinov GK, Ursu O, Hess GT, Ego BK, et al. Mitigation of off-target toxicity in CRISPR-Cas9 screens for essential non-coding elements. Nat Commun. 2019;10(1):4063.
Article PubMed PubMed Central Google Scholar
Beroukhim R, Mermel CH, Porter D, Wei G, Raychaudhuri S, Donovan J, et al. The landscape of somatic copy-number alteration across human cancers. Nature. 2010;463(7283):899–905.
Article CAS PubMed PubMed Central Google Scholar
Gonçalves E, Thomas M, Behan FM, Picco G, Pacini C, Allen F, et al. Minimal genome-wide human CRISPR-Cas9 library. Genome Biol. 2021;22(1):40.
Article PubMed PubMed Central Google Scholar
Bennett EP, Petersen BL, Johansen IE, Niu Y, Yang Z, Chamberlain CA, et al. INDEL detection, the ‘Achilles heel’ of precise genome editing: a survey of methods for accurate profiling of gene editing induced indels. Nucleic Acids Res. 2020;48(21):11958–81.
Article CAS PubMed PubMed Central Google Scholar
Doench JG, Fusi N, Sullender M, Hegde M, Vaimberg EW, Donovan KF, et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9. Nat Biotechnol. 2016;34(2):184–91.
Article CAS PubMed PubMed Central Google Scholar
Schmierer B, Botla SK, Zhang J, Turunen M, Kivioja T, Taipale J. CRISPR/Cas9 screening using unique molecular identifiers. Mol Syst Biol. 2017;13(10):945.
Article PubMed PubMed Central Google Scholar
Michlits G, Hubmann M, Wu S-H, Vainorius G, Budusan E, Zhuk S, et al. CRISPR-UMI: single-cell lineage tracing of pooled CRISPR–Cas9 screens. Nat Methods. 2017;14(12):1191–7.
Article CAS PubMed Google Scholar
Zhu S, Cao Z, Liu Z, He Y, Wang Y, Yuan P, et al. Guide RNAs with embedded barcodes boost CRISPR-pooled screens. Genome Biol. 2019;20(1):20.
Article PubMed PubMed Central Google Scholar
Diehl V, Wegner M, Grumati P, Husnjak K, Schaubeck S, Gubas A, et al. Minimized combinatorial CRISPR screens identify genetic interactions in autophagy. Nucleic Acids Res. 2021;49(10):5684–704.
Article CAS PubMed PubMed Central Google Scholar
Kim E, Koo T, Park SW, Kim D, Kim K, Cho H-Y, et al. In vivo genome editing with a small Cas9 orthologue derived from Campylobacter jejuni. Nat Commun. 2017;8(1):14500.
Article CAS PubMed PubMed Central Google Scholar
Hu JH, Miller SM, Geurts MH, Tang W, Chen L, Sun N, et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature. 2018;556(7699):57–63.
Article CAS PubMed PubMed Central Google Scholar
Lee JK, Jeong E, Lee J, Jung M, Shin E, Kim Y-H, et al. Directed evolution of CRISPR-Cas9 to increase its specificity. Nature Commun. 2018;9(1):3048.
Article Google Scholar
Slaymaker IM, Gao L, Zetsche B, Scott DA, Yan WX, Zhang F. Rationally engineered Cas9 nucleases with improved specificity. Science. 2016;351(6268):84–8.
Article CAS PubMed Google Scholar
Meyers RM, Bryan JG, McFarland JM, Weir BA, Sizemore AE, Xu H, et al. Computational correction of copy number effect improves specificity of CRISPR-Cas9 essentiality screens in cancer cells. Nat Genet. 2017;49(12):1779–84.
Article CAS PubMed PubMed Central Google Scholar
Allen F, Behan F, Khodak A, Iorio F, Yusa K, Garnett M, et al. JACKS: joint analysis of CRISPR/Cas9 knockout screens. Genome Res. 2019;29(3):464–71.
Article CAS PubMed PubMed Central Google Scholar
Kim HK, Song M, Lee J, Menon AV, Jung S, Kang YM, et al. In vivo high-throughput profiling of CRISPR-Cpf1 activity. Nat Methods. 2017;14(2):153–9.
Article CAS PubMed Google Scholar
Hegde M, Strand C, Hanna RE, Doench JG. Uncoupling of sgRNAs from their associated barcodes during PCR amplification of combinatorial CRISPR screens. PLoS ONE. 2018;13(5):e0197547.
Article PubMed PubMed Central Google Scholar
Mou H, Smith JL, Peng L, Yin H, Moore J, Zhang XO, et al. CRISPR/Cas9-mediated genome editing induces exon skipping by alternative splicing or exon deletion. Genome Biol. 2017;18(1):108.
Article PubMed PubMed Central Google Scholar
Aubrecht J, Goad ME, Schiestl RH. Tissue specific toxicities of the anticancer drug 6-thioguanine is dependent on the Hprt status in transgenic mice. J Pharmacol Exp Ther. 1997;282(2):1102–8.
CAS PubMed Google Scholar
Liu M, Zhang W, Xin C, Yin J, Shang Y, Ai C, et al. Global detection of DNA repair outcomes induced by CRISPR-Cas9. Nucleic Acids Res. 2021;49(15):8732–42.
Article CAS PubMed PubMed Central Google Scholar
Haapaniemi E, Botla S, Persson J, Schmierer B, Taipale J. CRISPR–Cas9 genome editing induces a p53-mediated DNA damage response. Nat Med. 2018;24(7):927–30.
Article CAS PubMed Google Scholar
Sanson KR, Hanna RE, Hegde M, Donovan KF, Strand C, Sullender ME, et al. Optimized libraries for CRISPR-Cas9 genetic screens with multiple modalities. Nat Commun. 2018;9(1):5416.
Article CAS PubMed PubMed Central Google Scholar
Finan C, Gaulton A, Kruger FA, Lumbers RT, Shah T, Engmann J, et al. The druggable genome and support for target identification and validation in drug development. Sci Transl Med. 2017;9(383):eaag1166.
Article PubMed PubMed Central Google Scholar
Perez AR, Pritykin Y, Vidigal JA, Chhangawala S, Zamparo L, Leslie CS, et al. GuideScan software for improved single and paired CRISPR guide RNA design. Nat Biotechnol. 2017;35(4):347–9.
Article CAS PubMed PubMed Central Google Scholar
Hart T, Chandrashekhar M, Aregger M, Steinhart Z, Brown KR, MacLeod G, et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell. 2015;163(6):1515–26.
Article CAS PubMed Google Scholar
Tsherniak A, Vazquez F, Montgomery PG, Weir BA, Kryukov G, Cowley GS, et al. Defining a cancer dependency map. Cell. 2017;170(3):564-76.e16.
Article CAS PubMed PubMed Central Google Scholar
Cho YC, Park JE, Park BC, Kim JH, Jeong DG, Park SG, et al. Cell cycle-dependent Cdc25C phosphatase determines cell survival by regulating apoptosis signal-regulating kinase 1. Cell Death Differ. 2015;22(10):1605–17.
Article CAS PubMed PubMed Central Google Scholar
Liu K, Zheng M, Lu R, Du J, Zhao Q, Li Z, et al. The role of CDC25C in cell cycle regulation and clinical cancer therapy: a systematic review. Cancer Cell Int. 2020;20(1):213.
Article CAS PubMed PubMed Central Google Scholar
Wang T, Yu H, Hughes NW, Liu B, Kendirli A, Klein K, et al. Gene essentiality profiling reveals gene networks and synthetic lethal interactions with oncogenic Ras. Cell. 2017;168(5):890-903.e15.
Article CAS PubMed PubMed Central Google Scholar
Wang T, Birsoy K, Hughes NW, Krupczak KM, Post Y, Wei JJ, et al. Identification and characterization of essential genes in the human genome. Science. 2015;350(6264):1096–101.
Article CAS PubMed PubMed Central Google Scholar
Ong SH, Li Y, Koike-Yusa H, Yusa K. Optimised metrics for CRISPR-KO screens with second-generation gRNA libraries. Sci Rep. 2017;7(1):7384.
Article PubMed PubMed Central Google Scholar
Bollag G, Tsai J, Zhang J, Zhang C, Ibrahim P, Nolop K, et al. Vemurafenib: the first drug approved for BRAF-mutant cancer. Nat Rev Drug Discov. 2012;11(11):873–86.
Article CAS PubMed Google Scholar
Li W, Xu H, Xiao T, Cong L, Love MI, Zhang F, et al. MAGeCK enables robust identification of essential genes from genome-scale CRISPR/Cas9 knockout screens. Genome Biol. 2014;15(12):554.
Article PubMed PubMed Central Google Scholar
Mi H, Ebert D, Muruganujan A, Mills C, Albou L-P, Mushayamaha T, et al. PANTHER version 16: a revised family classification, tree-based classification tool, enhancer regions and extensive API. Nucleic Acids Res. 2020;49(D1):D394–403.
Article PubMed Central Google Scholar
Mi H, Muruganujan A, Casagrande JT, Thomas PD. Large-scale gene function analysis with the PANTHER classification system. Nat Protoc. 2013;8(8):1551–66.
Article PubMed PubMed Central Google Scholar
Caporali S, Alvino E, Lacal PM, Levati L, Giurato G, Memoli D, et al. Targeting the PI3K/AKT/mTOR pathway overcomes the stimulating effect of dabrafenib on the invasive behavior of melanoma cells with acquired resistance to the BRAF inhibitor. Int J Oncol. 2016;49(3):1164–74.
Article CAS PubMed Google Scholar
Huynh C, Poliseno L, Segura MF, Medicherla R, Haimovic A, Menendez S, et al. The novel gamma secretase inhibitor RO4929097 reduces the tumor initiating potential of melanoma. PLoS ONE. 2011;6(9):e25264.
Article CAS PubMed PubMed Central Google Scholar
Irvine M, Stewart A, Pedersen B, Boyd S, Kefford R, Rizos H. Oncogenic PI3K/AKT promotes the step-wise evolution of combination BRAF/MEK inhibitor resistance in melanoma. Oncogenesis. 2018;7(9):72.
Article PubMed PubMed Central Google Scholar
Krepler C, Xiao M, Samanta M, Vultur A, Chen HY, Brafford P, et al. Targeting Notch enhances the efficacy of ERK inhibitors in BRAF-V600E melanoma. Oncotarget. 2016;7(44):71211–22.
Article PubMed PubMed Central Google Scholar
Sweetlove M, Wrightson E, Kolekar S, Rewcastle GW, Baguley BC, Shepherd PR, et al. Inhibitors of pan-PI3K signaling synergize with BRAF or MEK inhibitors to prevent BRAF-mutant melanoma cell growth. Front Oncol. 2015;5:135.
Article PubMed PubMed Central Google Scholar
Wang Z, Li Y, Ahmad A, Azmi AS, Banerjee S, Kong D, et al. Targeting Notch signaling pathway to overcome drug resistance for cancer therapy. Biochim Biophys Acta. 2010;1806(2):258–67.
CAS PubMed PubMed Central Google Scholar
Patel H, Mishra R, Yacoub N, Alanazi S, Kilroy MK, Garrett JT. IGF1R/IR mediates resistance to BRAF and MEK inhibitors in BRAF-mutant melanoma. Cancers (Basel). 2021;13(22):5863.
Article CAS PubMed Google Scholar
Lei FX, Jin L, Liu XY, Lai F, Yan XG, Farrelly M, et al. RIP1 protects melanoma cells from apoptosis induced by BRAF/MEK inhibitors. Cell Death Dis. 2018;9(6):679.
Article PubMed PubMed Central Google Scholar
Zhu G, Yi X, Haferkamp S, Hesbacher S, Li C, Goebeler M, et al. Combination with γ-secretase inhibitor prolongs treatment efficacy of BRAF inhibitor in BRAF-mutated melanoma cells. Cancer Lett. 2016;376(1):43–52.
Article CAS PubMed Google Scholar
Colic M, Wang G, Zimmermann M, Mascall K, McLaughlin M, Bertolet L, et al. Identifying chemogenetic interactions from CRISPR screens with drugZ. Genome Med. 2019;11(1):52.
Article PubMed PubMed Central Google Scholar
Eskiocak B, McMillan EA, Mendiratta S, Kollipara RK, Zhang H, Humphries CG, et al. Biomarker accessible and chemically addressable mechanistic subtypes of BRAF melanoma. Cancer Discov. 2017;7(8):832–51.
Article CAS PubMed PubMed Central Google Scholar
Vu HL, Aplin AE. Targeting TBK1 inhibits migration and resistance to MEK inhibitors in mutant NRAS melanoma. MCR. 2014;12(10):1509–19.
Article CAS PubMed Google Scholar
Liu F, Cao J, Wu J, Sullivan K, Shen J, Ryu B, et al. Stat3-targeted therapies overcome the acquired resistance to vemurafenib in melanomas. J Invest Dermatol. 2013;133(8):2041–9.
Article CAS PubMed PubMed Central Google Scholar
Dong L, Li Y, Cao J, Liu F, Pier E, Chen J, et al. FGF2 regulates melanocytes viability through the STAT3-transactivated PAX3 transcription. Cell Death Differ. 2012;19(4):616–22.
Article CAS PubMed Google Scholar
Metzner T, Bedeir A, Held G, Peter-Vörösmarty B, Ghassemi S, Heinzle C, et al. Fibroblast growth factor receptors as therapeutic targets in human melanoma: synergism with BRAF inhibition. J Invest Dermatol. 2011;131(10):2087–95.
Article CAS PubMed PubMed Central Google Scholar
Eckert MA, Coscia F, Chryplewicz A, Chang JW, Hernandez KM, Pan S, et al. Proteomics reveals NNMT as a master metabolic regulator of cancer-associated fibroblasts. Nature. 2019;569(7758):723–8.
Article CAS PubMed PubMed Central Google Scholar
Ulanovskaya OA, Zuhl AM, Cravatt BF. NNMT promotes epigenetic remodeling in cancer by creating a metabolic methylation sink. Nat Chem Biol. 2013;9(5):300–6.
Article CAS PubMed PubMed Central Google Scholar
Wang Y, Zeng J, Wu W, Xie S, Yu H, Li G, et al. Nicotinamide N-methyltransferase enhances chemoresistance in breast cancer through SIRT1 protein stabilization. Breast Cancer Res. 2019;21(1):64.
Article PubMed PubMed Central Google Scholar
Brisson L, Banski P, Sboarina M, Dethier C, Danhier P, Fontenille MJ, et al. Lactate dehydrogenase B controls lysosome activity and autophagy in cancer. Cancer Cell. 2016;30(3):418–31.
Article CAS PubMed Google Scholar
Cheng A, Zhang P, Wang B, Yang D, Duan X, Jiang Y, et al. Aurora-A mediated phosphorylation of LDHB promotes glycolysis and tumor progression by relieving the substrate-inhibition effect. Nat Commun. 2019;10(1):5566.
Article CAS PubMed PubMed Central Google Scholar
Cifdaloz M, Osterloh L, Graña O, Riveiro-Falkenbach E, Ximénez-Embún P, Muñoz J, et al. Systems analysis identifies melanoma-enriched pro-oncogenic networks controlled by the RNA binding protein CELF1. Nat Commun. 2017;8(1):2249.
Article PubMed PubMed Central Google Scholar
Wang X, Li Y, Fan Y, Yu X, Mao X, Jin F. PTBP1 promotes the growth of breast cancer cells through the PTEN/Akt pathway and autophagy. J Cell Physiol. 2018;233(11):8930–9.
Article CAS PubMed PubMed Central Google Scholar
Wang ZN, Liu D, Yin B, Ju WY, Qiu HZ, Xiao Y, et al. High expression of PTBP1 promote invasion of colorectal cancer by alternative splicing of cortactin. Oncotarget. 2017;8(22):36185–202.
Article PubMed PubMed Central Google Scholar
Konieczkowski DJ, Johannessen CM, Abudayyeh O, Kim JW, Cooper ZA, Piris A, et al. A melanoma cell state distinction influences sensitivity to MAPK pathway inhibitors. Cancer Discov. 2014;4(7):816–27.
Article CAS PubMed PubMed Central Google Scholar
Chatterjee P, Jakimo N, Lee J, Amrani N, Rodriguez T, Koseki SRT, et al. An engineered ScCas9 with broad PAM range and high specificity and activity. Nat Biotechnol. 2020;38(10):1154–8.
Article CAS PubMed Google Scholar
Beck D, Niessner H, Smalley KS, Flaherty K, Paraiso KH, Busch C, et al. Vemurafenib potently induces endoplasmic reticulum stress-mediated apoptosis in BRAFV600E melanoma cells. Sci Signal. 2013;6(260):ra7.
Article PubMed PubMed Central Google Scholar
Wang Z, Chen W. Emerging roles of SIRT1 in cancer drug resistance. Genes Cancer. 2013;4(3–4):82–90.
Article CAS PubMed PubMed Central Google Scholar
Yousafzai NA, Zhou Q, Xu W, Shi Q, Xu J, Feng L, et al. SIRT1 deacetylated and stabilized XRCC1 to promote chemoresistance in lung cancer. Cell Death Dis. 2019;10(5):363.
Article PubMed PubMed Central Google Scholar
Sun W, Zhang X, Ding X, Li H, Geng M, Xie Z, et al. Lactate dehydrogenase B is associated with the response to neoadjuvant chemotherapy in oral squamous cell carcinoma. PLoS ONE. 2015;10(5):e0125976.
Article PubMed PubMed Central Google Scholar
Jensen KT, Fløe L, Petersen TS, Huang J, Xu F, Bolund L, et al. Chromatin accessibility and guide sequence secondary structure affect CRISPR-Cas9 gene editing efficiency. FEBS Lett. 2017;591(13):1892–901.
Article CAS PubMed Google Scholar
Allen F, Crepaldi L, Alsinet C, Strong AJ, Kleshchevnikov V, De Angeli P, et al. Predicting the mutations generated by repair of Cas9-induced double-strand breaks. Nat Biotechnol. 2018.
Lee JK, Jeong E, Lee J, Jung M, Shin E, Kim YH, et al. Directed evolution of CRISPR-Cas9 to increase its specificity. Nat Commun. 2018;9(1):3048.
Article PubMed PubMed Central Google Scholar
He W, Zhang L, Villarreal OD, Fu R, Bedford E, Dou J, et al. De novo identification of essential protein domains from CRISPR-Cas9 tiling-sgRNA knockout screens. Nat Commun. 2019;10(1):4541.
Article CAS PubMed PubMed Central Google Scholar
Hanna RE, Doench JG. A case of mistaken identity. Nat Biotechnol. 2018;36(9):802–4.
Article CAS PubMed Google Scholar
Clement K, Rees H, Canver MC, Gehrke JM, Farouni R, Hsu JY, et al. CRISPResso2 provides accurate and rapid genome editing sequence analysis. Nat Biotechnol. 2019;37(3):224–6.
Article CAS PubMed PubMed Central Google Scholar
Colic M, Wang G, Zimmermann M, Mascall K, McLaughlin M, Bertolet L, et al. Identifying chemogenetic interactions from CRISPR screens with drugZ. Genome Medicine. 2019;11(1):52.
Article PubMed PubMed Central Google Scholar
Ianevski A, He L, Aittokallio T, Tang J. SynergyFinder: a web application for analyzing drug combination dose-response matrix data. Bioinformatics (Oxford, England). 2017;33(15):2413–5.
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Timothy Lu and his lab for providing the plasmids.

Funding

This work was supported by the National Research Foundation of Korea (2021R1A2C1093499 to T.K.) and the Korea Institute of Science and Technology Institutional Programs (2E31624 to T.K.).

Author information

Authors and Affiliations

Medicinal Materials Research Center, Korea Institute of Science and Technology, 5 Hwarangro-14-Gil, SeongbukGu, Seoul, 02792, Republic of Korea
Byung-Sun Park, Heeju Jeon & Tackhoon Kim
Department of Biological Sciences, Korea University, 145 AnamRo, SeongbukGu, Seoul, 02841, Republic of Korea
Byung-Sun Park, Heeju Jeon, Sung-Gil Chi & Tackhoon Kim
Division of Bio-Medical Science and Technology, Korea University of Science and Technology, 217 GajeongRo YuseongGu, Daejeon, 34113, Republic of Korea
Tackhoon Kim

Authors

Byung-Sun Park
View author publications
You can also search for this author in PubMed Google Scholar
Heeju Jeon
View author publications
You can also search for this author in PubMed Google Scholar
Sung-Gil Chi
View author publications
You can also search for this author in PubMed Google Scholar
Tackhoon Kim
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

BP and TK conceived the concept. BP, HJ, and TK performed the experiments and analyzed the data. BP, SC, and TK wrote the manuscript. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Tackhoon Kim.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

Figure S1. Data quality control of Tiling array library screens. Figure S2. Indel frequency adds significant bias to log fold change (LFC) of gRNA frequency. Figure S3. Correlation of indel frequency and conventional FC reveals bias in phenotype score. Figure S4. Quality control of druggable gene CRISPR screen data in Figures 5 and 6. Figure S5. Comparison of our method to previous approaches. Figure S6. Effective synergistic targets with vemurafenib treatment using DrugZ with v as input. Figure S7. Confirmation of ablation of target genes by T7 endonuclease assay for Figure 6H. Figure S8. Dose response matrix data for Figures 6I-K. Figure S9. Drug synergy data for A375 VR cells. Figure S10. Drug synergy data for Hs294T cells.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Park, BS., Jeon, H., Chi, SG. et al. Efficient prioritization of CRISPR screen hits by accounting for targeting efficiency of guide RNA. BMC Biol 21, 45 (2023). https://doi.org/10.1186/s12915-023-01536-y

Download citation

Received: 19 October 2022
Accepted: 03 February 2023
Published: 24 February 2023
DOI: https://doi.org/10.1186/s12915-023-01536-y

Efficient prioritization of CRISPR screen hits by accounting for targeting efficiency of guide RNA

Abstract

Background

Results

Conclusions

Background

Results

Indel generation efficiency of sgRNA is the dominant factor influencing the changes in phenotype

Mutations in the reporter sequence are coupled with those in the endogenous target gene in the same cell

Establishing a model for inferring viability upon gene perturbation from the changes in sgRNA frequency

The new analysis method reduces bias in the phenotype score of sgRNAs in screens with custom sgRNA library

The new analysis method identified NNMT, LDHB, and PTBP1 as vemurafenib resistance genes

Discussion

Conclusions

Methods

Cell culture

Library construction

Viral vector transfection and virus production

Viral transduction into A375 cells

Genomic DNA extraction and next-generation sequencing (NGS)

6-TG treatment in HPRT1 knockout cells

Calculating actual indel frequency X and new phenotype score v

GFP competition assay

Pathway enrichment analysis

Cell viability assay with chemical inhibitors

Statistical analysis

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher's Note

Supplementary Information

Additional file 1:

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Biology

Contact us