1. Shiyou Zhu
  2. Wensheng Wei (corresponding author)
  1. School of Life Sciences, Peking University, China
  2. Peking University, China

Since the human genome sequence was completed in 2003, genome-wide screening has become a popular method for quickly associating specific genes with their roles in cells. More recently, the CRISPR-Cas9 system has become the dominant tool for genome editing (Jinek et al., 2012; Cong et al., 2013; Mali et al., 2013), and it has subsequently been adapted to make highly effective genetic screening platforms (Shalem et al., 2014; Zhou et al., 2014).

The CRISPR-Cas9 system is derived from the methods used by certain bacteria to identify and cut up foreign genetic material (Barrangou et al., 2007). To edit the genome, specially designed RNA molecules guide a nuclease enzyme called Cas9 to the location of interest in the DNA sequence; the Cas9 enzyme then cuts the DNA at this position. A mutant form of Cas9 that is unable to cut DNA can also be used together with libraries of single guide RNAs (sgRNAs) that target regions around transcription start sites in the genome. By allowing researchers to either repress or activate gene expression – techniques that are known as CRISPR interference (CRISPRi) and CRISPR activation (CRISPRa), respectively – these sgRNA libraries make it possible to carry out powerful genetic screens in mammalian cells (Gilbert et al., 2014; Konermann et al., 2015). Now, in eLife, Jonathan Weissman and colleagues at the University of California, San Francisco – including Max Horlbeck as first author – report that a new algorithm can predict the activity of sgRNAs more accurately than existing algorithms (Horlbeck et al., 2016a).

Many factors affect the ability of sgRNAs to activate or repress genes, including the sequence, length and secondary structure of the sgRNA (Doench et al., 2014; Xu et al., 2015). Furthermore, the DNA in mammalian cells (and also in other eukaryotic cells) is packaged into structures called nucleosomes, which make it difficult for the Cas9 enzyme to access the DNA (Hinz et al., 2015; Horlbeck et al., 2016b; Isaac et al., 2016). This is particularly important for CRISPRi and CRISPRa screens because the mutant Cas9 enzyme must stay bound to the DNA for extended periods of time. Horlbeck et al. therefore optimized the design of their sgRNAs to target DNA regions that were not packaged in nucleosomes and thus were more accessible to mutant Cas9.
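The idea of favoring nucleosome-free target sites can be illustrated with a minimal Python sketch. This is not the algorithm reported by Horlbeck et al.; it is a simple threshold filter under stated assumptions: the `Candidate` type, the per-base `signal` list (occupancy values between 0 and 1), the window size and the `max_occupancy` cutoff are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A hypothetical sgRNA candidate; position is an index into the occupancy signal."""
    sgrna_id: str
    position: int


def mean_occupancy(signal, center, half_window=10):
    """Average nucleosome occupancy in a window centered on the target site."""
    if not 0 <= center < len(signal):
        raise ValueError("target position lies outside the occupancy signal")
    lo = max(0, center - half_window)
    hi = min(len(signal), center + half_window + 1)
    window = signal[lo:hi]
    return sum(window) / len(window)


def accessible(candidates, signal, max_occupancy=0.5):
    """Keep candidates whose target window has low average occupancy (assumed cutoff)."""
    return [
        c for c in candidates
        if mean_occupancy(signal, c.position) <= max_occupancy
    ]


# Toy signal: an open region followed by a nucleosome-covered region.
signal = [0.1] * 50 + [0.9] * 50
candidates = [Candidate("sg_open", 20), Candidate("sg_covered", 80)]
kept = accessible(candidates, signal)
```

In this toy example only the candidate in the open region passes the filter. A real design tool would more plausibly use occupancy as one feature among several rather than as a hard cutoff.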

To improve the CRISPRi and CRISPRa libraries that they had made previously (Gilbert et al., 2014), Horlbeck et al. analyzed data from 30 CRISPRi screens and 9 CRISPRa screens and defined “activity scores” for every sgRNA relative to the sgRNA with the strongest activity for each gene. They then used this information to make new CRISPRi and CRISPRa libraries that contained the ten most active sgRNAs for each gene.
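The scoring and selection steps described above might be sketched as follows. This is an illustrative sketch only: it assumes that "relative to the sgRNA with the strongest activity" means dividing each sgRNA's phenotype by the largest phenotype for the same gene, and the function names and the input format (a dictionary keyed by gene and sgRNA identifier) are hypothetical.

```python
from collections import defaultdict


def activity_scores(phenotypes):
    """Scale each sgRNA's phenotype by the strongest sgRNA for the same gene.

    phenotypes maps (gene, sgrna_id) -> phenotype strength, larger = stronger.
    Genes whose strongest phenotype is not positive are skipped (assumption).
    """
    by_gene = defaultdict(dict)
    for (gene, sgrna_id), value in phenotypes.items():
        by_gene[gene][sgrna_id] = value

    scores = {}
    for gene, values in by_gene.items():
        strongest = max(values.values())
        if strongest <= 0:
            continue  # no active sgRNA to normalize against
        scores[gene] = {s: v / strongest for s, v in values.items()}
    return scores


def top_sgrnas(scores, per_gene=10):
    """Return the per_gene highest-scoring sgRNA ids for every gene."""
    return {
        gene: sorted(s, key=s.get, reverse=True)[:per_gene]
        for gene, s in scores.items()
    }


# Toy data with made-up phenotype values.
phenotypes = {
    ("GENE_A", "a1"): 2.0,
    ("GENE_A", "a2"): 1.0,
    ("GENE_A", "a3"): 0.5,
    ("GENE_B", "b1"): 4.0,
    ("GENE_B", "b2"): 1.0,
}
scores = activity_scores(phenotypes)
library = top_sgrnas(scores, per_gene=2)
```

Calling `top_sgrnas(scores, per_gene=10)` would correspond to a library of ten sgRNAs per gene, and `per_gene=5` to a half-sized library.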

The new human CRISPRi library was used to screen chronic myeloid leukemia K562 cells to identify genes that are essential for cell growth. Impressively, this library identified about 10% more essential genes than the original CRISPRi library (Gilbert et al., 2014). Furthermore, a half-sized version of the new human CRISPRi library (with only the top five sgRNAs per gene) performed similarly to the full-sized version. This is reassuring because smaller libraries are easier to construct and use in screens. Horlbeck et al. also demonstrated that the new human CRISPRa library outperformed the original one.

Horlbeck et al. found that, when used with the mutant form of Cas9, none of the CRISPRi libraries had toxic side effects like those observed with other approaches that use the active enzyme (Wang et al., 2015). This makes it possible to effectively identify genes, even if they show only slight differences in expression compared to negative controls.

To summarize, this study established an effective algorithm to predict the activity of sgRNAs based on the location of nucleosomes in the genome. Horlbeck et al. used this algorithm to generate new CRISPRi and CRISPRa libraries with much improved performance in genetic screens in humans and mice. It remains to be seen if the algorithm could be used to optimize other types of CRISPR screens, especially ones that use the normal Cas9 enzyme.

Article and author information

Author details

  1. Shiyou Zhu

    1. Biodynamic Optical Imaging Center, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
    2. Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, Peking University, Beijing, China
    Competing interests
    The authors declare that no competing interests exist.
  2. Wensheng Wei

    1. Biodynamic Optical Imaging Center, State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
    2. Beijing Advanced Innovation Center for Genomics, Peking University, Beijing, China
    3. Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
    For correspondence
    wswei@pku.edu.cn
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0002-8053-2423

Publication history

  1. Version of Record published: November 3, 2016 (version 1)

Copyright

© 2016, Zhu et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Cite this article

  1. Shiyou Zhu
  2. Wensheng Wei
(2016)
Genetic Screening: Making better CRISPR libraries
eLife 5:e21863.
https://doi.org/10.7554/eLife.21863
