Team:Warwick/Software

iGEM Warwick 2016 - Page

Abstract

We have created a software tool that uses a DNA sequence input, and designs a list of gRNA sequences as output. These gRNAs will bind to their target DNA with a strength specified by the user, as a percentage of the maximum binding strength of the dCas9 protein to that DNA site. It also calculates the approximate structure of the gRNA and displays first the ones which disturb the Cas9 handle the least. To achieve these outcomes, it incorporates open source code from RNAfold [1] and code developed by Iman Farasat and Howard M. Salis [2], generously provided by them. The aim is to use this tool for de novo design of CRISPR-repressed (CRISPRi) and CRISPR activated (CRISPRa) promoters, as there is currently no method available for designing promoter sequences of pre-defined repression/ activation strengths.

Fig. 1: The fold change for the dCas9 [3] is plotted against its binding energy as calculated by the Cas9 Calculator.

1. Calculation of the binding energy

The Cas9Calculator, written by Iman Farasat and Howard M. Salis and published in their latest paper [2], is used for all calculations determining the strength of gRNA:DNA binding. For its creation they analysed datasets from different experiments and took into account the effects of supercoiling on adjacent DNA sites, different PAM sites, gRNA:DNA site mismatches, Cas9 and gRNA expression levels, organisms and growth conditions. This allowed them to predict approximately the dynamics of dCas9-based binding and Cas9-based cleavage at all DNA sites.

Fig. 2: One over the repression activity of the dCas9 [4] is plotted against its binding energy as calculated by the Cas9 Calculator.

2. Relation between fold change and binding energy

The fold change is the fold difference in gene expression between the case when the dCas9 is bound to the promoter and when it's not bound. Since binding of dCas9 in a CRISPRi system prevents transcription by blocking access to the promoter, the strength of binding of dCas9 determines the fold-repression.

To estimate the relation between these, data from two papers is used. For a given DNA sequence one paper presents the fold change in binding of dCas9 for different number of mutations [3]. The second paper presents the repression activity of a single mutation at different locations[4].

In both cases the RNA:DNA binding is calculated with the Cas9Calculator and then plotted on a logarithmic scale against the fold change (Fig. 1) or one over the repression activity, respectively (Fig. 2). With the exception of few points, correlations seems to be linear and hence the relation is fitted with the following formula:𝑃=𝑒^(−𝛽(𝑚𝑎𝑥𝐺−𝐺))

Where maxG is the Gibson free energy when there are no mismatches and G is the energy for the software-designed sequence. P is the fold change between the two states. The coefficient %beta = 1.003074571 is the average of all individual coefficients satisfying the formula (with the exception of the few points noted above).

Fig. 3: (a) Starting interface for the web tool (b) Starting interface with the user’s input.

3. The Algorithm

Once the user inputs a DNA sequence and the desired fold change, the software calculates the corresponding RNA+dCas9:DNA binding energy required to achieve that change. Then, starting from the perfectly matching gRNA, a random number of mutations is chosen and then for each mutation two further random numbers indicate its position and the replaced nucleotide (A/T/G/C). This process is repeated 10,000 times where the binding energy of each sequence is calculated using the Cas9calculator. At the end of this step, only sequences with energy within the allowed range are selected.

Next, the dCas9 handle and terminator are added to the gRNA and RNAfold is used to calculate the structure of the final sequence. This is to determine whether the gRNA will disturb the structure of the Cas9 handle by hybridising with it. For higher binding strengths (closer to 100% of maximum), the output gRNA sequence mainly depends on the initial DNA input, the tool cannot enforce an unstructured DNA-binding region. Therefore, we selected the sequences with at least half of their bases unpaired and list them in ascending order of structure in the DNA-binding region, for the user to make their own choice based on other design criteria.

4. Next Steps

Our aim is to launch a web-based tool which will run the code described above in real time making it easier to use, and reaching a wider range of users. So far, a web interface has been created and the code has been successfully hosted on a webserver kindly provided by the Institute of Systems & Synthetic Biology (iSSB), Genopole, France (See snapshots Fig. 3,4,5). The tool could be further improved by increasing its precision or decreasing the time it takes to return results. This could happen by fitting the data from section 2 with a different function or developing a more efficient algorithm for creating gRNA sequences. While the idea behind this tool is simple, it could still be very effective in tunable engineering of CRISPR/Cas9 systems.

The tool is currently available at http://rikpu.lnx.warwick.ac.uk/ and http://warwickigem2016.issb.genopole.fr *We thank Warwick ITS and iSSB for hosting our web tool, and especially thank David Smyth (Warwick) and Joan Hérisson (iSSB) for all the help.

Fig. 4: Confirmation page for submission of input

Fig. 5: Results page

References

[1] Lorenz, Ronny, Stephan H Bernhart, Christian Höner zu Siederdissen, Hakim Tafer, Christoph Flamm, Peter F Stadler, and Ivo L Hofacker. 2011. “ViennaRNA Package 2.0.” Algorithms for Molecular Biology 6 (1): 26. doi:10.1186/1748-7188-6-26.

[2] Farasat, Iman, and Howard M. Salis. 2016. “A Biophysical Model of CRISPR/Cas9 Activity for Rational Design of Genome Editing and Gene Regulation.” PLOS Computational Biology 12 (1): e1004724. doi:10.1371/journal.pcbi.1004724.

[3] Bikard, David, Wenyan Jiang, Poulami Samai, Ann Hochschild, Feng Zhang, and Luciano A Marraffini. 2013. “Programmable Repression and Activation of Bacterial Gene Expression Using an Engineered CRISPR-Cas System.” Nucleic Acids Research 41 (15): 7429–37. doi:10.1093/nar/gkt520.

[4] Qi, Lei S, Matthew H Larson, Luke a Gilbert, Jennifer a Doudna, Jonathan S Weissman, Adam P Arkin, and Wendell a Lim. 2013. “Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression.” Cell 152 (5). Elsevier: 1173–83. doi:10.1016/j.cell.2013.02.022.