Team:Technion Israel/Modifications/Rosetta

S.tar, by iGEM Technion 2016


Computational Design of Ligand Binding Sites


The bacterial world offers a relatively small selection of chemoreceptors compared with the vast number of possible ligands. These receptors evolved specifically to recognize substances that benefit or harm the organism in some way. In addition, most receptors known today are not well characterized, which left us with very few options for designing chimeric receptors as we had initially planned.

In light of the above, we turned to a new path: redesigning the Tar chemoreceptor to bind a different ligand using computational biology, specifically the Rosetta software.

Rosetta

Rosetta is a bioinformatics software suite for macromolecular modeling and design, built by RosettaCommons, a collaboration between several universities and research groups from around the world.

Rosetta development began in the laboratory of Dr. David Baker at the University of Washington as a structure prediction tool, but it has since been expanded to address many different computational macromolecular problems.
As of 2016, Rosetta algorithms have been used to predict, design and analyze a wide range of biomolecular systems: proteins, RNA, DNA, peptides, small molecules and non-canonical amino acids.


Local installation of Rosetta

We quickly discovered that for heavy-duty tasks such as redesigning a protein, Rosetta requires more computational power than a regular PC can offer. While searching for computing resources, we came across the local Technion site of the WLCG.

Fig. 1: WLCG computing grid.

The Worldwide LHC Computing Grid (WLCG) is a global collaboration of more than 170 computing centers in 42 countries, linking up national and international grid infrastructures. It was launched in 2002 to provide a resource to store, distribute and analyze the 15 petabytes (15 million gigabytes) of data generated every year by the world’s largest and most powerful particle accelerator, the Large Hadron Collider (LHC).
In Israel, three computing centers are connected to the grid, located at the Technion, Tel Aviv University and the Weizmann Institute.

The WLCG not only supports the particle accelerator but also allows outside users to benefit from the project. We contacted the local Technion grid administrator and received a temporary user account on the Atlas server (ATLAS is one of the four main LHC experiments). This gave us access to far more computational power than we needed. With the help of David Cohen, a grid computing specialist from the Technion Physics department, we successfully installed Rosetta and all required programs.

Fig. 2: Worldwide computing grid distribution. Yellow dots represent different computing centers.


Designing a binding site




To redesign the Tar chemoreceptor we followed the protocol presented in "Rosetta and the Design of Ligand Binding Sites" (1). The purpose of the protocol is to design a binding site around a selected small-molecule ligand. The general steps of the protocol are shown in the flowchart in Fig. 3.


Using this protocol we generated a library of mutated Tar receptors that, in theory, bind the target substance and activate the chemotaxis pathway in response to it. For each design we ran 3-5 iterations of the protocol to refine the results.

Fig. 3: Flowchart of the ligand binding domain design protocol.




Filtering Process

The output of the protocol is a library of variants, ranging from dozens to thousands of protein PDB files, depending on the parameters of the design run. Filtering the results is therefore a crucial part of the process.
Rosetta can predict which protein designs are likely to have improved activity. It does so by measuring many aspects of the protein complex, such as binding energies, interactions between amino acids, backbone angles, hydrogen bonds and more. After the calculation, the user decides which parameters are relevant and drops the designs that perform worst on them. The specific filters we used in our designs can be seen in this attachment.
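The filtering step described above can be sketched in a few lines of Python. This is only an illustration, not our actual filter set: the metric names (`interface_energy`, `total_score`, `hbonds_to_ligand`), the cutoffs and the design names are all hypothetical placeholders, and the sketch assumes that the per-design metrics have already been parsed into dictionaries.

```python
from dataclasses import dataclass


@dataclass
class Design:
    name: str
    metrics: dict  # metric name -> numeric value


# Hypothetical thresholds, for illustration only.
# "max": keep designs whose value is at or below the cutoff (energies: lower is better).
# "min": keep designs whose value is at or above the cutoff (counts: higher is better).
FILTERS = {
    "interface_energy": ("max", -10.0),
    "total_score": ("max", -300.0),
    "hbonds_to_ligand": ("min", 2),
}


def passes(design, filters):
    """Return True if the design satisfies every threshold."""
    for metric, (kind, cutoff) in filters.items():
        value = design.metrics.get(metric)
        if value is None:
            return False  # a missing metric counts as a failure
        if kind == "max" and value > cutoff:
            return False
        if kind == "min" and value < cutoff:
            return False
    return True


def filter_designs(designs, filters, rank_by="interface_energy"):
    """Keep passing designs, sorted so the lowest (best) energy comes first."""
    kept = [d for d in designs if passes(d, filters)]
    return sorted(kept, key=lambda d: d.metrics[rank_by])


# Made-up example library.
designs = [
    Design("design_0001", {"interface_energy": -12.5, "total_score": -320.0, "hbonds_to_ligand": 3}),
    Design("design_0002", {"interface_energy": -8.0, "total_score": -350.0, "hbonds_to_ligand": 4}),
    Design("design_0003", {"interface_energy": -15.1, "total_score": -310.0, "hbonds_to_ligand": 2}),
    Design("design_0004", {"interface_energy": -11.0, "total_score": -305.0, "hbonds_to_ligand": 1}),
]

selected = filter_designs(designs, FILTERS)
for d in selected:
    print(d.name, d.metrics)
```

Keeping the thresholds in a single dictionary makes it easy to adjust them for each ligand without touching the filtering logic.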




First run - Benchmark Test




As a test phase, before advancing to more complex designs, we ran the protocol with aspartic acid, the receptor's native ligand. This was done to make sure that Rosetta can "handle" the Tar protein, meaning that it does not introduce unnecessary or drastic changes into the protein.

After filtering, this design process yielded four output structures with 3-5 mutations each, all of them in the binding pocket. These results indicated that Rosetta can recognize and work with the Tar ligand binding domain (LBD).

Fig. 4: Native Tar results.

Protocol automation

For the purpose of our work, we automated the different steps of the protocol, including the filtering process, turning it into a single main script file complete with well-documented instructions. This script also enables easy modification of the filtering parameters to suit the specific ligand being used in the design. For more information see our software tool.

Redesigning for new ligands


Using Rosetta's designs, we attempted to construct several new ligand binding sites for different substances:
-Histamine
-Lactose and glucose
-Rohypnol
-Ampicillin




Histamine

As a proof of concept we redesigned the Tar LBD to bind Histamine. Histamine is a derivative of the amino acid Histidine, and the native Tar ligand, aspartic acid, is also an amino acid; this similarity increases the chances of a successful result. Histamine is also of practical interest because it is found in decaying food, especially rotten fish.
The following figure and video show the library we received after running several design cycles and filtering:

Fig. 5: Alignment of the ligand binding domain (LBD). The alignment presents the 11 variants in the library with the native Tar (wild type).




Fig. 6: 3D imaging of the 11 variants in the library with the native Tar (wild type); each color represents a different variant. As expected, the mutations can be seen near the binding pocket.




Analysis of the results shows two main regions of mutations: one around amino acid 34 of the LBD sequence and a second around amino acid 115. These results led us to design and perform a two-step cloning assay (link to Histamine cloning assay); in each step, we inserted the mutations with a single PCR reaction.
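An analysis of this kind, locating the positions where the variants differ from the wild type, can be sketched as follows. The sequences below are short made-up strings, not the real Tar LBD or our variants, and the function names are hypothetical. The sketch assumes that all sequences are already aligned and of equal length.

```python
from collections import Counter


def find_mutations(wild_type, variant):
    """Return (position, wild-type residue, variant residue) for each difference.

    Positions are 1-based, as is usual for protein sequences.
    """
    if len(wild_type) != len(variant):
        raise ValueError("sequences must be aligned and of equal length")
    return [
        (i + 1, wt, mut)
        for i, (wt, mut) in enumerate(zip(wild_type, variant))
        if wt != mut
    ]


def mutation_hotspots(wild_type, variants):
    """Count how many variants carry a mutation at each position."""
    counts = Counter()
    for sequence in variants.values():
        for position, _, _ in find_mutations(wild_type, sequence):
            counts[position] += 1
    return counts


# Toy data for illustration only (not the Tar sequence).
wild_type = "MKTLLSAGRVDE"
variants = {
    "v1": "MKALLSAGRVDE",
    "v2": "MKALLSAGRWDE",
    "v3": "MKTLLSAGRWDE",
}

hotspots = mutation_hotspots(wild_type, variants)
for position, n_variants in sorted(hotspots.items()):
    print(f"position {position}: mutated in {n_variants} variant(s)")
```

Positions that are mutated in many variants cluster into regions, which is the kind of pattern that suggested a two-step cloning strategy.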




Lactose and Glucose

Many people today cannot consume milk products because of sensitivity or intolerance to Lactose. We want to offer them a detection solution based on our FlashLab system, so we redesigned the Tar LBD to bind Lactose.
The following figures show the library we received for Lactose after running several design cycles and filtering:

Fig. 7: Alignment of the ligand binding domain (LBD). The alignment presents the 11 variants in the library with the native Tar (wild type).




Fig. 8: 3D imaging of the 7 variants in the library with the native Tar (wild type); each color represents a different variant.




Because designing a ligand binding domain to bind a sugar molecule is a novel task, we decided to start with a proof of concept on a smaller and simpler molecule, Glucose, which is a component of Lactose. Glucose is a well-known monosaccharide and the main compound used for energy production in living organisms. For this reason, chemoreceptors for Glucose already exist (2); nevertheless, we redesigned the Tar LBD to bind Glucose as a preliminary step before redesigning it to bind Lactose.
The following figure and video show the library we received for Glucose after running several design cycles and filtering:

Fig. 9: Alignment of the ligand binding domain (LBD). The alignment presents the 8 variants in the library with the native Tar (wild type).




Fig. 10: 3D imaging of the 8 variants in the library with the native Tar (wild type); each color represents a different variant.




Rohypnol

Rohypnol, also known as Flunitrazepam, is used in some countries to treat insomnia. However, it is better known as the 'date rape drug'.
As one application of our project, we would like to offer a simple drug test based on our system to help men and women identify danger when going out. Redesigning the Tar LBD to bind Rohypnol takes us one step closer to achieving this goal.
The following figures show the library we received for Rohypnol after running several design cycles and filtering:

Fig. 12: Alignment of the ligand binding domain (LBD). The alignment presents the 4 variants in the library with the native Tar (wild type).




Fig. 13: 3D imaging of the 4 variants in the library with the native Tar (wild type); each color represents a different variant.




Ampicillin

Attracting bacteria towards antibiotics may seem redundant, as the bacteria will simply die. However, it can also serve as an effective kill switch: a small amount of antibiotic can kill more bacteria if they are attracted to it. To expand our novel approach, we redesigned the Tar LBD to bind the antibiotic Ampicillin.
The following figures show the library we received for Ampicillin after running several design cycles and filtering:

Fig. 14: Alignment of the ligand binding domain (LBD). The alignment presents the 13 variants in the library with the native Tar (wild type).




Fig. 15: 3D imaging of the 13 variants in the library with the native Tar (wild type); each color represents a different variant.

Histamine results


The Rosetta design process produced 870 results, of which 11 variants remained after filtering. The 11 variants were cloned into the native Tar LBD. Only 6 of them showed the expected sequence in sequencing, and these were subjected to chemotaxis tests. In each test, the suspension of cloned bacteria was placed in a commercial ibidi chip. The chip was placed under a microscope and the attractant was added to start the experiment. The interaction of the bacteria with the attractant was recorded as a time lapse of one frame every 30 seconds for 20 minutes. The control followed the same process, with motility buffer added instead of the attractant, Histamine. Of the 6 tested variants, only one was found to be attracted to Histamine. The results of the chemotaxis test for this variant, number 9, are presented in figure 1.
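The numbers in this paragraph can be put in proportion with a short calculation. The sketch below uses only the counts stated above; the stage labels are our own wording, and the frame count assumes that the first frame is taken at time zero, which is not stated explicitly.

```python
# Counts taken from the text: design outputs, filtered variants,
# variants with the expected sequence, and variants attracted to Histamine.
funnel = [
    ("Rosetta outputs", 870),
    ("after filtering", 11),
    ("expected sequence", 6),
    ("attracted to Histamine", 1),
]

# Fraction of candidates retained at each stage relative to the previous one.
stage_fractions = [
    n / prev_n for (_, prev_n), (_, n) in zip(funnel, funnel[1:])
]

for (label, n), fraction in zip(funnel[1:], stage_fractions):
    print(f"{label}: {n} ({fraction:.1%} of the previous stage)")

# Time lapse: one frame every 30 seconds for 20 minutes.
interval_s = 30
duration_s = 20 * 60
n_intervals = duration_s // interval_s
n_frames = n_intervals + 1  # assumption: a frame is also taken at t = 0

print(f"{n_intervals} intervals, {n_frames} frames per experiment")
```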






Fig. 1: Microscope results of chemotaxis activity for variant His_9 with 10 mM Histamine. a. Tar-Histamine at 0 minutes (when the Histamine was added). b. Tar-Histamine after 20 minutes. c. Control at 0 minutes (when the motility buffer was added). d. Control after 20 minutes.




To verify that the receptor localizes correctly to both poles of the bacteria, GFP (E0040) was fused to its C-terminus through a short linker sequence. The results of these tests, shown in figure 2, support our assumption of correct localization.






Fig. 2: Results of GFP fusion. a. White light image of Tar-Histamine-GFP. b. Fluorescence (490 nm excitation) of Tar-Histamine-GFP. c. White light image of normal Tar. d. Fluorescence (490 nm excitation) of normal Tar.




Finally, video 1 demonstrates a working concept of the FlashLab project, a chip that serves as a detection tool based on the chemotaxis system of E. coli bacteria. A commercial ibidi chip was filled with a suspension of bacteria expressing the chemoreceptor and a chromoprotein (K1357008). A solution of the attractant, Histamine, at a concentration of 10⁻³ M was added to the chip, and the displacement of the bacteria was monitored and recorded.


Video 1: From left to right: (1) Histamine-Tar with Histamine attractant added. (2) Histamine-Tar with motility buffer added (control).

Glucose results


A swarming plate assay was performed to test the functionality of the Tar-Glucose receptor (swarming assay protocol). Two glucose concentrations were tested: 1 mM and 10 mM. A control was performed with no glucose added. The swarming assay did not indicate any chemotactic activity of the Tar-Glucose receptor.

Rosetta Guide for the iGEM beginner


During our work with Rosetta we ran into quite a few challenges that required us to browse the official documentation and the Rosetta support forums, and to consult experts in computational design. These problems made us realize how difficult Rosetta can be for completely new users, especially undergraduates who lack the necessary background.
To make Rosetta more accessible to the iGEM community, we teamed up with iGEM TU Eindhoven to compile a quick start guide with important links, protocols and information gathered from our experience with Rosetta.
We hope that this guide will help future iGEM teams and novice Rosetta users in general.

Click here to see the full guide.

References:
1. Moretti, R., Bender, B.J., Allison, B. and Meiler, J., 2016. Rosetta and the Design of Ligand Binding Sites. Computational Design of Ligand Binding Proteins, pp.47-62.

2. Adler, J., Hazelbauer, G.L. and Dahl, M.M., 1973. Chemotaxis toward sugars in Escherichia coli. Journal of Bacteriology, 115(3), pp.824-847.



