Computational Design of Ligand Binding Sites
The bacterial world offers a relatively small selection of chemoreceptors, in comparison
to the vast number of possible ligands. These receptors evolved specifically to recognize
substances, which benefit or harm the organism in some way. On top of that, the fact that
the majority of known receptors today are not well characterized, meant that we had very few
options of designing chimeric receptors like we initially planned.
In light of the above we had to turn to a new path – redesigning the Tar chemoreceptor
to bind a different ligand using computational biology - The Rosetta software.
Rosetta
Rosetta is a bioinformatics software suite for macromolecular modeling and design built
by the RosettaCommons organization - a collaboration between several universities and
research groups from around the world.
Rosetta development began in the laboratory of Dr. David Baker at the University of
Washington as a structure prediction tool but since then has been expanded to solve
many different computational macromolecular problems.
As of 2016, Rosetta algorithms have been used to predict, design and analyze almost
every set of biomolecular systems: proteins, RNA, DNA, peptides, small molecules and
non-canonical amino acids.
Local installation of Rosetta
We quickly discovered that for heavy duty tasks such as redesigning a protein, Rosetta requires more computational power than a regular PC has to offer. While searching for possible computing resources, we came across the local Technion grid of WLCG.
The Worldwide LHC Computing Grid (WLCG) is a
global collaboration of more than 170 computing centers in 42 countries, linking up national
and international grid infrastructures. It was launched in 2002 to provide a resource to
store, distribute and analyze the 15 petabytes (15 million gigabytes) of data generated every
year by the world’s largest and most powerful particle accelerator - The Large Hadron Collider (LHC).
In Israel, there are three computing centers connected to the grid, located in the Technion Institute,
Tel Aviv University and Weizmann Institute.
WLCG supports not only the particle accelerator, but also allows casual users to benefit from this amazing project. We contacted the local Technion grid administrator and received a temporary user on the Atlas server (one of four particle accelerator components). This granted us access to vast computational power, much more than was necessary. With the help of David Cohen, grid computing specialist from the Technion Physics department, we successfully installed Rosetta and all required programs.
Designing a binding site
To redesign the Tar chemoreceptor we followed the protocol presented in "Rosetta and the
Design of Ligand Binding Sites", (1). The purpose of the protocol is to design a binding
site around a selected small molecule ligand. The general steps of the protocol can be
seen in the flowchart to the right.
Using this protocol we managed to generate a library of mutated Tar receptors that theoretically
bind a substance in a novel way and activate the chemotaxis pathway in response to it. For each
design we ran 3-5 iterations of the protocol to assure optimal results.
Filtering Process
The output of the protocol is a library of variants, ranging from dozens to even thousands
of protein PDB files, depending on the parameters of the design run. This fact means that
filtering the results is an extremely crucial part of the process.
Rosetta is able to predict which protein designs are likely to have improved protein
activity, this is done by measuring every aspect of the protein complex such as binding
energies, interactions between amino acids, backbone angles, hydrogen bonds and more. After
the calculation process the user can decide which parameters are relevant and drop the
results which scored the lowest on these. The specific filters we used in our designs
can be seen in this attachment.
First run - Benchmark Test
Before running ahead into complex designs, we had to make sure that Rosetta can "handle"
the Tar protein, meaning it does not design unnecessary or drastic changes. To do this
we ran the protocol with aspartic acid – which is the receptor's native ligand, to see if Rosetta
outputs the native Tar LBD or a result as similar to it as possible.
From this design process we received four output structures (after filtering) with 3-5
mutations each, all of which in the binding pocket. These results proved that Rosetta
can recognize and work with the Tar LBD.
Protocol automation
For the purpose of our work, we automated the different steps of the protocol,
including the filtering process, turning it into a single main script file complete
with well-documented instructions. This script also enables easy modification
of the filtering parameters to suit the specific ligand being used in the design. For more information
see our software tool.
Redesigning for new ligands
Using the Rosetta software's designs, we attempted to construct several new ligand binding site for different materials:
-Histamine
-Lactose and glucose
-Rohypnol
-Ampicillin
Histamine
As a proof of concept we redesigned the Tar LBD to bind Histamine. This ligand is a
derivative of Histidine, which is also an amino acid as the native Tar ligand, aspartic
acid. This increases the chances of a successful result. Beside the molecular considerations,
Histamine is known to be found in food poison, especially in rotten fish.
The next figure and video demonstrate the library we got after running few cycles and filtration:
Fig. 6: 3D imaging of the 11 variants in the library with the native Tar (wild type), each color represent different variant. As expected, the mutations can be seen near the binding pocket.
Analyzing the results illustrating two main regions of mutations, one around amino acid number 34 in the LBD sequence and the second around the 115th amino acid. Those results led us to design and perform a two-step cloning assay (link to Histamine cloning assay), in each step we insert the mutations with single PCR reaction.
Lactose and Glucose
These days many people are allergic to milk products because of their sensitivity or intolerance
to Lactose. We want to offer them a detection solution based on our system and a chip, therefore
we redesign the Tar LBD to bind Lactose.
The next figures demonstrate the library we got for Lactose after running few cycles and filtration:
Fig. 8: 3D imaging of the 7 variants in the library with the native Tar (wild type), each color represent different variant.
As this is a novel design of ligand binding domain to bind sugar molecule, we decide to
have proof of concept with smaller and easier sugar, one of the Lactose component- Glucose.
Glucose is well known monosaccharide and it is the main compound used in the production of
energy in living organisms. For this reason we can find existing chemoreceptors for Glucose
(2), but redesigning Tar LBD to bind Glucose was perform as one step before redesigning
Tar LBD to bind Lactose.
The next figure and video demonstrate the library we got for Glucose after running few cycles
and filtration:
Rohypnol
Rohypnol, also known as Flunitrazepam, used in some countries to treat insomnia. But
Rohypnol known better as 'date rape drug', meaning it used for drugging other person,
incapacitate him and make him more vulnerable to sexual assault such as rape.
As one application of our project, we like to offer a simple rape drug test based on
our system to help men and women defend themselves when going out. Redesign the Tar
LBD to bind Rohypnol take as one step closer to achieve this goal.
The next figures demonstrate the library we got for Rohypnol after running few cycles
and filtration:
Fig. 13: 3D imaging of the 4 variants in the library with the native Tar (wild type), each color represent different variant
Ampicillin
Target bacteria toward antibiotics may seem redundant because than the bacteria will die,
but it can be also used as an effective kill switch- small amount of antibiotics can kill
more bacteria if those attract to it. To expand our novel redesigning method we redesigned
the Tar LBD to bind Ampicillin antibiotics.
The next figures demonstrate the library we got for Ampicillin after running few cycles and filtration:
Histamine chemotaxis results
The design run produced 870 results, after filtering we remained with 11 possible variants which we decided to clone and test with microscope (link to chemotaxis microscope assay). Only one variant was discovered as attractant for histamine.
Rosetta Guide for the iGEM beginner:
During our work with Rosetta we stumbled into quite a few challenges that required us to browse
through the official documentation and the Rosetta support forums and also consult with experts
in computational design. These problems made us realize how difficult Rosetta can be to completely
new users, especially undergraduates lacking the necessary knowledge.
To make Rosetta more accessible to the iGEM community we decided to team up with iGEM TU Eindhoven
and compile a quick start guide complete with important links, protocols and information we gathered
from our experience with Rosetta.
We hope that this guide will help future iGEM teams and novice Rosetta users in general.
Click here to see the full guide.
References:
1. Moretti, R., Bender, B.J., Allison, B. and Meiler, J., 2016. Rosetta and the Design
of Ligand Binding Sites. Computational Design of Ligand Binding Proteins, pp.47-62.
2. Adler, Julius, Gerald L. Hazelbauer, and M. M. Dahl. "Chemotaxis toward sugars in
Escherichia coli." Journal of bacteriology 115.3 (1973): 824-847.