1. Modeling Promoter Strength
Because a major goal of the model is to predict the effects of the selection of
promoters on the final production of violacein, we decided to find a way to
characterize promoters first. To simplify the computation, we used the promoter
strength as a single standard to characterize the promoters. Moreover, we assumed
the degradation rate of proteins only depends on the growth rate of E.coli. Then,
every enzyme has the same degradation rate. The bacteriophage T7 promoter has
been widely used for protein expression and purification (Jones et al., 2013),
so we used data of five mutant T7 promoters to create a proof-of-concept model.
If this model was functional, we could implement the same modeling technique to
the promoters we were working with.The five mutant T7 promoters have distinct
promoter strength over time after induction. The experimental data from the
literature are shown in the figure below (Jones et al., 2013).
The first step of our model is to describe the rate of change of enzymes based on promoter strength. Here we assumed that the enzyme production rate is directly proportional to strength of the promoter. Therefore, we were able to use a mass-action kinetics equation of promoters to describe the enzyme concentration. The equation is shown below:
In this equation, Ai is the concentration of enzyme i, ki is the production rate of each enzyme i, kd is the degradation rate of all enzymes, and t is time. By solving this equation, we derived the equation of enzyme concentration against time.
Since we assumed that the promoter strength is proportional to the promoter concentration, we would use the equation to fit our data using least squares method (Fig. 1).
Figure 1. Linear regressions fitted to normalized fluorescence vs time. The circles represent data from Jones et al., 2013. The solid lines are our regression lines. The colors indicate with which promoters the circles and lines correspond.
In general, the regression lines are able to capture the change of strength of each enzyme over time. In this way, the parameters are determined. The table below lists the parameter values.
Table 1. Parameters realted to promoter strength and degradation of molecules. In the table, ki (i = 1,2,3,4,5) are the production rate coefficients of promoter I (i = 1,2,3,4,5), and kd is the degradation rate coefficient of all promoters.
2. Modeling the Steady-state Violacein Yield
After we finished the regression model of each promoter, we created a second model to describe the violacein biosynthetic pathway. The pathway (Fig. 2) involves five enzyme-catalyzed reactions and one non-enzymatic reaction (Lee et al, 2013).
Figure 2. Violacein synthetic pathway. The purple arrows highlight the five enzymatic and one non-enzymatic steps of violacein production from two molecules of tryptophan. The five enzymes are indicated by bolding (VioA, VioB, etc.).
The model was developed as three major parts. A pseudocode of this model is provided here.
Define ODE System
- Calculate the production and degradation rate of each molecule in the pathway from the concentration of reagents and parameters.
- Obtain the rate of change of each molecule based on the production and degradation rates.
Solve the System of Nonlinear Equations at Steady State
- Solve the system of nonlinear equations at steady state starting at an initial guess X0.
- Use the result as a new initial guess; repeat the numerical method to solve the system of equations again.
- Calculate the relative error of each chemical in the new result.
- If the maximum error is smaller than 0.0001%, output violacein concentration at steady state as the final result.
Optimize Parameters to Fit Experimental Data
- Set the initial guess of the parameters.
- Load the data from literature, which include the choice of promoter for each gene and the corresponding violacein yield determined experimentally.
- For each promoter selection scenario, pass the promoter types and the temporary parameters to the steady-state model.
- Obtain the violacein yield predicted by the steady-state model for each promoter selection scenario.
- Compute the residual sum of squares (RSS) of between the predicted violacein yields and the violacein yields given by experiment.
- Determine the optimal parameters by minimizing the RSS (least square method).
Using the principles of mass action kinetics, we derived the system of ODE equations in the model. The equations involves 17 parameters (Table 2). Five parameters (kA, kB, kC, kD and kE) are related to the production rates of the five enzymes, which depend only the strength of the promoter type. Another parameter, kd, is the degradation coefficient of all molecules due to the growth of E.coli. The value of this parameter is fixed and shown in Table1. In addition to these known parameters, the equations include 11 undetermined parameters related to the reaction rates at specific steps in the violacein synthetic pathway. As described in the pseudocode, we used least square regression to determine the optimal values of these parameters.
Each one of the11 differential equations describes the rate of change of specific molecule in the system. The equations consider the production, consumption, and degradation rates of the molecules. Degradation of molecules is described by first order decay. Therefore, the rate of degradation of a molecule depends on a degradation constant and the degradation coefficient. The degradation coefficient is identical for all molecules since it only depends on E.coli growth rate.
Differential Equations in the Model
Our model is able to compute the average violacein yields for all the strains tested experimentally, but can not capture the difference of violacein yield with different promoters strengths. The comparison between the violacein yields determined by experiments and those predicted by our model is shown in Figure 3. The optimal parameters determined by the model are listed in Table 2.
Figure 3. VIolacein yield with different promoter combinations. This graph compares the violacein found for various promoter combinations determined by Jones et al., 2013 (shown in blue) with the violacein concentrations that our model predicted for the same promoter combinations. The root-mean-square error (RMSE) is 52.04.
Table 2. Notations of parameters.