## On Periodic Reference Tracking Using Batch-Mode Reinforcement Learning with Application to Gene Regulatory Network Control

A synthetic oscillatory network of transcriptional regulators
To illustrate our reference tracking method, we apply it to a gene regulatory network from synthetic biology called a generalised repressilator. The repressilator is a ring of three mutually repressing genes pioneered in [9], and was theoretically generalised to rings of more than three genes in [10]. Repression is an interaction between two genes in which the protein product of the repressing gene prevents protein expression of the repressed gene; put simply, one gene can turn off the other. A generalised repressilator with a sufficiently large, even number of genes (such as the four-gene ring in Figure 1) can exhibit decaying but very long-lived oscillations [11].
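The ring structure described above can be sketched in a few lines of Python. This is only a toy illustration: the protein-only Hill-type dynamics, the function name `simulate_ring`, and all parameter values (`c`, `hill`, `decay`, the initial condition) are assumptions made for the example and are not the model or parameters used in the paper.

```python
import numpy as np


def simulate_ring(n_genes=4, t_end=50.0, dt=0.01, c=10.0, hill=2.0, decay=1.0, p0=None):
    """Forward-Euler simulation of a toy ring of repressing genes.

    Assumed (hypothetical) dynamics for protein concentrations p_i:
        dp_i/dt = c / (1 + p_{i-1}**hill) - decay * p_i
    where gene i is repressed by the protein of gene i-1, cyclically.
    """
    steps = int(round(t_end / dt))
    p = np.empty((steps + 1, n_genes))
    # Arbitrary, slightly asymmetric initial concentrations (an assumption).
    p[0] = np.linspace(0.5, 2.0, n_genes) if p0 is None else np.asarray(p0, dtype=float)
    for k in range(steps):
        repressor = np.roll(p[k], 1)              # entry i holds p_{i-1}
        production = c / (1.0 + repressor ** hill)  # high repressor -> low expression
        p[k + 1] = p[k] + dt * (production - decay * p[k])
    return p
```

Calling `simulate_ring(n_genes=4)` returns an array with one row per time step and one column per gene; plotting the columns against time shows how each protein is pushed down when its repressor is abundant. Whether and how long such a toy ring oscillates depends on the assumed parameters.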

Citation Context ...most 300 samples. These state transitions are then gathered in the set F. The discount factor γ is set to 0.75, a choice guided by considerations similar to those in [8]. The stopping criterion is simply a bound on the number of iterations, set to 30 in this paper. At every iteration, every Qvi function is approximated using Extra-Trees (extremely randomised trees), which was shown to be an effective regression algorithm for the FQI framework [13]. The parameters of the algorithm are set to the default values from [8]. The algorithm is implemented in Python using the machine learning [19], parallelisation [20], graphics [21] and scientific computation [22] toolboxes.

C. Results

One sinusoidal reference trajectory, different periods. In this example, we force the concentration of protein 2 to track a sinusoid with different periods. Here α = 0, and the sinusoid is chosen to resemble the natural oscillations in amplitude and offset from zero: d2t = 8 + 7 · sin(2πt/T). We test the algorithm for the periods T = 50, 150, 250. We can increase the concentration of protein 2 directly by applying the control signal u2. We can also dec...
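For readers unfamiliar with fitted Q iteration (FQI), the settings quoted above (a batch of one-step transitions, γ = 0.75, 30 iterations, tree-based regression) can be sketched as follows. This is a minimal, generic single-Q-function sketch and not the authors' implementation: the paper approximates several Q functions per iteration, whereas this example fits one. The function name `fitted_q_iteration`, the scalar discrete action set, the reward convention (higher is better, for instance a negative tracking error), and the choice of 50 trees are all assumptions; scikit-learn's `ExtraTreesRegressor` is used here as a stand-in Extra-Trees regressor.

```python
import numpy as np
from sklearn.ensemble import ExtraTreesRegressor


def fitted_q_iteration(states, actions, rewards, next_states, action_set,
                       gamma=0.75, n_iterations=30, seed=0):
    """Generic FQI on a batch of transitions (x, u, r, x').

    states, next_states: arrays of shape (n, state_dim)
    actions, rewards:    arrays of shape (n,)
    action_set:          iterable of scalar actions (assumed finite)
    Returns the regressor fitted at the last iteration; it maps
    [state, action] rows to estimated Q values.
    """
    states = np.asarray(states, dtype=float)
    next_states = np.asarray(next_states, dtype=float)
    actions = np.asarray(actions, dtype=float)
    rewards = np.asarray(rewards, dtype=float)

    X = np.column_stack([states, actions])  # regression inputs (x, u)
    q = None
    for _ in range(n_iterations):
        if q is None:
            # First iteration: Q_1(x, u) is regressed on the one-step reward.
            targets = rewards.copy()
        else:
            # Evaluate the previous Q at x' for every candidate action.
            next_q = np.column_stack([
                q.predict(np.column_stack(
                    [next_states, np.full(len(next_states), a)]))
                for a in action_set
            ])
            targets = rewards + gamma * next_q.max(axis=1)
        q = ExtraTreesRegressor(n_estimators=50, random_state=seed).fit(X, targets)
    return q
```

A greedy control for a given state can then be read off by evaluating the returned regressor on that state paired with each action in `action_set` and taking the action with the largest predicted value. If the problem is posed with costs instead of rewards, the `max` becomes a `min`.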

