Fig. 2
From: Incorporating causal factors into reinforcement learning for dynamic treatment regimes in HIV

The evolution of the six types of cells for the first patient (i.e., before learning). Panels a-f correspond to the continuous change of \(\mathrm{T}_{1}, \mathrm{T}_{2}, \mathrm{T}_{1}^{*}, \mathrm{T}_{2}^{*}\), E, and V cells, respectively.