In this article, we propose a deep reinforcement learning based chemotherapy regulation framework to realize personalized and dynamic optimization of cancer treatment. We use a nonlinear dynamic system to model the dynamic evolution of the tumor microenvironment including tumor cell, normal cell and immune cell interactions, with drug concentration serving as the control input variable. The Deep Deterministic Policy Gradient (DDPG) algorithm makes agents can study optimal dosing strategy in a continuous space of movement to inhibit tumor growth effectively and minimize damage to normal tissues. To make the strategy more stable, Gauss noise is added to the model to simulate physiological oscillations and uncertainties in the treatment reaction. Experimental results show that it can control the growth of tumor in various initial scenarios and the accumulation of the drug concentration with high flexibility and safety. Our technique provides a feasible technical way for precision, low toxicity adaptive chemotherapy.
Building similarity graph...
Analyzing shared references across papers
Loading...
Wenlang Zhu
Ming Zhu
Weiye Wang
PLoS ONE
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhu et al. (Wed,) studied this question.
www.synapsesocial.com/papers/69d8958f6c1944d70ce068b8 — DOI: https://doi.org/10.1371/journal.pone.0345877