Supported in part by the National Science and Technology Major Project (2021ZD0112302) and the National Natural Science Foundation of China (62222301, 61890930-5, 62021003).
This article develops a novel data-driven safe Q-learning method for designing a safe optimal controller that guarantees the constrained states of nonlinear systems always stay in the safe region while providing an opti...