A Markov Decision Process (MDP) is a ( ) process, meaning that the outcome of an action depends only on the current state.

Correct answer: discrete-time stochastic control
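
To make the three words of the answer concrete, here is a minimal sketch in Python. The two-state model, its state and action names, and its probabilities and rewards are all made up for illustration. Time advances in discrete steps, each transition is sampled at random (stochastic), and the agent picks an action at every step (control). Note that `step` receives only the current state and the chosen action, never the earlier history, which is the Markov property the question describes.

```python
import random

# Hypothetical model: TRANSITIONS[state][action] is a list of
# (next_state, probability, reward) tuples. All values are illustrative.
TRANSITIONS = {
    "cool": {
        "slow": [("cool", 1.0, 1.0)],
        "fast": [("cool", 0.5, 2.0), ("warm", 0.5, 2.0)],
    },
    "warm": {
        "slow": [("cool", 0.5, 1.0), ("warm", 0.5, 1.0)],
        "fast": [("warm", 1.0, -10.0)],
    },
}


def step(state, action, rng):
    """Sample (next_state, reward) from the current state and action only."""
    outcomes = TRANSITIONS[state][action]
    weights = [prob for _, prob, _ in outcomes]
    next_state, _, reward = rng.choices(outcomes, weights=weights, k=1)[0]
    return next_state, reward


def rollout(start, policy, steps, seed=0):
    """Follow a fixed policy (a state -> action dict) for a number of steps."""
    rng = random.Random(seed)
    state = start
    history = [state]
    for _ in range(steps):  # discrete time: t = 0, 1, 2, ...
        state, _ = step(state, policy[state], rng)
        history.append(state)
    return history
```

For example, `rollout("cool", {"cool": "fast", "warm": "slow"}, 10)` returns the sequence of visited states for a policy that drives fast when cool and slow when warm.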