In reinforcement learning, the agent's policy is predetermined and remains unchanged throughout the training process, regardless of the rewards received from the environment for its actions, the state of the environment, or the outcomes of its actions.判断题

A

True

B

False

登录即可查看完整答案

我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。

更多留学生实用工具

加入我们,立即解锁 海量真题独家解析,让复习快人一步!