What is the key idea behind Monte Carlo Tree Search (MCTS)单项选择题
A
It estimates action values by sampling simulated rollouts guided by statistics of past returns
B
It performs exhaustive search over all possible action sequences
C
It uses policy gradients to directly optimize the expected return
D
It backpropagates exact rewards from terminal nodes only
登录即可查看完整答案
我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。
类似问题
Select all of the following statements that are true about Monte Carlo Tree Search (MCTS)
蒙特卡洛树搜索(MCTS)的核心思想是什么?
Select all of the following statements that are true about Monte Carlo Tree Search (MCTS)
In a single iteration of the Monte-Carlo Tree Search (MCTS) algorithm, what is the primary purpose of the "Simulate" step (also known as rollout)?
更多留学生实用工具
希望你的学习变得更简单
加入我们,立即解锁 海量真题 与 独家解析,让复习快人一步!