What is the key idea behind Monte Carlo Tree Search (MCTS)单项选择题

A

It estimates action values by sampling simulated rollouts guided by statistics of past returns

B

It performs exhaustive search over all possible action sequences

C

It uses policy gradients to directly optimize the expected return

D

It backpropagates exact rewards from terminal nodes only

登录即可查看完整答案

我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。

类似问题

更多留学生实用工具

加入我们,立即解锁 海量真题独家解析,让复习快人一步!