Which statement is correct?  单项选择题

A

Stochastic Gradient Descent (SGD) computes the gradients using the whole training set to update the model parameters once.

B

Batch Gradient Descent (BGD) computes the gradients using one data point to update the models parameters once.

C

Mini-batch Gradient Descent has the most bouncing behavior compared to SGD and BGD.

D

10 training epochs mean each data point has the opportunity to update the model parameters 10 times.

登录即可查看完整答案

我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。

类似问题

更多留学生实用工具

加入我们,立即解锁 海量真题独家解析,让复习快人一步!