Which statement is correct? (Single-choice question)
A
Stochastic Gradient Descent (SGD) computes the gradients using the whole training set to update the model parameters once.
B
Batch Gradient Descent (BGD) computes the gradients using one data point to update the model's parameters once.
C
Mini-batch Gradient Descent has the most bouncing behavior compared to SGD and BGD.
D
10 training epochs mean each data point has the opportunity to update the model parameters 10 times.
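The options above turn on how many samples feed each parameter update and on what one epoch covers. A minimal sketch follows, assuming a toy linear-regression problem with a mean-squared-error loss; the function `train`, the synthetic data, and the learning rate are illustrative choices, not part of the original question. Setting `batch_size` to the dataset size gives batch gradient descent, setting it to 1 gives SGD, and anything in between gives mini-batch gradient descent.

```python
import numpy as np

def train(X, y, batch_size, epochs, lr=0.01, seed=0):
    """Gradient descent on mean squared error for a linear model y ~ X @ w.

    Returns the weights, the number of parameter updates performed, and how
    many times each sample contributed to an update.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    updates = 0
    visits = np.zeros(n, dtype=int)
    for _ in range(epochs):
        order = rng.permutation(n)          # one epoch = one full pass over the data
        for start in range(0, n, batch_size):
            idx = order[start:start + batch_size]
            residual = X[idx] @ w - y[idx]
            grad = 2.0 * X[idx].T @ residual / len(idx)
            w -= lr * grad                  # one update per batch
            updates += 1
            visits[idx] += 1
    return w, updates, visits

# Synthetic data (an assumption for illustration only).
data_rng = np.random.default_rng(42)
X = data_rng.normal(size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5])

for name, bs in [("BGD", 100), ("mini-batch", 20), ("SGD", 1)]:
    w, updates, visits = train(X, y, batch_size=bs, epochs=10)
    print(name, "updates:", updates, "visits per sample:", visits.min(), visits.max())
```

With 100 samples and 10 epochs, this sketch performs 10 updates for BGD, 50 for mini-batches of 20, and 1000 for SGD, while every sample contributes to an update exactly 10 times in all three cases.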
Similar questions
Which of the following answers are correct regarding "Gradient descent"? Select all the correct answers.
What are the steps for using a gradient descent algorithm in neural networks?
1. Calculate error between the actual value and the predicted value
2. Reiterate until you find the best weights of the network
3. Pass an input through the network and get values from the output layer
4. Initialize random weight and bias
5. Go to each neuron that contributes to the error and change its respective values to reduce the error
Options: 5, 4, 3, 2, 1 / 4, 3, 1, 5, 2 / 3, 2, 1, 5, 4 / 1, 2, 3, 4, 5
Why does the gradient descent algorithm not work for training a linear classifier?
Which parameter determines the size of the improvement step to take on each iteration of Gradient Descent?
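The role of the step-size parameter raised in the last question can be illustrated with a minimal sketch, assuming a toy one-dimensional objective f(w) = w**2. The function `descend` and the values tried below are hypothetical, chosen only to show the effect; they are not taken from the questions themselves.

```python
def descend(lr, w0=1.0, steps=20):
    """Run plain gradient descent on f(w) = w**2 and return the final w."""
    w = w0
    for _ in range(steps):
        grad = 2.0 * w        # derivative of f(w) = w**2
        w = w - lr * grad     # the learning rate scales the size of each step
    return w

for lr in (0.01, 0.1, 1.1):
    print(f"lr={lr}: final w = {descend(lr)}")
```

In this sketch each step multiplies `w` by `1 - 2 * lr`, so small rates shrink `w` toward the minimum at 0 slowly, moderate rates shrink it faster, and a rate above 1 makes the iterates grow instead of converge.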