Assume that your hypothesis function for linear regression is of the form f(x) = w0 + w1x and that the current values of w0 and w1 are 1 and 2 respectively. Further assume that you are using a learning rate (alpha) of 0.001 What is the new w0 value associated with the point (1, 12), after one gradient update?数值题
登录即可查看完整答案
我们收录了全球超50000道真实原题与详细解析,现在登录,立即获得答案。
类似问题
Which if the following answers are correct regarding "Gredient descent"?Select all the correct answers.
Question at position 32 What are the steps for using a gradient descent algorithm in neural networks? Calculate error between the actual value and the predicted value Reiterate until you find the best weights of the network Pass an input through the network and get values from the output layer Initialize random weight and bias Go to each neuron that contributes to the error and change its respective values to reduce the error 5, 4, 3, 2, 14, 3, 1, 5, 23, 2, 1, 5, 41, 2, 3, 4, 5
Why does the gradient descent algorithm not work for training a linear classifier?
Which parameter determines the size of the improvement step to take on each iteration of Gradient Descent?
更多留学生实用工具
希望你的学习变得更简单
加入我们,立即解锁 海量真题 与 独家解析,让复习快人一步!