Question: Show how gradient descent can be used for learning a linear function that minimizes the absolute error. [Hint: Do a case analysis of the error;
Show how gradient descent can be used for learning a linear function that minimizes the absolute error. [Hint: Do a case analysis of the error; for each example the absolute value is either the positive or the negative of the value.
What is appropriate when the value is zero?]
Step by Step Solution
There are 3 Steps involved in it
Get step-by-step solutions from verified subject matter experts
