Question
Q1. [5 points] The optimization problem of training logistic regression for the binary classification problem, i.e., $Y \in \{0, 1\}$, uses the cross-entropy (CE) loss, which is defined as
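$$\min_{w} \; \mathcal{L}_{\mathrm{CE}}(w) = -\frac{1}{n} \sum_{i=1}^{n} \left[ y_i \log \hat{y}_i + (1 - y_i) \log (1 - \hat{y}_i) \right],$$
where $\hat{y}_i = \sigma(w^\top x_i) = \frac{1}{1 + e^{-w^\top x_i}}$ is the predicted probability for the $i$-th sample.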
One may believe that we could have used the mean squared error (MSE) as the loss function instead of the CE loss. In this case, the optimization problem for logistic regression can be described as
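$$\min_{w} \; \mathcal{L}_{\mathrm{MSE}}(w) = \frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2.$$
Here, $\hat{y}_i = \sigma(w^\top x_i)$ is the sigmoid output of the model for the $i$-th sample.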
The goal of this exercise is to compare the CE loss and the MSE loss when training a logistic regression model and to understand why we should use the CE over the MSE.
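To make the comparison concrete, the two losses can be written down in a few lines of plain Python. This is only a sketch under assumptions that the question does not spell out: labels are taken to be in {0, 1}, the prediction is the sigmoid of a dot product with no separate bias term, and both losses are averaged over the samples. The helper names (`sigmoid`, `predict`, `ce_loss`, `mse_loss`) and the toy data are hypothetical.

```python
import math

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def predict(w, x):
    """y_hat = sigmoid(w . x) for one feature vector x (assumed: no bias term)."""
    return sigmoid(sum(wj * xj for wj, xj in zip(w, x)))

def ce_loss(w, X, y, eps=1e-12):
    """Mean cross-entropy loss; eps clips y_hat so log() never sees 0."""
    total = 0.0
    for xi, yi in zip(X, y):
        p = min(max(predict(w, xi), eps), 1.0 - eps)
        total -= yi * math.log(p) + (1 - yi) * math.log(1.0 - p)
    return total / len(y)

def mse_loss(w, X, y):
    """Mean squared error between the labels and the sigmoid outputs."""
    total = 0.0
    for xi, yi in zip(X, y):
        total += (yi - predict(w, xi)) ** 2
    return total / len(y)

# Toy data (made up for illustration); labels assumed to be in {0, 1}.
X = [[1.0, 2.0], [1.0, -1.0], [1.0, 0.5]]
y = [1, 0, 1]
w0 = [0.0, 0.0]

print(ce_loss(w0, X, y))   # log(2): every prediction is 0.5 at w = 0
print(mse_loss(w0, X, y))  # 0.25: every residual is +/- 0.5 at w = 0
```

Both losses are functions of the same prediction $\hat{y}$; they differ only in how a wrong prediction is penalized, which is what the second-derivative analysis in the exercise probes.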
(a) Compute the first-order and second-order derivatives of the CE loss.
(b) Compute the first-order and second-order derivatives of the MSE loss.
(c) Using the second-order derivatives computed in (a) and (b), explain why we should prefer the cross-entropy loss over the MSE loss for training a logistic regression model. Hint: use the second derivative and check the convexity.
(d) Given the gradients computed in (a), derive the gradient descent (GD) algorithm for the optimization problems in the equations above.
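The parts above can be explored numerically with a small sketch. It is not the posted solution; it assumes the simplest possible setting: a single scalar weight `w`, a scalar feature `x`, labels in {0, 1}, and prediction `p = sigmoid(w * x)`. Under those assumptions the per-sample derivatives are, for CE, `(p - y) * x` and `p * (1 - p) * x**2`, and, for MSE, `2 * (p - y) * p * (1 - p) * x` and `2 * x**2 * p * (1 - p) * (p * (1 - p) + (p - y) * (1 - 2 * p))`. The function names, the toy data, the step size `lr`, and the step count are all hypothetical choices.

```python
import math

def sigmoid(z):
    """Logistic function, written to avoid overflow for large |z|."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def ce_derivs(w, x, y):
    """First and second derivative of the per-sample CE loss w.r.t. w."""
    p = sigmoid(w * x)
    grad = (p - y) * x
    hess = p * (1.0 - p) * x * x          # never negative
    return grad, hess

def mse_derivs(w, x, y):
    """First and second derivative of the per-sample MSE loss (y - p)^2."""
    p = sigmoid(w * x)
    s = p * (1.0 - p)                     # derivative of the sigmoid
    grad = 2.0 * (p - y) * s * x
    hess = 2.0 * x * x * s * (s + (p - y) * (1.0 - 2.0 * p))
    return grad, hess

def mean_grad(deriv_fn, data, w):
    """Gradient of the averaged loss over all (x, y) pairs."""
    return sum(deriv_fn(w, x, y)[0] for x, y in data) / len(data)

def gradient_descent(deriv_fn, data, w0=0.0, lr=0.5, steps=500):
    """Plain GD: w <- w - lr * gradient, repeated for a fixed number of steps."""
    w = w0
    for _ in range(steps):
        w -= lr * mean_grad(deriv_fn, data, w)
    return w

# Convexity check on a grid of weights for one sample (x = 1, y = 1).
ws = [k / 10.0 for k in range(-60, 61)]
ce_min_hess = min(ce_derivs(w, 1.0, 1)[1] for w in ws)
mse_min_hess = min(mse_derivs(w, 1.0, 1)[1] for w in ws)
print(ce_min_hess >= 0.0)   # True: CE curvature is non-negative everywhere
print(mse_min_hess < 0.0)   # True: MSE curvature goes negative for some w

# Non-separable toy data (made up), so the CE minimizer is finite.
data = [(-2.0, 0), (-1.0, 0), (0.5, 1), (1.0, 0), (1.5, 1), (2.0, 1)]
w_ce = gradient_descent(ce_derivs, data)
w_mse = gradient_descent(mse_derivs, data)
print(w_ce, w_mse)
```

In this scalar setting the CE second derivative `p * (1 - p) * x**2` is a product of non-negative terms, so the CE objective is convex and GD with a suitable step size approaches its global minimum. The MSE second derivative changes sign (for `y = 1` it is negative whenever `p < 1/3`), so the MSE objective is not convex in `w`, and its gradient also shrinks toward zero when the sigmoid saturates on a badly misclassified point.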