
Question


(Machine Learning) In Python 3.

Do not use the preprocessing module from sklearn.


3.6 Stochastic Gradient Descent

When the training data set is very large, evaluating the gradient of the objective function can take a long time, since it requires looking at each training example to take a single gradient step. When the objective function takes the form of an average of many values, such as J(θ) = (1/m) ∑_{i=1}^m f_i(θ) (as it does in the empirical risk), stochastic gradient descent (SGD) can be very effective. In SGD, rather than taking −∇J(θ) as our step direction, we take −∇f_i(θ) for some i chosen uniformly at random from {1, ..., m}. The approximation is poor, but we will show it is unbiased.

In machine learning applications, each f_i(θ) would be the loss on the ith example (and of course we'd typically write n instead of m for the number of training points). In practical implementations for ML, the data points are randomly shuffled, and then we sweep through the whole training set one by one, performing an update for each training example individually. One pass through the data is called an epoch. Note that each epoch of SGD touches as much data as a single step of batch gradient descent. You can use the same ordering for each epoch, though optionally you could investigate whether reshuffling after each epoch affects the convergence speed.

1. Show that the ridge regression objective function can be written in the form J(θ) = (1/m) ∑_{i=1}^m f_i(θ) by giving an expression for f_i(θ) that makes the two expressions equivalent.

2. Show that the stochastic gradient ∇f_i(θ), for i chosen uniformly at random from {1, ..., m}, is an unbiased estimator of ∇J(θ). In other words, show that E[∇f_i(θ)] = ∇J(θ) for any θ. (Hint: It will be easier, notationally, to prove this for a general J(θ) = (1/m) ∑_{i=1}^m f_i(θ), rather than for the specific case of ridge regression. You can start by writing down an expression for E[∇f_i(θ)]...)

3. Write down the update rule for θ in SGD for the ridge regression objective function.

4. Implement stochastic_grad_descent. (Note: You could potentially generalize the code you wrote for batch gradient descent to handle minibatches of any size, including 1, but this is not necessary.)

5. Use SGD to find the θ that minimizes the ridge regression objective for the λ and B that you selected in the previous problem. (If you could not solve the previous problem, choose λ = 10^-2 and B = 1.) Try a few fixed step sizes (at least try η ∈ {0.05, 0.005}). Note that SGD may not converge with a fixed step size; simply note your results. Next try step sizes that decrease with the step number according to the schedules η_t = C/t and η_t = C/√t.
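For part 2, the only randomness is in the choice of index i, which is uniform over {1, ..., m}. A minimal sketch of the argument, assuming the general form J(θ) = (1/m) ∑_{i=1}^m f_i(θ) from the hint and using linearity of the gradient:

\[
\mathbb{E}\big[\nabla f_i(\theta)\big]
= \sum_{i=1}^{m} \frac{1}{m}\,\nabla f_i(\theta)
= \nabla\!\left(\frac{1}{m}\sum_{i=1}^{m} f_i(\theta)\right)
= \nabla J(\theta).
\]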
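For parts 3 and 4, if the ridge objective is decomposed with f_i(θ) = (θᵀx_i − y_i)² + λθᵀθ, then ∇f_i(θ) = 2(θᵀx_i − y_i)x_i + 2λθ and the SGD update is θ ← θ − η_t ∇f_i(θ). Below is a minimal sketch of stochastic_grad_descent under those assumptions; the exact signature, the step-size strings "1/t" and "1/sqrt(t)", the constant C = 0.1, and the helper compute_square_loss are illustrative choices, not taken from the assignment's skeleton code.

```python
import numpy as np


def compute_square_loss(X, y, theta):
    """Average square loss (1/m) * ||X theta - y||^2, without the regularization term."""
    m = X.shape[0]
    residual = X.dot(theta) - y
    return residual.dot(residual) / m


def stochastic_grad_descent(X, y, alpha=0.05, lambda_reg=1e-2, num_epochs=1000):
    """SGD for the ridge objective J(theta) = (1/m) sum_i (x_i.theta - y_i)^2 + lambda*||theta||^2.

    alpha may be a fixed float step size, or one of the strings
    "1/t" (eta_t = C/t) and "1/sqrt(t)" (eta_t = C/sqrt(t)), with C = 0.1 here.
    Returns the history of theta and of the (unregularized) square loss after every update.
    """
    m, d = X.shape
    theta = np.zeros(d)
    theta_hist = np.zeros((num_epochs, m, d))
    loss_hist = np.zeros((num_epochs, m))
    C = 0.1
    t = 0  # total number of updates so far, used by the decreasing schedules

    for epoch in range(num_epochs):
        # One epoch: sweep through the training set in a (re)shuffled order.
        for step, i in enumerate(np.random.permutation(m)):
            t += 1
            if alpha == "1/t":
                eta = C / t
            elif alpha == "1/sqrt(t)":
                eta = C / np.sqrt(t)
            else:
                eta = alpha  # fixed step size
            # Stochastic gradient of f_i(theta) = (x_i.theta - y_i)^2 + lambda*||theta||^2
            grad_i = 2.0 * (X[i].dot(theta) - y[i]) * X[i] + 2.0 * lambda_reg * theta
            theta = theta - eta * grad_i
            theta_hist[epoch, step] = theta
            loss_hist[epoch, step] = compute_square_loss(X, y, theta)

    return theta_hist, loss_hist
```

For part 5, you would call this with the fixed step sizes (e.g., alpha=0.05 and alpha=0.005) and then with alpha="1/t" and alpha="1/sqrt(t)", and compare how loss_hist evolves across epochs.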

