Question

1 Approved Answer

Posted on Jan 19, 2024

The goal of this problem is to implement the stochastic gradient descent (SGD) algorithm to build a latent factor recommendation system. We can use it to recommend movies to users. Suppose that we have a matrix $R$ of ratings, where the element $R_{i,u}$ is the rating given to item $i$ by user $u$. The size of $R$ is $m \times n$, where $m$ is the number of movies and $n$ is the number of users. Note that most elements of the matrix are unknown/empty, since each user can only rate/view a small proportion of all of the movies. Our goal is to find two matrices $P$ and $Q$ so that $R \approx QP^T$, where $Q$ is $m \times k$ and $P$ is $n \times k$, and $k$ is a parameter of our algorithm.

The error metric we will use is:

$$E = \left(\sum_{i \sim u} \left(R_{i,u} - q_i p_u^T\right)^2\right) + \lambda \left(\sum_{u} \|p_u\|_2^2 + \sum_{i} \|q_i\|_2^2\right)$$

where $i \sim u$ means that we only sum over entries where the user actually rated that item (that is, the entries in $R$ that are known), $q_i$ is the $i$th row of $Q$, corresponding to an item, and $p_u$ is the $u$th row of $P$, corresponding to a user, so these are both vectors of size $k$. The regularization parameter is $\lambda$, and $\|\cdot\|_2^2$ is the sum of the squares of the vector entries.

Complete the following steps:

1. If $\varepsilon_{i,u}$ denotes the derivative of $E$ with respect to $R_{i,u}$, then $\varepsilon_{i,u} = 2(R_{i,u} - q_i p_u^T)$ and the update equations for $q_i$ and $p_u$ in stochastic gradient descent are:

$$q_i \leftarrow q_i + \eta\,(\varepsilon_{i,u}\, p_u - 2\lambda q_i)$$
$$p_u \leftarrow p_u + \eta\,(\varepsilon_{i,u}\, q_i - 2\lambda p_u)$$

2. Implement the algorithm using the updates described in the previous part. Read each entry of $R$ from disk and update $\varepsilon_{i,u}$, $q_i$, and $p_u$ for each entry. This means that you should not store $R$ in memory. Instead, you should read each element sequentially and apply the update equations to each element at each iteration. Thus, each iteration will read the whole file.

3. Set $k = 20$, $\lambda = 1$, and the number of iterations to 40. Find a reasonable value for the learning rate $\eta$, starting with $\eta = 1$. The error on the training set should be below 70,000 after 40 iterations, and $q_i$ and $p_u$ should have converged. If $\eta$ is too large, the error value can converge to something too large or may not decrease monotonically (it can fail to converge). If $\eta$ is too small, the error function doesn't have time to decrease within 40 steps.

4. Use the dataset ratings.train.txt included with the assignment, which is formatted as a matrix $R$ as described above. Plot the value of $E$ as a function of the number of iterations for your value of $\eta$.

Hints:

- You might try to initialize $P$ and $Q$ to random values in $[0, \sqrt{5/k}]$ so that $q_i p_u^T \in [0, 5]$.
- In the update step, $q_i$ and $p_u$ depend on each other. Compute the new values for each depending on all of the old values and then update both vectors at once.
- $E$ should be computed at the end of the full iteration, not elementwise while the matrices are being updated.
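The update equations in step 1 can be checked with a short derivation. The sketch below is not part of the original problem statement. It assumes the per-rating objective $E_{i,u}$ (a name introduced here for illustration), in which the regularization on $q_i$ and $p_u$ is applied each time the rating $(i,u)$ is visited. Under that assumption, a plain gradient step with learning rate $\eta$ reproduces the stated updates.

```latex
\begin{align*}
% Per-rating objective (assumption: regularize q_i and p_u on every visit)
E_{i,u} &= \left(R_{i,u} - q_i p_u^T\right)^2
          + \lambda\left(\|p_u\|_2^2 + \|q_i\|_2^2\right) \\
\varepsilon_{i,u} &= \frac{\partial E_{i,u}}{\partial R_{i,u}}
          = 2\left(R_{i,u} - q_i p_u^T\right) \\
% Gradients with respect to the two factor vectors
\frac{\partial E_{i,u}}{\partial q_i}
          &= -2\left(R_{i,u} - q_i p_u^T\right) p_u + 2\lambda q_i
          = -\left(\varepsilon_{i,u}\, p_u - 2\lambda q_i\right) \\
\frac{\partial E_{i,u}}{\partial p_u}
          &= -2\left(R_{i,u} - q_i p_u^T\right) q_i + 2\lambda p_u
          = -\left(\varepsilon_{i,u}\, q_i - 2\lambda p_u\right) \\
% Gradient descent step: move against the gradient
q_i &\leftarrow q_i - \eta\,\frac{\partial E_{i,u}}{\partial q_i}
          = q_i + \eta\left(\varepsilon_{i,u}\, p_u - 2\lambda q_i\right) \\
p_u &\leftarrow p_u - \eta\,\frac{\partial E_{i,u}}{\partial p_u}
          = p_u + \eta\left(\varepsilon_{i,u}\, q_i - 2\lambda p_u\right)
\end{align*}
```

Both gradients are evaluated at the old values of $q_i$ and $p_u$, which is why the hint asks for the two vectors to be updated at once.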

Step by Step Solution

There are 3 Steps involved in it

Step: 1

This is a machine learning problem that uses the stochastic gradient descent (SGD) algorithm to factorize a ratings matrix R with a latent factor model for a recommendation system. The task is to find two ma...
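The procedure laid out in the question can be sketched in plain Python as follows. This is a minimal illustration, not the expert's hidden solution. It assumes each line of the ratings file holds one known rating as `user item rating` separated by whitespace, since the source does not specify the file layout. The helper names `read_ratings`, `dot`, `total_error`, and `train` are hypothetical. The values of `lam` (λ) and `eta` (η) are left as required arguments rather than defaults.

```python
import math
import random


def read_ratings(path):
    """Yield (user, item, rating) triples, one known rating per line.

    ASSUMPTION: each line is "user item rating" separated by whitespace.
    The source does not specify the layout of ratings.train.txt.
    """
    with open(path) as f:
        for line in f:
            parts = line.split()
            if len(parts) == 3:
                yield int(parts[0]), int(parts[1]), float(parts[2])


def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def total_error(path, P, Q, lam):
    """E = squared residuals over known ratings + lam * sum of squared norms."""
    sq = sum((r - dot(Q[i], P[u])) ** 2 for u, i, r in read_ratings(path))
    reg = sum(dot(p, p) for p in P.values()) + sum(dot(q, q) for q in Q.values())
    return sq + lam * reg


def train(path, lam, eta, k=20, iterations=40, seed=0):
    """Run SGD, streaming the file once per iteration (R is never held in memory)."""
    rng = random.Random(seed)
    bound = math.sqrt(5.0 / k)  # entries in [0, bound] keep q_i . p_u within [0, 5]
    P, Q = {}, {}  # user id -> p_u, item id -> q_i (rows created on first sight)
    errors = []
    for _ in range(iterations):
        for u, i, r in read_ratings(path):
            if u not in P:
                P[u] = [rng.uniform(0.0, bound) for _ in range(k)]
            if i not in Q:
                Q[i] = [rng.uniform(0.0, bound) for _ in range(k)]
            p, q = P[u], Q[i]
            eps = 2.0 * (r - dot(q, p))
            # Both new vectors are computed from the old p and q, then stored together.
            new_q = [qj + eta * (eps * pj - 2.0 * lam * qj) for qj, pj in zip(q, p)]
            new_p = [pj + eta * (eps * qj - 2.0 * lam * pj) for qj, pj in zip(q, p)]
            Q[i], P[u] = new_q, new_p
        # E is evaluated only after the full pass, using a second read of the file.
        errors.append(total_error(path, P, Q, lam))
    return P, Q, errors
```

Calling `train` on the ratings file with the assignment's λ and a candidate η returns the learned factors together with one error value per iteration, which is the series step 4 asks to plot. Pure Python lists keep the sketch dependency-free. On a large ratings file, an array library would likely be much faster.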



