Question

Python code (normal Python and not pyspark) to answer the following question.

"Social computing research at the University of Minnesota" has released movie rating data sets of different sizes on the grouplens.org website.

Load the MovieLens 10M dataset, which consists of 10 million movie ratings. You can get the data by going to grouplens.org and, under the "datasets" tab, downloading the "MovieLens 10M Dataset". It is 63 MB.

a) Divide the data into 5 files of almost equal size and use the five files in the rest of the assignment. (2 points)

b) Sort the data from the highest-rated movie to the lowest-rated one.

Measure how much time sorting takes. (6 points)

  • Don't use the built-in sort function; write the sort function yourself.
  • Use the built-in sort function.
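One possible way to approach part (b) is sketched below. It is only an illustration: the hand-written sort is a merge sort (the assignment does not name an algorithm), the rows are toy tuples of the assumed form (user_id, movie_id, rating), and `merge_sort_desc`, `timed` and `rating_of` are hypothetical helper names.

```python
import time


def merge_sort_desc(rows, key):
    """Return a new list sorted from highest to lowest key (stable merge sort)."""
    if len(rows) <= 1:
        return list(rows)
    mid = len(rows) // 2
    left = merge_sort_desc(rows[:mid], key)
    right = merge_sort_desc(rows[mid:], key)
    merged = []
    i = j = 0
    while i < len(left) and j < len(right):
        # Taking from the left half on ties keeps equal ratings in input order.
        if key(left[i]) >= key(right[j]):
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged


def timed(func, *args, **kwargs):
    """Call func and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = func(*args, **kwargs)
    return result, time.perf_counter() - start


def rating_of(row):
    """Sort key: the rating is assumed to be the third field of each row."""
    return row[2]


# Toy rows standing in for one of the five files: (user_id, movie_id, rating).
rows = [(1, 10, 3.5), (2, 11, 5.0), (3, 10, 0.5), (4, 12, 5.0)]

by_hand, t_hand = timed(merge_sort_desc, rows, rating_of)
built_in, t_builtin = timed(sorted, rows, key=rating_of, reverse=True)

print(f"hand-written sort: {t_hand:.6f} s, built-in sort: {t_builtin:.6f} s")
```

On the full data set the pure-Python merge sort will likely be far slower than `sorted`, which is presumably the comparison the two bullet points are after.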

c) Create a histogram of the movie ratings.

Measure how much time it takes to create the histogram. (2 points)
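A minimal sketch for part (c), assuming the ratings have already been read into a plain list of numbers. Counting is done with `collections.Counter`; the optional plot uses matplotlib, which is an assumption since the question does not name a plotting library. `rating_histogram` and `plot_histogram` are hypothetical names.

```python
import time
from collections import Counter


def rating_histogram(ratings):
    """Count how often each rating value occurs, ordered by rating."""
    return dict(sorted(Counter(ratings).items()))


def plot_histogram(hist, path="rating_histogram.png"):
    """Save a bar chart of the counts (requires matplotlib)."""
    import matplotlib.pyplot as plt  # imported lazily; only needed for the plot

    fig, ax = plt.subplots()
    ax.bar(list(hist.keys()), list(hist.values()), width=0.4)
    ax.set_xlabel("Rating")
    ax.set_ylabel("Number of ratings")
    fig.savefig(path)
    plt.close(fig)


# Toy ratings standing in for the rating column of the data set.
ratings = [3.5, 5.0, 0.5, 5.0, 4.0, 3.5, 5.0]

start = time.perf_counter()
hist = rating_histogram(ratings)
elapsed = time.perf_counter() - start

print(hist)
print(f"histogram built in {elapsed:.6f} s")
```

Calling `plot_histogram(hist)` inside the timed region would include the drawing time in the measurement, if that is what the assignment intends.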

d) The data contains more than 10M ratings of 10681 movies by 71567 users.

Create a histogram of the number of times each movie got rated. 

Measure how much time it takes to create the histogram. (4 points)
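Part (d) could be sketched in two steps: count how many ratings each movie received, then bin those counts. The bin width of 2 and the toy movie IDs below are arbitrary choices for illustration, and `ratings_per_movie` and `bin_counts` are hypothetical names.

```python
import time
from collections import Counter


def ratings_per_movie(movie_ids):
    """Map each movie ID to the number of ratings it received."""
    return Counter(movie_ids)


def bin_counts(values, bin_width):
    """Histogram: map each bin's lower edge to the number of values
    falling in [edge, edge + bin_width)."""
    bins = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(bins.items()))


# Toy movie-ID column: one entry per rating.
movie_ids = [10, 10, 10, 11, 12, 12, 13, 10, 11, 14]

start = time.perf_counter()
per_movie = ratings_per_movie(movie_ids)
hist = bin_counts(per_movie.values(), bin_width=2)
elapsed = time.perf_counter() - start

print(hist)
print(f"histogram built in {elapsed:.6f} s")
```

For the real data a much wider bin (or logarithmic bins) would probably be more readable, since a few movies tend to collect far more ratings than the rest.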

e) Choose the lowest three bins of the histogram in part c and create a histogram of movie ratings for these three bins. Do the same thing for the top three bins of the histogram. (6 points)
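The wording of part (e) leaves some room for interpretation. The sketch below assumes that "lowest" and "top" refer to the three smallest and three largest rating values in the part (c) histogram, not to the bins with the fewest or most entries. `histogram_for_bins` is a hypothetical name and the ratings are toy data.

```python
from collections import Counter


def histogram_for_bins(ratings, which="lowest", n_bins=3):
    """Return the rating histogram restricted to the n_bins lowest
    (or highest) rating values."""
    hist = dict(sorted(Counter(ratings).items()))
    keys = list(hist)
    chosen = keys[:n_bins] if which == "lowest" else keys[-n_bins:]
    return {k: hist[k] for k in chosen}


# Toy ratings standing in for the rating column of the data set.
ratings = [0.5, 1.0, 1.0, 1.5, 3.0, 4.0, 4.5, 4.5, 5.0, 5.0, 5.0]

lowest = histogram_for_bins(ratings, which="lowest")
top = histogram_for_bins(ratings, which="top")

print("lowest three bins:", lowest)
print("top three bins:", top)
```

If the intended reading is instead "the three bins with the fewest entries", the selection could be changed to sort the bins by count before slicing.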

Step by Step Solution

Step: 1

To accomplish the tasks you described, you can use Python's standard libraries together with pandas. Here's a step-by-step approach. a) Divide the data into 5 almost equal-sized files. The code starts with import pandas as pd and then loads the MovieLens ...
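Since the answer text breaks off here, the following is only a sketch of how part (a) might look with pandas, not the original solution. It assumes the ratings file uses `::` as the field separator with the fields user ID, movie ID, rating and timestamp. The column names, the output file prefix and the helper names `load_ratings` and `split_into_files` are made up for illustration.

```python
import pandas as pd

# Assumed column layout of the ratings file; the names are illustrative.
COLUMNS = ["user_id", "movie_id", "rating", "timestamp"]


def load_ratings(path):
    """Read a '::'-separated ratings file into a DataFrame."""
    # A multi-character separator requires the slower Python parsing engine.
    return pd.read_csv(path, sep="::", engine="python", names=COLUMNS)


def split_into_files(ratings, n_parts=5, prefix="ratings_part"):
    """Write n_parts nearly equal slices of `ratings` to CSV files
    and return the list of file paths."""
    n_rows = len(ratings)
    # Integer boundaries so that slice sizes differ by at most one row.
    bounds = [n_rows * i // n_parts for i in range(n_parts + 1)]
    paths = []
    for i in range(n_parts):
        path = f"{prefix}_{i + 1}.csv"
        ratings.iloc[bounds[i]:bounds[i + 1]].to_csv(path, index=False)
        paths.append(path)
    return paths
```

A call such as `split_into_files(load_ratings("ratings.dat"))` would then produce five CSV files, where `ratings.dat` stands for wherever the downloaded ratings file was unpacked.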
