Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

You are to import the MovieLens dataset into a PostgreSQL database. You will need to download the file and unzip the file. You will need

You are to import the MovieLens dataset into a PostgreSQL database.
You will need to download the file and unzip the file. You will need to import the movies.csv file into PostgreSQL. Also you need to import the ratings.csv file into the database. You will need to create a relationship between the movies.movieid and ratings.movieid.
Make sure you import all the data and not just some of the data. Run a select count(*) from ratings or select count(*) from movies to make sure all your data is imported.
The movie data has foreign characters so you must ensure you movie titles are imported correctly. Also some movie titles have a ' in the name. Make sure these movies are also being imported correctly.
The ratings.rating is a scale from .5 to 5 in increments of .5, with 5 being the best.
The rating.userID and timestamp are not important for this project usually.
You are then to find the best movie of all time. Report how long the query took to run. It is up to you to decide what is considered the best movie of all time. You can first start looking for movies with perfect 5. But, some of the movies do not have enough ratings. So you want to lower your perfect 5 but have some level of minimum number of ratings.
Next you are to convert the MovieLens relational database into a Star Schema. Then run the same query to find the best movie of all time. Also report how long it takes to run.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Concepts of Database Management

Authors: Philip J. Pratt, Joseph J. Adamski

7th edition

978-1111825911, 1111825912, 978-1133684374, 1133684378, 978-111182591

More Books

Students also viewed these Databases questions