Answered step by step
Verified Expert Solution
Question
1 Approved Answer
You are to import the MovieLens dataset into a PostgreSQL database. You will need to download the file and unzip the file. You will need
You are to import the MovieLens dataset into a PostgreSQL database.
You will need to download the file and unzip the file. You will need to import the movies.csv file into PostgreSQL. Also you need to import the ratings.csv file into the database. You will need to create a relationship between the movies.movieid and ratings.movieid.
Make sure you import all the data and not just some of the data. Run a select count from ratings or select count from movies to make sure all your data is imported.
The movie data has foreign characters so you must ensure you movie titles are imported correctly. Also some movie titles have a in the name. Make sure these movies are also being imported correctly.
The ratings.rating is a scale from to in increments of with being the best.
The rating.userID and timestamp are not important for this project usually.
You are then to find the best movie of all time. Report how long the query took to run. It is up to you to decide what is considered the best movie of all time. You can first start looking for movies with perfect But, some of the movies do not have enough ratings. So you want to lower your perfect but have some level of minimum number of ratings.
Next you are to convert the MovieLens relational database into a Star Schema. Then run the same query to find the best movie of all time. Also report how long it takes to run.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started