Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

The TripAdvisor website has a wealth of informa 8 on on restaurants in many ci 8 es around the world. The goal of this exercise

The TripAdvisor website has a wealth of informa8on on restaurants in many ci8es around the world.
The goal of this exercise is to start with raw data obtained from TripAdvisor and apply data
preprocessing techniques we are learning to obtain a dataset that will be compa8ble with machine
learning algorithms. We will also explore feature selec8on. For this assignment I would like you to
use the Anaconda/Python/Scikitlearn plaForm we discussed in class.
A sample of data from 11,377 restaurants was extracted from the TripAdvisor website into a comma
separated values (.csv) file and is included with this assignment. Because cleaning the data is a lot
of work i recommend you choose a sample of 50 restaurants for your assignment. For each
restaurant the following data is available: restaurant name, star ra8ng, number of reviews, price
range, cuisine type, city and city popula8on. As the data is in raw format it must be edited before it
can be used for machine learning. Your task is to determine what needs to be done to convert the
raw data into usable format and then apply it to the data. Visualize your data by making some
scaUerplots of pairs of features. Test various machine learning techniques such as support vector,
decision tree and k-nearest neighbors. Explore the result of choosing various features. For example,
how accurately can you predict the star ra8ng of a restaurant given all of the other features of the
restaurant? Which features had the most predic8ve value?
The assignment must be documented in a wriUen report. You must discuss how you edited your
data by showing a sample of data before and aYer the preprocessing. You must provide some
graphical visualiza8on of the data to see if there is any clustering and discuss what your inves8ga8on
revealed to you about the data. For example is there any rela8onship between average price per
meal and the star ra8ng? Or is there any rela8onship between the size of the city and the average
price of the meal? Or perhaps between type of restaurant and star ra8ng? The assignment is worth
10 points of your grade and must be uploaded to the blackboard assignment by the last day of class.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Handbook Of Relational Database Design

Authors: Candace C. Fleming, Barbara Von Halle

1st Edition

0201114348, 978-0201114348

More Books

Students also viewed these Databases questions

Question

3. Is IBMs program really a mentoring program? Why or why not?

Answered: 1 week ago