Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

CSS 300 Module 3 Activity Worksheet Use this worksheet to complete your lab activity. Submit it to the applicable assignment submission folder when complete. Please

CSS 300 Module 3 Activity Worksheet Use this worksheet to complete your lab activity. Submit it to the applicable assignment submission folder when complete. Please ensure that your answers are easy to distinguish from the questions. You may also submit your work in a fresh Word document if you prefer. Deliverable: -A word document answering the following questions (or you may work in this worksheet; please ensure your answers/responses are easy to distinguish from the questions/prompts). Download the avocado.csv dataset. You can find out more about this dataset Kaggles Avocado Prices page. 1. For each numeric field, what is the count and five number summary (median, 1st - 3rd quartiles, minimum and maximum)? What is the standard deviation and mean? Use the following code sample below: df.describe() 2. Create a histogram for each numeric field. Based on the histograms, are there any signs of skewness for any of the fields? Are there any outliers? Do any columns have missing values? Use the following sample code: import matplotlib.pyplot as plt import seaborn as sns %matplotlib inline sns.set(style="whitegrid") plt.figure(figsize=(10,8)) filter_data = df.dropna(subset=['AveragePrice']) plt.figure(figsize=(14,8)) sns.distplot(filter_data['AveragePrice'], kde=False) 3. For the categorical fields, find out how many observations fall within each type by creating a bar chart using the following sample code: sns.set(style='darkgrid')

plt.figure(figsize=(20,10)) ax = sns.countplot(y='region', data=df) 4. Create a heat map to check relationships. Are any of the numeric fields correlated? corr = df.corr(method='pearson') import seaborn as sns sns.heatmap(corr)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Probabilistic Databases

Authors: Dan Suciu, Dan Olteanu, Christopher Re, Christoph Koch

1st Edition

3031007514, 978-3031007514

More Books

Students also viewed these Databases questions

Question

=+ What is the role of government in bargaining?

Answered: 1 week ago

Question

=+ Who is the negotiation partner at company level?

Answered: 1 week ago

Question

=+Which associations exist?

Answered: 1 week ago