Question
Pandas Assignment
Explore the UCI Repository and:
a.) Load 3 datasets and their appropriate column names. Do not use any of the examples as one of your datasets. Choose datasets that interest you.
b.) Choose one dataset and show what each column means.
- Submit your dataset work in a Jupyter Notebook.
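As a rough sketch of parts a.) and b.): many UCI data files ship without a header row, with the column names listed in a separate description file, so the names are usually passed to `pd.read_csv` by hand. Everything below is an assumption for illustration: the inline `RAW` text stands in for a downloaded file, and the column names and meanings in `COLUMN_INFO` are placeholders, not taken from any real dataset.

```python
import io

import pandas as pd

# Made-up stand-in for a headerless UCI data file (not real data).
# In a notebook you would pass a file path or URL to read_csv instead.
RAW = """\
5.1,3.5,a
4.9,,b
6.2,2.9,a
5.8,2.7,c
"""

# Hypothetical column names mapped to their meanings. For a real dataset,
# copy these from the dataset's documentation page.
COLUMN_INFO = {
    "length_cm": "Measured length in centimetres",
    "width_cm": "Measured width in centimetres",
    "label": "Category assigned to the sample",
}


def load_dataset(source, column_info):
    """Read a headerless CSV and attach the given column names."""
    return pd.read_csv(source, header=None, names=list(column_info))


def describe_columns(df, column_info):
    """Build a small table listing each column's dtype and meaning."""
    return pd.DataFrame(
        {
            "dtype": df.dtypes.astype(str),
            "meaning": pd.Series(column_info),
        }
    )


df = load_dataset(io.StringIO(RAW), COLUMN_INFO)
print(df.head())
print(describe_columns(df, COLUMN_INFO))
```

Keeping names and meanings in one dictionary means the same object serves both the load step and the "what each column means" step. Repeat `load_dataset` with a different source and dictionary for each of the three datasets.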
Choose one of the datasets you loaded with pandas in "Assignment I: Pandas" and calculate the following for all relevant columns.
- Look at the DataFrame shape
- head
- tail
- describe all numerical features
- describe all categorical features
- mean for all numerical columns
- median for all numerical columns
- mode for all numerical columns
- standard deviation for all numerical columns
- sum for all numerical columns
- count for all columns
- How many missing values are in each column
- value_counts for all categorical columns
- Subset your DataFrame based on one condition
- Subset your DataFrame based on two different conditions
- Subset your DataFrame based on three different conditions
- Use loc to slice your DataFrame for the 2 columns that returned the largest sum
- Use iloc to slice your DataFrame for the 2 columns that returned the largest mean
- Find 2 pandas DataFrame methods we did not cover and use them on your own data. Explain what each one does in a markdown description (see the pandas DataFrame documentation).
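The checklist above could be worked through roughly as follows. This is a minimal sketch on a made-up DataFrame: the column names `height`, `weight`, `age`, and `color` and the filter thresholds are placeholders, so substitute the dataset you loaded. The last item (two methods not covered in class) is left out because it depends on what the course covered.

```python
import pandas as pd

# Made-up data purely for illustration; replace with your loaded dataset.
df = pd.DataFrame(
    {
        "height": [1.0, 2.0, 2.0, 4.0, None],
        "weight": [10.0, 20.0, 30.0, 40.0, 50.0],
        "age": [1, 2, 3, 4, 5],
        "color": ["red", "blue", "red", "green", "red"],
    }
)

# Split numerical and categorical (non-numerical) columns once.
num = df.select_dtypes(include="number")
cat = df.select_dtypes(exclude="number")

# Shape, head, tail, and describe.
print(df.shape)
print(df.head())
print(df.tail())
print(num.describe())
print(cat.describe())  # count, unique, top, freq for categorical columns

# Per-column statistics for the numerical columns (NaN is skipped by default).
means = num.mean()
medians = num.median()
modes = num.mode()  # a DataFrame: one row per tied mode, padded with NaN
stds = num.std()
sums = num.sum()

# Non-missing count and missing count for every column.
counts = df.count()
missing = df.isna().sum()

# value_counts for each categorical column.
for col in cat.columns:
    print(cat[col].value_counts())

# Subsets with one, two, and three conditions; each condition needs its
# own parentheses when combined with & (and) or | (or).
one = df[df["weight"] > 15]
two = df[(df["weight"] > 15) & (df["color"] == "red")]
three = df[(df["weight"] > 15) & (df["color"] == "red") & df["height"].notna()]

# loc selects by label: the 2 columns with the largest sum.
top_sum_cols = sums.nlargest(2).index
by_loc = df.loc[:, top_sum_cols]

# iloc selects by integer position: look up where the 2 columns with the
# largest mean sit in the original DataFrame.
top_mean_cols = means.nlargest(2).index
positions = [df.columns.get_loc(c) for c in top_mean_cols]
by_iloc = df.iloc[:, positions]

print(by_loc)
print(by_iloc)
```

Computing the column positions with `get_loc` rather than hard-coding them keeps the `iloc` slice correct if the column order changes.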