Question
Requirements You are to create a program in Python that performs the following: 1. Loads the cps.csv file (assume it's in the current directory) and
Requirements You are to create a program in Python that performs the following:
1. Loads the cps.csv file (assume it's in the current directory) and create a DataFrame object fromit.
2. Based on the data contained in the cps.csv file, generates a dataframe with the following information:
a. School_ID
b. Short_Name
c. Is_High_School
d. Zip e. Student_Count_Total
f. College_Enrollment_Rate_School
g. Lowest Grade Offered (derived from Grades_Offered_All column)
h. Highest Grade Offered (derived from Grades_Offered_All column)
i. Starting Hour (derived from School_Hours column) The values for a-f are based on existing columns in the data.
For g-i, you will need to generate new columns which derives information from existing ones. Replace the missing numeric values with the mean for that column. Display the first 10 rows of this dataframe.
3. Displays the following information: a. Mean and standard deviation of College Enrollment Rate for High Schools b. Mean and standard deviation of Student_Count_Total for non-High Schools c. Distribution of starting hours for all schools d. Number of schools outside of the Loop Neighborhood (i.e., outside of zip codes 60601,60602, 60603, 60604, 60605, 60606, 60607, and 60616)
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started