Question
Import the CSV file named "CommuteStLouis.csv".

a. Produce a correlation matrix of age, distance and time. Which two numeric variables are most highly correlated? What is the correlation coefficient for that pair? Hint: If df is your dataframe, then you can use print(df.corr()).

               Age   Distance       Time
Age       1.000000  -0.000774   0.030292
Distance -0.000774   1.000000   0.830241
Time      0.030292   0.830241   1.000000

b. Create a scatterplot matrix of the numeric variables in the data. What do the figures on the diagonal going from the top left to the bottom right show? What can you say about the skewness of the various attributes?

[Figure: scatterplot matrix; visible axis labels: Distance, Time]

c. Produce a side-by-side boxplot of distance travelled by gender. Do the data in the file indicate that women tend to commute shorter distances?

[Figure: side-by-side boxplot of distance by gender]
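One way to approach the three parts with pandas is sketched below. This is only an illustration, not the expert's solution: the column names Age, Distance and Time are taken from the matrix printed in the question, while the gender column name "Sex" and the helper names most_correlated_pair, summarize and make_plots are assumptions made for this sketch. The plotting helper also assumes matplotlib is installed.

```python
import numpy as np
import pandas as pd

NUMERIC_COLS = ("Age", "Distance", "Time")  # names as printed in the question's matrix


def most_correlated_pair(df, cols=NUMERIC_COLS):
    """Return (corr_matrix, (var1, var2), r) for the strongest off-diagonal pair."""
    corr = df[list(cols)].corr()  # Pearson correlation is the pandas default
    # Keep only the strict upper triangle so each pair is considered once
    # and the diagonal of 1.0 values is ignored.
    upper = np.triu(np.ones(corr.shape, dtype=bool), k=1)
    pairs = corr.where(upper).stack()
    best = pairs.abs().idxmax()  # largest correlation in absolute value
    return corr, best, pairs.loc[best]


def summarize(path, cols=NUMERIC_COLS):
    """Part a (and the skewness half of part b): print the numeric summaries."""
    df = pd.read_csv(path)
    corr, pair, r = most_correlated_pair(df, cols)
    print(corr)
    print("Most correlated pair:", pair, "r =", round(r, 6))
    # Positive sample skewness suggests a long right tail, negative a long left tail.
    print(df[list(cols)].skew())
    return df


def make_plots(df, cols=NUMERIC_COLS, gender_col="Sex"):
    """Parts b and c. gender_col is an assumed column name; adjust to the file."""
    import matplotlib.pyplot as plt
    from pandas.plotting import scatter_matrix

    # Part b: requesting histograms on the diagonal shows each variable's
    # own distribution there.
    scatter_matrix(df[list(cols)], diagonal="hist", figsize=(8, 8))
    # Part c: one box of Distance per gender category, drawn side by side.
    df.boxplot(column="Distance", by=gender_col)
    plt.show()
```

With the file in the working directory, calling df = summarize("CommuteStLouis.csv") followed by make_plots(df) would run all three parts. In the matrix printed in the question, the largest off-diagonal entry is 0.830241 for Distance and Time, which is the pair part a is asking about. For part c, comparing the medians and box heights of the two groups is the usual way to judge whether one gender tends to commute shorter distances.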
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Step: 2
Step: 3