Question
The Lending Club is a lending company based in San Francisco, CA. They connect borrowers with investors through an online marketplace. They have provided publicly available data from 2007-2011. I have it under Project files, and it is also available here: https://www.lendingclub.com/info/download-data.action
Please see the data along with its data dictionary (a separate Excel sheet that explains what each variable means).
You just got an interview as an analyst for the Lending Club. The client wants you to analyze this large amount of information.
- Start by making initial observations of the data. What types of variables are present? Is there anything that catches your eye? A good analyst checks the data carefully. See the Quartz Guide to Bad Data: https://github.com/Quartz/bad-data-guide
- Use at least two ways to summarize the qualitative data present in the data set with frequency distributions and the various graphs/charts we have used in the class for Chapter 2.
- Do the same thing with the quantitative data present. Together, these four summaries should cover different aspects of the data set. Interpret your results.
- Pick two of the above graphs you chose and describe the shape of those distributions.
- Why did you choose the graphs you did? Do they have any benefits over the alternatives?
- Now I want you to take two variables you think might be related. Create a scatterplot. Find the covariance and correlation, and interpret the results.
- For the 2 examples you chose in Step 4, give me the measure of central tendency you feel best suits each data set. Then find their sample variances.
- Create a box plot for me for one of the examples.
- Depending on the distribution you get for Step 8, tell me the limits that lie 2 standard deviations on either side of the mean. What do these limits mean in relation to the variable?
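The covariance, correlation, sample variance, and two-standard-deviation steps above can be sketched in Python as follows. This is only an illustration of the formulas, not a required approach: the variable names loan_amnt and installment are assumptions about which columns you might pick, and the numbers are made up for the example rather than taken from the Lending Club file.

```python
from math import sqrt

def mean(xs):
    return sum(xs) / len(xs)

def sample_variance(xs):
    # Sample variance divides by n - 1, not n.
    m = mean(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

def sample_covariance(xs, ys):
    # Average product of paired deviations, again with n - 1.
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)

def correlation(xs, ys):
    # Pearson correlation: covariance scaled by both standard deviations.
    return sample_covariance(xs, ys) / sqrt(sample_variance(xs) * sample_variance(ys))

def two_sd_limits(xs):
    # Lower and upper limits 2 sample standard deviations from the mean.
    m, s = mean(xs), sqrt(sample_variance(xs))
    return m - 2 * s, m + 2 * s

# Made-up illustrative values (assumed column names, not real records).
loan_amnt = [5000, 10000, 15000, 20000, 25000]
installment = [160, 330, 480, 650, 800]

print("covariance:", sample_covariance(loan_amnt, installment))
print("correlation:", correlation(loan_amnt, installment))
print("sample variance of loan_amnt:", sample_variance(loan_amnt))
print("2 SD limits for loan_amnt:", two_sd_limits(loan_amnt))
```

How you read the two-standard-deviation limits depends on the shape of the distribution: for a roughly bell-shaped variable the empirical rule says about 95% of observations fall inside them, while Chebyshev's theorem guarantees at least 75% for any shape.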
Finally, give me a summary of what you have discovered as a whole from this data set. You want the Lending Club to know that you are very interested in working with them. Give them something to think about.
Submit on Canvas. Send me whatever work you have done with Excel or any other tool you wish to use, all in 1 document. Send me the formulas/code used. DO NOT handwrite the calculations. We will discuss tools in class.
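If you choose code rather than Excel, a frequency distribution for a qualitative variable can be sketched as below. This is a minimal example using only the Python standard library; the helper name frequency_table and the list of grades are hypothetical, with made-up values standing in for a categorical column from the data set.

```python
from collections import Counter

def frequency_table(values):
    # Returns (category, count, relative frequency) rows, most common first.
    counts = Counter(values)
    total = sum(counts.values())
    return [(cat, n, n / total) for cat, n in counts.most_common()]

# Made-up illustrative categories, not real Lending Club records.
grades = ["A", "B", "B", "C", "A", "B"]

for category, count, share in frequency_table(grades):
    print(f"{category:<5}{count:>5}{share:>8.1%}")
```

The same rows can feed a bar chart or pie chart, so the table and the graph stay consistent with each other.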