Question
Dataset: credit.csv. The description of the variables is in an excel file named descriptionCreditScoringData.xls. This data set consists of genuine credit records from a South
Dataset: credit.csv. The description of the variables is in an excel file named descriptionCreditScoringData.xls. This data set consists of genuine credit records from a South German bank. The aim would generally be to predict which customers will repay the loan in full and which of them will not. There are 1000 records, and all amounts are in Deutschmarks. Answer the following using suitable approaches whether descriptive/graphical or inferential and using a suitable package e.g. StatTools. Justify your answers in the main text and include all workings as appendix.
a) Wherever possible and meaningful, provide a brief analysis of each variable, including their distribution, outliers, etc.
b) Does there seem to be differences in age, length of loan, or amount of loan for those who repaid their loans and those who defaulted?
c) Explore and describe the association of each variable with the credit status.
d) Does the Length of the loan vary with the use of the loan?
e) Determine relationships, if any, between Age, Length of loan and Amount of loan.
The Dataset can be found at the following :
https://online.stat.psu.edu/stat857/node/215/
Data Files - > German_credit.csv
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started