Question
A credit score is a number, based on the analysis of a persons credit files, to represent the creditworthiness of the person. Lenders, such as
A credit score is a number, based on the analysis of a persons credit files, to represent the creditworthiness of the person. Lenders, such as banks and credit card companies, use credit scores to evaluate the potential risk posed by lending money to consumers. Credit scoring is not limited to banks. Other organizations, such as mobile phone companies, insurance companies, landlords, and government, employ the same techniques. Credit scoring also has much overlap with data mining.
A consumer services agency is interested in providing a service in which an individual can estimate their own credit score. The Excel file CreditScoreData.xlsx contains data on an individuals credit score and other variables. The description of these nine (9) variables can be found in the worksheet Description.
Make a Standard Partition of the data into Training, Validation, and Test sets. Select all the 9 variables to be in the partition, use 12345 as the seed in the randomized sampling, and specify 50% of observations in the training set, 30% in the validation set, and 20% in the test set.
Predict the individuals credit scores using k-Nearest Neighbors with up to k = 20. Use CreditScore as the output variable and all the other variables as input variables. In Step 2 of XLMiners k-Nearest Neighbors Prediction procedure, be sure to Normalize input data and to Score on best k between 1 and specified value. Select Summary Report for Score Training Data, and Score Validation Data. Select Detailed Report, Summary Report, and Lift Charts for Score Test Data.
Based on the results from XLMiner, answer the following questions.
What is the best k chosen? What does it mean?
Compare the RMSE on the test set to the RMSE on the validation set. Please comment.
What is the average error on the test set? What does it suggest (e.g., an underestimate or overestimate in the prediction)?
Using the best k, predict the CreditScore for two individuals with the information given in the table below (For your convenience, this table is stored in the worksheet NewData).
BureauInquiries | CreditUsage | TotalCredit | CollectionReports | MissedPayments | HomeOwner | CreditAge | TimeOnJob |
3 | 0.4 | 15,000 | 1 | 2 | 0 | 6 | 3 |
5 | 0.3 | 27,000 | 0 | 1 | 1 | 8 | 9 |
Report the predicted values (rounded to the nearest integers) of CreditScore for the two individuals.
HERE IS CREDIT.XLSX LINK
https://ufile.io/y7hdz
Copy and past this link to your browser, where it says Download Slow (free) wait 5 second and click download !!
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started