Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

A credit score is a number, based on the analysis of a persons credit files, to represent the creditworthiness of the person. Lenders, such as

A credit score is a number, based on the analysis of a persons credit files, to represent the creditworthiness of the person. Lenders, such as banks and credit card companies, use credit scores to evaluate the potential risk posed by lending money to consumers. Credit scoring is not limited to banks. Other organizations, such as mobile phone companies, insurance companies, landlords, and government, employ the same techniques. Credit scoring also has much overlap with data mining.

A consumer services agency is interested in providing a service in which an individual can estimate their own credit score. The Excel file CreditScoreData.xlsx contains data on an individuals credit score and other variables. The description of these nine (9) variables can be found in the worksheet Description.

Make a Standard Partition of the data into Training, Validation, and Test sets. Select all the 9 variables to be in the partition, use 12345 as the seed in the randomized sampling, and specify 50% of observations in the training set, 30% in the validation set, and 20% in the test set.

Predict the individuals credit scores using k-Nearest Neighbors with up to k = 20. Use CreditScore as the output variable and all the other variables as input variables. In Step 2 of XLMiners k-Nearest Neighbors Prediction procedure, be sure to Normalize input data and to Score on best k between 1 and specified value. Select Summary Report for Score Training Data, and Score Validation Data. Select Detailed Report, Summary Report, and Lift Charts for Score Test Data.

Based on the results from XLMiner, answer the following questions.

What is the best k chosen? What does it mean?

Compare the RMSE on the test set to the RMSE on the validation set. Please comment.

What is the average error on the test set? What does it suggest (e.g., an underestimate or overestimate in the prediction)?

Using the best k, predict the CreditScore for two individuals with the information given in the table below (For your convenience, this table is stored in the worksheet NewData).

BureauInquiries

CreditUsage

TotalCredit

CollectionReports

MissedPayments

HomeOwner

CreditAge

TimeOnJob

3

0.4

15,000

1

2

0

6

3

5

0.3

27,000

0

1

1

8

9

Report the predicted values (rounded to the nearest integers) of CreditScore for the two individuals.

HERE IS CREDIT.XLSX LINK

https://ufile.io/y7hdz

Copy and past this link to your browser, where it says Download Slow (free) wait 5 second and click download !!

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Finance questions

Question

What is a biological asset?

Answered: 1 week ago