Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Using a dataset representing applications for a bank loan, the task will be to build a logistic regression model that can predict whether or not

Using a dataset representing applications for a bank loan, the task will be to build a logistic regression model that can predict whether or
not a loan will be approved.
Useful R and Python functions for this assignment are:
For this activity, perform the following:
Load the "application_record.csv," located in the topic Resources, into a data frame and perform initial exploratory tasks:
Display representative portions of the data.
Check for missing values and clean the data.
Check for outliers and decide if and how to process them.
Formally state what your model will predict using the variables in the data.
Split the data into a training set and a testing set with a split ratio of 70:30.
Build the Predictive Model:
Define the formula for the gim().
Run the model.
Interpret the results, referring to the p-values.
Evaluate the Model Performance:
Compare the predicted versus actual values.
Search for any predictions that differ significantly from the actual values.
Validate the Model:
Produce a Gain and Lift chart and use it to describe the performance of the model.
Measure the Variation Inflation Factor (VIF) to test for multicollinearity. If changes are necessary to the model based in VIF, state and
implement them.
Has the formula, as defined in the previous section, changed? Why or why not?
If changes to the model occurred, repeat the validation steps on the new model.
Make Predictions:
Demonstrate a few examples of predictions your model can make.
Validate the predictions by calculating the misclassification error.
Interpret the results.
State a few suggestions for improving the model.
RangeIndex: 438557 entries, 0 to 438556
Data columns (total 18 columns):
# Column Non-Null Count Dtype
----------------------------
0 ID 438557 non-null int64
1 CODE_GENDER 438557 non-null object
2 FLAG_OWN_CAR 438557 non-null object
3 FLAG_OWN_REALTY 438557 non-null object
4 CNT_CHILDREN 438557 non-null int64
5 AMT_INCOME_TOTAL 438557 non-null float64
6 NAME_INCOME_TYPE 438557 non-null object
7 NAME_EDUCATION_TYPE 438557 non-null object
8 NAME_FAMILY_STATUS 438557 non-null object
9 NAME_HOUSING_TYPE 438557 non-null object
10 DAYS_BIRTH 438557 non-null int64
11 DAYS_EMPLOYED 438557 non-null int64
12 FLAG_MOBIL 438557 non-null int64
13 FLAG_WORK_PHONE 438557 non-null int64
14 FLAG_PHONE 438557 non-null int64
15 FLAG_EMAIL 438557 non-null int64
16 OCCUPATION_TYPE 304354 non-null object
17 CNT_FAM_MEMBERS 438557 non-null float64
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Processing

Authors: David Kroenke

11th Edition

0132302675, 9780132302678

More Books

Students also viewed these Databases questions