Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Tableau: Data Exploration Report ( loans ) Background You are a data scientist at Lending Club and you are trying to improve your accuracy in
Tableau: Data Exploration Report loans
Background
You are a data scientist at Lending Club and you are trying to improve your accuracy in predicting how likely each customer will be to pay back their loan. You need to begin by working through the Data Understanding Phase for this project. Recall that there are four reports typically generated from this phase:
Initial Data Collection Report
Data Description Report
Data Exploration Report
Data Cleaning Report
However, you are only requried to generate Reports and in a single combined Word document for this assignment using the Lending Club data set below. There is a sample report included for download in the files below. However, to keep this assessment to a realistic scope, you can ignore the hypotheses and referencescitations used in the example report. You only need to include the analyses themselves. See the details below.
Task Description:
Use Word and any combination of Tableau and Excel you would like to complete the tasks outlined in the questions below, which will walk you through creating the Data Description Report and Data Exploration Report.
Data Source:
Use the lclarge.csv file available to download below.
Drag the table: lcLoans into the entity view in Tableau. Select the "Extract" option for the connection in Tableau.
Data Dictionary:
Features about the loan
loanstatus: current status of the loan
loanstatusnumeric: a rankordered numeric version of loanstatus
loanamount: the listed amount of the loan applied for by the borrower
issued: the date the loan was fundedissued
term: the number of payments on the loan
intrate: the interest rate on the loan
installment: the monthly payment owed by the borrower
totalpymnt: payments received to date for total amount funded
totalrecprncp: payments received to date for total amount funded
totalrecint: interest received to date
totalreclatefee: late fees received to date
recoveries: post charge off gross recovery ie if the loan was charged off, how much money was recovered afterward, if any
title: the loan title provided by the borrower
purpose: a category provided by the borrower for the loan request
Features obtained from the borrower before the loan was issued
emptitle: the job title supplied by the borrower
emplength: employment length in years
homeownership: the homeownership status provided by the borrower
annualincome: the selfreported annual income provided by the borrower
verificationstatus: was income verified by LC the source, or not verified
Features obtained from the credit bureau about the borrower before issued
accnowdelinq: the number of accounts on which the borrower is now delinquent
delinqyrs: the number of days pastdue incidences of delinquency in the borrower's credit file for the past years
earliestcrline: the month the borrower's earliest reported credit line was opened
inqlastmths: the number of unsecured inquiries in the past months
mthssincelastdelinq: the number of months since the borrower's last delinquency
mthssincelastrecord: the number of months since the last public record
openacc: the number of open credit lines in the borrower's credit file
pubrec: number of derogatory public records
revolbal: total credit revolving balance
revolutil: the amount of credit the borrower is using relative to all available revolving credit
totcollamt: total collection amounts ever owed
totcurbal: total current balance of all accounts
totalacc: the total number of credit lines currently in the borrower's credit file
totalrevhilim: total credit limit on revolving accounts
Features engineered by LC based on the credit bureau data
dti: a ratio calculated using the borrower's total monthly payments on the total debt obligations, excluding mortgages and the requested LC loan, divided by the borrower's combined selfreported monthly income
grade: the likelihood that the loan will be paid back
subgrade: a more granular version of grade
To limit the scope of this assignment, you will not need to include every feature above in your report. Instead, only complete the tasks required in the questions below. Additional details:
Use loanstatusnumeric as the label for this project.
It represents the outcomes we are interested in: charged off default late late grace period current fully paid
You do NOT need to write hypotheses H H etc or Summary descriptions for your report as demonstrated in the example project documentation included. You can keep this simple by only generating the visualizations, metrics, and statistics required by the questions below.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started