Question
A. Preliminary data exploration, understanding and minor corrections in the Excel file:
The starting point for this subtask is the dataset itself, including the data dictionary and the preparation
tasks/notes in the second worksheet (Rawdatadict).
The next suggested step is to filter each column in the raw data to see whether there are blanks
(missing values) or obvious data entry errors; the latter should be corrected in the Excel spreadsheet
before loading. The anomaly in the column Dependents should also be corrected in the Excel
spreadsheet first, making reasonable assumptions. Finally, generate and add the variables
Incometotal and LoanIncomeratio as explained in the data dictionary tab.
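The column-by-column check described above can also be scripted as a cross-check on the manual filtering in Excel. The following is only a minimal sketch in plain Python: the sample rows, the column names other than Dependents, and the value "3+" are hypothetical placeholders, not taken from the actual dataset. It counts blank cells per column and lists Dependents entries that are not plain whole numbers, so they can be reviewed and corrected by hand.

```python
# Hypothetical sample rows; the real columns are listed in the data dictionary.
rows = [
    {"Dependents": "0", "ApplicantIncome": "5000", "CoapplicantIncome": "1500"},
    {"Dependents": "3+", "ApplicantIncome": "4000", "CoapplicantIncome": ""},
    {"Dependents": "", "ApplicantIncome": "6000", "CoapplicantIncome": "0"},
]


def count_blanks(rows):
    """Count empty or whitespace-only cells per column."""
    counts = {}
    for row in rows:
        for col, value in row.items():
            counts.setdefault(col, 0)
            if value is None or str(value).strip() == "":
                counts[col] += 1
    return counts


def non_integer_values(rows, col):
    """Return the distinct non-blank values in `col` that are not plain digits."""
    return sorted(
        {r[col] for r in rows if r[col].strip() and not r[col].strip().isdigit()}
    )


blank_counts = count_blanks(rows)
odd_dependents = non_integer_values(rows, "Dependents")
print(blank_counts)
print(odd_dependents)
```

How a flagged value is corrected (for example, which number replaces it) remains an assumption that should be stated in the write-up.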
B. Data preparation for multiple analysis tasks:
After the initial error removal in the source file (A), you now have to perform the following data
preparation steps, in this order:
1. Missing value handling/transformation;
2. Data transformation for statistical analysis, e.g. string/categorical to numeric, one-to-many;
3. Outlier identification and treatment.
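To make the required order concrete, here is a minimal sketch of the three steps in plain Python. Everything specific in it is an assumption made for illustration: the records and the column names Gender and LoanAmount are hypothetical, median/mode imputation is just one possible way to handle missing values, and capping at 1.5 × IQR fences is just one possible outlier treatment. The assignment itself does not prescribe any of these choices.

```python
from statistics import median, mode, quantiles

# Hypothetical records after the Excel corrections; None marks a missing value.
records = [
    {"Gender": "Male", "LoanAmount": 120.0},
    {"Gender": "Female", "LoanAmount": None},
    {"Gender": None, "LoanAmount": 100.0},
    {"Gender": "Male", "LoanAmount": 140.0},
    {"Gender": "Male", "LoanAmount": 110.0},
    {"Gender": "Female", "LoanAmount": 130.0},
    {"Gender": "Male", "LoanAmount": 900.0},
]


def impute(records, col, numeric):
    """Step 1: fill missing values with the median (numeric) or mode (categorical)."""
    present = [r[col] for r in records if r[col] is not None]
    fill = median(present) if numeric else mode(present)
    for r in records:
        if r[col] is None:
            r[col] = fill


def one_hot(records, col):
    """Step 2: replace a categorical column with one 0/1 column per category."""
    categories = sorted({r[col] for r in records})
    for r in records:
        value = r.pop(col)
        for cat in categories:
            r[f"{col}_{cat}"] = 1 if value == cat else 0


def cap_outliers(records, col, k=1.5):
    """Step 3: cap values outside [Q1 - k*IQR, Q3 + k*IQR] at the nearest fence."""
    q1, _, q3 = quantiles([r[col] for r in records], n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    for r in records:
        r[col] = min(max(r[col], low), high)
    return low, high


# Apply the steps in the prescribed order.
impute(records, "LoanAmount", numeric=True)
impute(records, "Gender", numeric=False)
one_hot(records, "Gender")
low, high = cap_outliers(records, "LoanAmount")
print(low, high)
```

The order matters here: the quartiles in step 3 are computed on the already imputed column, so a different imputation choice in step 1 can shift the outlier fences.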
For this subtask you have essentially three options:
1. Use KNIME workflows, either one for each analysis task or a single workflow for all tasks;
2. Use only Excel (or any other tool) and load the data for each analysis task;
3. Use a combination of the two options above, e.g. create a KNIME workflow to prepare the data
and write or copy them to Excel files after outlier correction, and then use these files for the analysis
tasks in Excel and KNIME; or treat missing values and convert category variables in Excel, and do
the outlier analysis only in KNIME.