Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Please note: Though I have mentioned all the Data Mining steps below, you are only required to answer Step 4 : Data Modeling and Step
Please note:
Though I have mentioned all the Data Mining steps below, you are only required
to answer Step : Data Modeling and Step : Data Evaluation.
I prefer that you use Python for the project. Excel is fine too.
Where to Submit: Blackboard
What to submit?
points Model Step of the Datamining Process Look for more detailed
instructions below in Step
points Model Evaluation Step of the Datamining Process Look for more
detailed instructions below in Step
Step : Business UnderstandingProblem:
Is there any indicator they predict the average SAT score of school and can we use the
learnings to help schools improve their students SAT score?
Set : Data Understanding:
You are provided with AP and SAT data. In the CSV file APSATData.csv there are
independent variablesattributes
NoAPTestTakers
TotalExamTaken
NoExamPassed
Dependent variable Target
SATMathScore
Step : Data Preparation:
For this Project, I took care of it I performed the following exercise
Exclude categorical attributes that are difficult to transform to Numeric ones
Add dummy values for some missing one
Deleted rows which more than few missing attributes values
Step : Modeling Linear Regression
Use Excel or Python preferred to perform modeling. Use the APSATData.csv file
create the models.
Model : Use the following independent variables to build a predictive model for the
target variable SATMathScore
NoAPTestTakers
TotalExamTaken
Model : Use the all independent variables to build a predictive model for the target
variable SATMathScore
Submit the following:
The work points for each model. Total points
a If you used Excel, submit the regression output in excel for both the
models
b If you used Python, submit the Jupyter notebook. The code has to run.
points What is the regression equation for both these models?
points Compare the models you created. Which one is the better model
based on MSE? Provide the MSE.
Set : Evaluation:
I have also provided testingcsv Use the data in this file to evaluate your models.
Submit the following:
points Test your two model from Step using the test data. Which one is the better
model now use MSE Has your answer changed from Step
Set : Deployment:
Think through how you would use the model findings.
Are there important ethical considerations? Nothing to submit.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started