Question

The goal of this assignment is to practice classification methods and to apply them to a dataset using SPSS data analysis tools. The study aims at determining whether one can predict the severity of heart disease based on a number of factors including age, blood pressure, and many others.

Assignment preparation

This assignment will be using the SPSS data analysis tool. The dataset will be used in this analysis (see Assignment 2 for instructions on how to create this file). It is recommended to replace missing values (character ?) with an empty character. This can be accomplished through a text editor before importing the file into SPSS, or within SPSS. The expected results for this assignment will be based on this assumption.

Classification in SPSS

The dataset heart-ch.txt is ready to be processed for classification. The target attribute is the last one by default: num. Evaluation of a classifier generally uses a training set first to train the model, then a test set on which predictions by the model are compared with known classes.

a. Switch to the Analyze tab. This dataset is going to be analyzed with Binary Logistic Regression. The default options can be used; however, it is recommended to use 95 for CI for exp(B). By selecting num as the dependent variable, and all the other variables as covariates, which result do you obtain (Block 1 Classification Table)?

Method                 Significance    R Square    Accuracy
Logistic regression

b. Split the data into 75% for the training set and 25% for the test set, and compare results between Logistic Regression and Nearest-neighbor (both from the Analyze tab).

Method                 Accuracy
Logistic regression
Nearest-neighbor

c. What do you conclude about the predictability of heart disease (from the factors in the dataset)? Explain.

d. Which factors are most predictive of heart disease (hint: these are the variables from the Variables in the Equation table having Sig <= 0.05; the higher B, the larger the effect)?
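
The preparation step (replacing the character ? with an empty character) and the evaluation described in part b (a 75%/25% split, then accuracy measured on the test set) can also be sketched outside SPSS. The following is a minimal Python sketch, not the SPSS procedure: the function names, the comma separator, the fixed random seed, and the sample row are all assumptions made for illustration, and the actual layout of heart-ch.txt may differ.

```python
import random


def clean_missing(line, sep=","):
    """Replace every field that is exactly '?' with an empty string.

    Assumption: fields are separated by `sep` (a comma here).
    """
    fields = line.split(sep)
    return sep.join("" if field.strip() == "?" else field for field in fields)


def train_test_split(rows, train_fraction=0.75, seed=0):
    """Shuffle a copy of `rows` and split it into training and test parts."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def accuracy(predicted, actual):
    """Fraction of predictions that match the known classes."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("predicted and actual must be non-empty and equal in length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Made-up sample row, for illustration only (not taken from the dataset).
cleaned = clean_missing("63,1,?,145,0")
```

Here `clean_missing` mirrors the text-editor replacement, `train_test_split` produces the 75%/25% partition, and `accuracy` is the quantity to report for each method once its predictions on the test part are available.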
