Question
1) Pick any dataset relevant to your major that you would like to analyze.
2) Randomly divide it into two chunks containing 80% and 20% of the records.
3) Select the input variables (at least two) that you will use for cluster analysis.
4) Provide reasoning for the selection.
5) Use SPSS or another tool to apply an appropriate cluster analysis method to the larger part of the dataset.
6) Identify the clusters and describe their centroids and business meaning.
7) If the clusters are poorly identified by the analysis or their business meaning is hard to describe, change your variable selection and return to step 3.
8) For at least 5 records from the remaining smaller part of the dataset, identify the closest cluster centroid. This is a prediction of which cluster those records belong to. Because these records were not used in cluster identification, the prediction qualifies as an example of predictive analytics.
9) Submit a Word report describing each step and its result, and include the relevant scripts and outputs produced by the tool you use.
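The workflow in the question (random 80/20 split, clustering the larger part, then assigning held-out records to the closest centroid) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data are synthetic, the two input variables are hypothetical scores on a 0 to 10 scale, k-means with k = 2 is an assumed choice of method, and the helper names (split_records, nearest_centroid, k_means) are made up for this sketch. With a real dataset the variables would usually be standardized first so that no single variable dominates the distance.

```python
import random
from math import dist


def split_records(records, train_fraction=0.8, seed=0):
    """Shuffle a copy of the records and split it into train and holdout parts."""
    rng = random.Random(seed)
    shuffled = list(records)
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def nearest_centroid(point, centroids):
    """Return the index of the centroid closest to point (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))


def k_means(points, k, iterations=100, seed=0):
    """Basic Lloyd's algorithm; returns a list of k centroids as tuples."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k randomly chosen records
    for _ in range(iterations):
        # Assignment step: group every point with its closest centroid.
        groups = [[] for _ in range(k)]
        for p in points:
            groups[nearest_centroid(p, centroids)].append(p)
        # Update step: move each centroid to the mean of its group
        # (an empty group keeps its previous centroid).
        new_centroids = [
            tuple(sum(col) / len(g) for col in zip(*g)) if g else centroids[i]
            for i, g in enumerate(groups)
        ]
        if new_centroids == centroids:  # assignments stopped changing
            break
        centroids = new_centroids
    return centroids


# Synthetic, made-up data: two hypothetical variables per record,
# drawn from two well-separated groups (around (2, 2) and (8, 8)).
rng = random.Random(42)
records = [(rng.gauss(2, 0.5), rng.gauss(2, 0.5)) for _ in range(50)]
records += [(rng.gauss(8, 0.5), rng.gauss(8, 0.5)) for _ in range(50)]

# Random 80% / 20% split of the records.
train, holdout = split_records(records)

# Cluster only the larger part; sort so cluster 0 is the low-valued group.
centroids = sorted(k_means(train, k=2))

# Predict a cluster for 5 held-out records via the closest centroid.
predictions = [(rec, nearest_centroid(rec, centroids)) for rec in holdout[:5]]
for rec, cluster in predictions:
    print(f"record ({rec[0]:.2f}, {rec[1]:.2f}) -> cluster {cluster}")
```

The held-out records play no part in computing the centroids, which is what makes the final assignment a prediction rather than a description. Describing each centroid in business terms and justifying the variable selection still has to be done by hand for the report.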