Answered step by step
Verified Expert Solution
Question
1 Approved Answer
ThedatasetCereals.csvincludesnutritionalinformation,storedisplay,andconsumer ratings for 77 breakfast cereals. Data Preprocessing. Remove all cereals with missing values. ApplyhierarchicalclusteringtothedatausingEuclideandistancetothenormalized measurements.Comparethedendrogramsfromsinglelinkageandcompletelinkage,and look at cluster centroids. Comment on the structure of
- ThedatasetCereals.csvincludesnutritionalinformation,storedisplay,andconsumer ratings for 77 breakfast cereals. Data Preprocessing. Remove all cereals with missing values.
- ApplyhierarchicalclusteringtothedatausingEuclideandistancetothenormalized measurements.Comparethedendrogramsfromsinglelinkageandcompletelinkage,and look at cluster centroids. Comment on the structure of the clusters and on their stability. Hint: Toobtainclustercentroidsforhierarchicalclustering,computethe averagevaluesofeach cluster members, using the aggregate() function.
- Which method leads to the most insightful or meaningful clusters?
- Chooseoneofthe methods.Howmanyclusterswouldyouuse? Whatdistance isusedfor this cutoff? (Look at the dendrogram.)
- The elementary public schools would like to choose a set of cereals to include in their daily cafeterias. Every day a different cereal is offered, but all cereals should support a healthy diet. Forthisgoal,youarerequestedtofindaclusterof "healthy cereals." Should the data be normalized? If not, how should they be used in the cluster analysis?
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started