Question
Customer Rating of Breakfast Cereals. The dataset Cereals.jmp includes nutritional information, store display, and consumer ratings for 77 breakfast cereals.

Data preprocessing. Note that some cereals are missing values. These will be automatically omitted from the analysis. Use the Columns Viewer to identify which variables are missing values and how many values are missing.
- [ Select ] missing
- The variables [ Select ] have missing values in total.

The Hierarchical platform dialog provides an option to standardize (Standardize Data). Should this be selected? Why?
- The scales used for measurement [ Select ] different, so the distance measure [ Select ] dominated by variables with larger values. Hence, the Standardize Data option in the Hierarchical platform [ Select ] selected.

Apply hierarchical clustering to the data using single linkage and complete linkage (use only continuous variables in Y, Columns and cast the variable name to Label). Look at the dendrograms and the parallel plots. Comment on the structure of the clusters and on their stability.
- With [ Select ], small changes in the distance cause large changes in the number of clusters. For example, the distance from 55 to 30 clusters is very narrow: clusters change very quickly over a short distance. So, [ Select ] is more unstable. The change in clusters for [ Select ] is more gradual.
- Hence, the [ Select ] method leads to the most insightful or meaningful clusters.
- In the Distance Graph there is a sharp upward bend at cluster number = [ Select ]. This gives an idea about the optimal number of clusters that will be used in clustering.

The public elementary schools would like to choose a set of cereals to include in their daily cafeterias. Every day a different cereal is offered, but all cereals should support a healthy diet. For this goal you are requested to find a cluster of "healthy cereals." Based on the variables at hand, how would you characterize "healthy cereals"?
- [ Select ] calories, [ Select ] protein, [ Select ] fat, [ Select ] fiber, [ Select ] carbo, [ Select ] sugar, [ Select ] potass, [ Select ] vitamins.

Use the red triangle options Cluster Summary, Cluster Means, and Parallel Coord Plots to check cluster means across the variables. Which cluster of cereals is the most "healthy"?
- [ Select ] is the healthiest, with high protein, fiber, and potass and low calories, fat, and carbs. However, this cluster contains the high-bran and high-fiber cereals that students generally might not like. An alternative might be [ Select ], which is moderately high in the "good" characteristics (protein, vitamins, potassium) and which students would be more likely to eat.
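The workflow the question describes (count missing values, drop incomplete rows, standardize, cluster with single and complete linkage, inspect joining distances, compare cluster means) can be sketched outside JMP as well. The following is a minimal Python sketch using pandas and SciPy, not the JMP Hierarchical platform itself. The data are synthetic random numbers generated only for illustration and are not the values in Cereals.jmp; the column names are taken from the question, while the row count, the number of clusters (4), and the use of the population standard deviation are assumptions.

```python
import numpy as np
import pandas as pd
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)

# Synthetic stand-in for the cereal table (NOT the real Cereals.jmp values).
cols = ["calories", "protein", "fat", "fiber", "carbo", "sugar", "potass", "vitamins"]
centers = np.array([105.0, 2.5, 1.0, 2.0, 15.0, 7.0, 95.0, 28.0])
scales = np.array([20.0, 1.0, 1.0, 2.5, 4.0, 4.5, 70.0, 22.0])
data = pd.DataFrame(centers + scales * rng.standard_normal((30, 8)), columns=cols)

# Knock out a few cells so there is something to count (assumed positions).
data.iloc[3, 4] = np.nan   # carbo
data.iloc[3, 5] = np.nan   # sugar
data.iloc[10, 6] = np.nan  # potass

# Step 1: which variables have missing values, and how many?
missing_per_column = data.isna().sum()

# Rows with any missing value are omitted, as the question says JMP does.
complete = data.dropna()

# Step 2: standardize so that large-scale variables (e.g. potass)
# do not dominate the Euclidean distance.
z = (complete - complete.mean()) / complete.std(ddof=0)

# Step 3: hierarchical clustering with single and complete linkage.
merges = {m: linkage(z.to_numpy(), method=m) for m in ("single", "complete")}

# Step 4: the third column of a SciPy linkage matrix holds the joining
# distance of each merge; plotting it against the number of clusters
# gives a distance graph in which a sharp bend can be looked for.
join_distance = {m: Z[:, 2] for m, Z in merges.items()}

# Step 5: cut the complete-linkage tree into at most 4 clusters (assumed k)
# and compare cluster means on the original, unstandardized scale.
labels = fcluster(merges["complete"], t=4, criterion="maxclust")
cluster_means = complete.groupby(labels).mean()

print(missing_per_column[missing_per_column > 0])
print(cluster_means.round(1))
```

With real data, comparing how quickly `join_distance["single"]` and `join_distance["complete"]` grow over the last merges is one way to judge which linkage gives the more stable cluster structure, and `cluster_means` plays the role of JMP's Cluster Means report when looking for a "healthy" cluster.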