Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Data file for this exercise: The data is stored in a SAS data file called cereals.sas7bdat located in mydata library on the SAS OnDemand
Data file for this exercise: The data is stored in a SAS data file called cereals.sas7bdat located in mydata library on the SAS OnDemand server. Variables in that file are as follows: Variable name mfr type calories Description Name of cereal Manufacturer of cereal where A = American Home Food Products; G = General Mills; K = Kelloggs; N = Nabisco; P = Post; Q = Quaker Oats; R = Ralston Purina C = cold, H = hot Calories per serve Grams of fat protein Grams of protein Milligrams of sodium fat sodium fiber carbo sugars potass vitamins shelf weight cups Grams of dietary fibre Grams of complex carbohydrates Grams of sugar Milligrams of potasium Vitamins and minerals, 0, 25, or 100, indicating the typical percentage of FDA recommended Display shelf (1 = bottom, 2 = middle, or 3 = top, counting from the floor) Weight in ounces of one serving Number of cups in one serving rating Rating of the cereals calculated from Consumer Reports, out of 100. The higher the score, the healthier the cereals Breakfast cereals are big business. According to Choice magazine, in 2011 '...we spent $1.17 billion on ready-to-eat cereal, and munched our way through 169,470 tonnes of it - that's about 10 large (750g) boxes for every man, woman and child...' But are breakfast cereals healthy? One variable of particular interest is the amount of sugar, which plays an important role in the tastiness of the product but can make for a less than healthy breakfast. The data file for this exercise contains nutritional information and ratings for 77 breakfast cereals. (a) Apply the log transformation to variable rating to create a new variable Lrating. Check Normality of rating and Lrating and briefly discuss the effect of the log transformation on the distribution. (b) Obtain a Pearson correlation matrix relating variables Lrating, sugars, fiber and sodium, and comment briefly on these correlations. 1 (c) Obtain a scatterplot matrix relating Lrating, sugars, fiber and sodium, and briefly discuss the resulting relationships. Based on your scatterplots and results from part (b), which variable would you recommend as the single explanatory variable in a simple linear regression model for Lrating. (d) Fit a simple linear regression model relating Lrating to the variable you have identified in part (c), with Lrating as the dependent variable. Interpret the model equation. Obtain and discuss fit diagnostics. Are there any observations that would require closer inspection? Explain briefly.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started