Question

1 Approved Answer

Posted on Oct 14, 2024

Drive (miles) 20 28 54 76 36 88 6 25 40 42 55 71 76 78 4 20 25 29 33 36 36 36 63

Drive (miles) 20 28 54 76 36 88 6 25 40 42 55 71 76 78 4 20 25 29 33 36 36 36 63 73 73 76 80 33 54 63 76 80 80 94 6 State OR NY NV PA TX SC MI CA IL IL OH MI NY CA NY IL GA TX TX MI IL OR MI FL SC NY FL SC FL PA PA NV SC KY OR Shoe Size 7 7 11 11 7 6 10 8 10 8 9 11 9 11 9 5 9 11 9 10 12 9 5 12 11 9 9 11 9 10 11 12 10 11 13 Height (inches) 65 67 65 63 66 63 66 69 68 65 67 64 63 63 69 62 63 70 75 61 61 73 69 70 68 65 66 68 70 66 70 69 63 74 75 Sleep (hours) 4 4 4 4 5 5 6 6 6 6 6 6 6 6 7 7 7 7 7 7 7 7 7 7 7 7 7 8 8 8 8 8 8 8 10 Gender Car Color F green F black F silver M silver M orange F black M black F silver M silver F red F dark blue F silver M silver M blue F blue F black M silver M silver M silver M blue M red M blue F orange F green F red F red F black M blue F green F silver F blue M white F black M red M silver TV (hours) 1 1 5 5 3 3 5 4 5 4 4 2 4 2 5 3 3 2 5 6 4 5 3 4 2 6 5 1 2 3 3 4 2 3 6 Money (dollars) 46.00 25.00 54.00 13.00 7.00 43.00 34.00 30.00 16.00 4.00 27.00 1.00 38.00 47.00 21.00 10.00 53.00 28.00 54.00 5.00 14.00 23.00 9.00 15.00 18.00 7.00 22.00 9.00 5.00 5.00 5.00 3.00 7.00 20.00 7.00 Coin 4 4 7 5 3 6 3 4 5 4 2 2 2 2 5 2 5 3 7 2 3 5 2 3 5 2 2 4 3 7 4 4 7 4 4 Die1 Die2 5 6 6 4 2 6 4 3 2 3 2 4 3 3 4 2 2 3 4 6 2 2 6 1 4 1 6 5 5 3 4 2 4 3 2 Die3 5 2 2 5 4 2 4 5 2 4 6 3 5 4 2 4 4 3 6 6 5 5 2 6 2 6 4 1 4 1 4 3 4 5 1 Die4 1 6 3 2 4 4 5 3 4 1 5 5 2 4 1 5 4 2 2 4 3 1 4 4 3 6 4 5 3 2 5 2 1 3 2 Die5 5 5 6 3 1 2 1 6 2 3 5 4 1 5 5 4 2 6 6 5 6 4 4 6 1 5 2 6 3 1 2 5 6 2 6 Die6 2 5 1 3 6 3 4 1 1 2 5 3 2 2 6 5 5 6 4 4 4 5 6 2 6 1 2 2 3 6 4 6 3 6 4 1 4 5 5 3 5 4 3 3 5 3 1 3 5 5 5 2 6 3 4 3 1 2 3 4 5 4 5 3 5 4 4 2 4 5 Die7 Die8 5 3 3 3 1 3 5 1 2 5 5 2 4 5 2 6 3 6 1 6 2 1 2 2 3 2 1 2 6 4 1 6 4 5 2 Die9 1 1 6 6 6 1 5 4 4 5 2 3 5 4 1 3 4 5 4 4 4 1 5 2 3 5 1 1 3 2 3 3 6 3 2 Die10 6 3 5 2 4 4 5 1 4 6 3 6 5 6 6 4 3 5 3 6 3 2 3 4 3 1 2 3 4 1 3 2 5 4 3 3 1 1 1 4 2 1 5 2 6 3 1 3 5 2 1 3 1 5 6 3 1 1 6 2 6 1 2 2 5 1 6 1 4 5 MATH 221 Statistics for Decision Making Week 6 iLab Name:_______________________ Statistical Concepts: Data Simulation Confidence Intervals Normal Probabilities Short Answer Writing Assignment All answers should be complete sentences. We need to find the confidence interval for the SLEEP variable. To do this, we need to find the mean and then find the maximum error. Then we can use a calculator to find the interval, (x - E, x + E). First, find the mean. Under that column, in cell E37, type =AVERAGE(E2:E36). Under that in cell E38, type =STDEV(E2:E36). Now we can find the maximum error of the confidence interval. To find the maximum error, we use the \"confidence\" formula. In cell E39, type =CONFIDENCE.NORM(0.05,E38,35). The 0.05 is based on the confidence level of 95%, the E38 is the standard deviation, and 35 is the number in our sample. You then need to calculate the confidence interval by using a calculator to subtract the maximum error from the mean (x-E) and add it to the mean (x+E). 1. Give and interpret the 95% confidence interval for the hours of sleep a student gets. (5 points) Then, you can go down to cell E40 and type =CONFIDENCE.NORM(0.01,E38,35) to find the maximum error for a 99% confidence interval. Again, you would need to use a calculator to subtract this and add this to the mean to find the actual confidence interval. 2. Give and interpret the 99% confidence interval for the hours of sleep a student gets. (5 points) 3. Compare the 95% and 99% confidence intervals for the hours of sleep a student gets. Explain the difference between these intervals and why this difference occurs. (5 points) In the week 2 lab, you found the mean and the standard deviation for the HEIGHT variable for both males and females. Use those values for follow these directions to calculate the numbers again. (From week 2 lab: Calculate descriptive statistics for the variable Height by Gender. Click on Insert and then Pivot Table. Click in the top box and select all the data (including labels) from Height through Gender. Also click on \"new worksheet\" and then OK. On the right of the new sheet, click on Height and Gender, making sure that Gender is in the Rows box and Height is in the Values box. Click on the down arrow next to Height in the Values box and select Value Field Settings. In the pop up box, click Average then OK. Write these down. Then click on the down arrow next to Height in the Values box again and select Value Field Settings. In the pop up box, click on StdDev then OK. Write these values down.) You will also need the number of males and the number of females in the dataset. You can either use the same pivot table created above by selecting Count in the Value Field Settings, or you can actually count in the dataset. Then in Excel (somewhere on the data file or in a blank worksheet), calculate the maximum error for the females and the maximum error for the males. To find the maximum error for the females, type =CONFIDENCE.T(0.05,stdev,#), using the females' height standard deviation for \"stdev\" in the formula and the number of females in your sample for the \"#\". Then you can use a calculator to add and subtract this maximum error from the average female height for the 95% confidence interval. Do this again with 0.01 as the alpha in the beginning of the formula to find the 99% confidence interval. Find these same two intervals for the male data by using the same formula, but using the males' standard deviation for \"stdev\" and the number of males in your sample for the \"#\". 4. Give and interpret the 95% confidence intervals for males and females on the HEIGHT variable. Which is wider and why? (7 points) 5. Give and interpret the 99% confidence intervals for males and females on the HEIGHT variable. Which is wider and why? (7 points) 6. Find the mean and standard deviation of the DRIVE variable by using =AVERAGE(A2:A36) and =STDEV(A2:A36). Assuming that this variable is normally distributed, what percentage of data would you predict would be less than 40 miles? This would be based on the calculated probability. Use the formula =NORM.DIST(40, mean, stdev,TRUE). Now determine the percentage of data points in the dataset that fall within this range. To find the actual percentage in the dataset, sort the DRIVE variable and count how many of the data points are less than 40 out of the total 35 data points. That is the actual percentage. How does this compare with your prediction? (10 points) Mean ______________ Standard deviation ____________________ Predicted percentage ______________________________ Actual percentage _____________________________ Comparison ___________________________________________________ ______________________________________________________________ 7. What percentage of data would you predict would be between 40 and 70 and what percentage would you predict would be more than 70 miles? Subtract the probabilities found through =NORM.DIST(70, mean, stdev, TRUE) and =NORM.DIST(40, mean, stdev, TRUE) for the \"between\" probability. To get the probability of over 70, use the same =NORM.DIST(70, mean, stdev, TRUE) and then subtract the result from 1 to get \"more than\". Now determine the percentage of data points in the dataset that fall within this range, using same strategy as above for counting data points in the data set. How do each of these compare with your prediction and why is there a difference? (11 points) Predicted percentage between 40 and 70 ______________________________ Actual percentage _____________________________________________ Predicted percentage more than 70 miles ________________________________ Actual percentage ___________________________________________ Comparison ____________________________________________________ _______________________________________________________________ Why? __________________________________________________________ ________________________________________________________________ Answers for MATH 221 Statistics for Decision Making Name: -----------------------------Statistical Concepts: Data Simulation Confidence Intervals Normal Probabilities Short Answer Writing Assignment All answers should be complete sentences. We need to find the confidence interval for the SLEEP variable. To do this, we need to find the mean and then find the maximum error. Then we can use a calculator to find the interval, (x - E, x + E). First, find the mean. Under that column, in cell E37, type =AVERAGE(E2:E36). Under that in cell E38, type =STDEV(E2:E36). Now we can find the maximum error of the confidence interval. To find the maximum error, we use the \"confidence\" formula. In cell E39, type =CONFIDENCE.NORM(0.05,E38,35). The 0.05 is based on the confidence level of 95%, the E38 is the standard deviation, and 35 is the number in our sample. You then need to calculate the confidence interval by using a calculator to subtract the maximum error from the mean (x-E) and add it to the mean (x+E). 1. Give and interpret the 95% confidence interval for the hours of sleep a student gets. (6 points) (6.73, 7.67) This says that 95% of the values would be between the left and right endpoints of this interval or 95% of the values would be between 6.73 and 7.67 or there is a 95% chance that the true population mean would be between 6.73 and 7.67 Then, you can go down to cell E40 and type =CONFIDENCE.NORM(0.01,E38,35) to find the maximum error for a 99% confidence interval. Again, you would need to use a calculator to subtract this and add this to the mean to find the actual confidence interval. 2. Give and interpret the 99% confidence interval for the hours of sleep a student gets. (6 points) (6.59, 7.81)This says that 99% of the values would be between the fest and right endpoints of this interval or 99% of the values would be between 6.59 and 7.81 or there is a 99% chance that the true population mean would be between 6.59 and 7.81 3. Compare the 95% and 99% confidence intervals for the hours of sleep a student gets. Explain the difference between these intervals and why this difference occurs. (6 points) It is obvious that the 99% confidence interval is \"wider\" than the 95% confidence interval (6.59, 7.81) is the 99% and (6.73, 7.67) is the 95% confidence interval. In order to have more confidence that your interval contains the true population mean, it has to be widened. In the week 2 lab, you found the mean and the standard deviation for the HEIGHT variable for both males and females. Use those values for follow these directions to calculate the numbers again. (From week 2 lab: Calculate descriptive statistics for the variable Height by Gender. Click on Insert and then Pivot Table. Click in the top box and select all the data (including labels) from Height through Gender. Also click on \"new worksheet\" and then OK. On the right of the new sheet, click on Height and Gender, making sure that Gender is in the Rows box and Height is in the Values box. Click on the down arrow next to Height in the Values box and select Value Field Settings. In the pop up box, click Average then OK. Write these down. Then click on the down arrow next to Height in the Values box again and select Value Field Settings. In the pop up box, click on StdDev then OK. Write these values down.) You will also need the number of males and the number of females in the dataset. You can either use the same pivot table created above by selecting Count in the Value Field Settings, or you can actually count in the dataset. Then in Excel (somewhere on the data file or in a blank worksheet), calculate the maximum error for the females and the maximum error for the males. To find the maximum error for the females, type =CONFIDENCE.T(0.05,stdev,#), using the females' height standard deviation for \"stdev\" in the formula and the number of females in your sample for the \"#\". Then you can use a calculator to add and subtract this maximum error from the average female height for the 95% confidence interval. Do this again with 0.01 as the alpha in the beginning of the formula to find the 99% confidence interval. Find these same two intervals for the male data by using the same formula, but using the males' standard deviation for \"stdev\" and the number of males in your sample for the \"#\". 4. Give and interpret the 95% confidence intervals for males and females on the HEIGHT variable. Which is wider and why? (9 points) (Females 65.46, 68.66) Males (68.02, 71.31) The intervals represent the 95% confidence interval, in that there is a 95% chance that the true population mean lies within the interval (65.46, 68.66) for the females or the males(68.02, 71.31). 5. Give and interpret the 99% confidence intervals for males and females on the HEIGHT variable. Which is wider and why? (9 points) Females (64.85, 69.26) Males (67.41, 71.93) The intervals represent the 99% confidence interval, in that there is a 99% chance that the true population mean lies within the interval (64.85, 69.26) for the females or the males (67.41, 71.93) 6. Find the mean and standard deviation of the DRIVE variable by using =AVERAGE(A2:A36) and =STDEV(A2:A36). Assuming that this variable is normally distributed, what percentage of data would you predict would be less than 40 miles? This would be based on the calculated probability. Use the formula =NORM.DIST(40, mean, stdev,TRUE). Now determine the percentage of data points in the dataset that fall within this range. To find the actual percentage in the dataset, sort the DRIVE variable and count how many of the data points are less than 40 out of the total 35 data points. That is the actual percentage. How does this compare with your prediction? (12 points) Mean __51.657___miles___ Standard deviation ___25.801___miles Predicted percentage 0.3257 or 32.57 % Actual percentage 0.40 or 40% Comparison. The predicted percentage of 32.57 is less than the actual percentage, but is pretty close. Keeping in mind that we only had 35 data points, this is understandable. As the amount of data increases, the actual percentage of 40% would get closer to the predicted percentage. 7. What percentage of data would you predict would be between 40 and 70 and what percentage would you predict would be more than 70 miles? Subtract the probabilities found through =NORM.DIST(70, mean, stdev, TRUE) and =NORM.DIST(40, mean, stdev, TRUE) for the \"between\" probability. To get the probability of over 70, use the same =NORM.DIST(70, mean, stdev, TRUE) and then subtract the result from 1 to get \"more than\". Now determine the percentage of data points in the dataset that fall within this range, using same strategy as above for counting data points in the data set. How do each of these compare with your prediction and why is there a difference? (12 points) Predicted percentage between 40 and 70 0.4357 or 43.57% Actual percentage 0.229 or 22.9% Predicted percentage more than 70 miles 0.2386 or 23.86% Actual percentage 0.371 or 37.1% Comparison Again, in this case, the predicted was a little off. About 5% on the between 40 and 70 case and 2% on the greater than 70 case. However, they were still pretty close. Why? The reason is that we had only 35 data points, as the amount of data increases, the actual percentage should get closer to the predicted percentage