Answered step by step
Verified Expert Solution
Question
1 Approved Answer
MATH1P98 - Assignment #1 - Section #4 Due: Monday, September 26 @ 11:59 pm Students are expected to complete all questions on the assignment. However,
MATH1P98 - Assignment #1 - Section #4 Due: Monday, September 26 @ 11:59 pm Students are expected to complete all questions on the assignment. However, only a subset of questions will be considered for marking. Marks will be deducted for incomplete assignments. Assignment submissions must be neat, legible, written on one side of the page only, and questions must be submitted in order. A cover page must be attached to the front of the assignment. (See sample cover page on Sakai) Please submit answers to the following questions to your assignment drop box by 11:59pm of the due date. The text is Introduction to Probability & Statistics (3rd Edition), by Mendenhall, et. al. Correction: Question 1(e) should be 'upper class limits' 1 a) Use the data for systolic blood pressure found in Exercise 13.22 (y column, pg. 601). Starting with a lower class limit of 100.0 and class width of 21.0, create a frequency table by hand. Your table should include columns for frequency, relative frequency and cumulative frequency. b) Using the frequency table, what percentage of observations is at least 163.0 units? c) Create a frequency histogram of the table in part (a), using EXCEL and the procedure described below: - On your worksheet, add a new column anywhere with the upper class limits in order, starting with 120.0. You should have a column of data with 40 rows and column of upper class limits (bins) with 6 rows (Note: Excel needs the upper class limits, not lower class limits). - Load the Data Analysis ToolPak: Click the File tab, and then click Options. Select Add-Ins on the left, click Analysis ToolPak in the list on the right, and then in the Manage box, select Excel Add-ins. Click Go. Select Analysis ToolPak and click Ok. - Click the Data tab in the top menu and click Data Analysis - Select Histogram and click Ok. - Click the Input Range box and select the column of data with your mouse - Click the Bin Range box and select the upper class limit column data with your mouse - Select New Worksheet Ply and Chart Output, and then click Ok Once your histogram is created, edit the main title and x-axis label to be something more informative than the default labels. Include a printout of the histogram only with your assignment (you do not need to include the other output). 1 d) Using the histogram, describe the distribution of the data. What is the shape of the distribution? Are there any outliers? If so, approximately how many are there? e) Repeat part (c), except this time change the number of upper class limits so that there are 15 intervals. To do this, follow the same procedue in part (c), with now 16 rows of upper class limits. Create a histogram. Do this 3 more times, each time selecting the number of intervals to be 10, 5 and 3 respectively. Write a brief summary of what you notice about the shape of the histogram and the number of outliers as the number of intervals changes. What is the disadvantage of having too many or too few intervals? Note: You do not need to include these histograms in your solutions. 2 a) Using the Fighting the Flu data from Section 2-3 #2.13 (pg. 62), calculate the mean and the median. Show your calculations and include units in your response. Based on these two measures, how would you describe the distribution of the data? b) Calculate the variance and standard deviation by completing a table like the one provided below. Use four decimal places throughout your calculations, and include units in your response. What does (x x) approximately equal? Explain. x (x x) (x x)2 .. . Total: .. . .. . s2 = s= c) Calculate a z-score for the observations 8, 23, 5. Show your work. Based on the z-score values, which of these would be considered \"unusual\" values, and why? d) If you had to pick two values from the Fighting the Flu data set such that the variability was minimized, which two values would you pick? 3 a) A data set for blood platelet counts of women has the summary statistics x = 280 and s = 65. Assuming that the data follows a bell-shaped distribution, what percentage of women has blood platelet counts between 150 and 410? b) What are the blood platelet counts that are 1.5 standard deviations above and below the mean? What percentage of women has a blood platelet count within this interval? c) Consider the Fighting the Flu data set from Exercise 2. Would it be reasonable to use the Empirical Rule on this data set? What about Chebyshevs Theorem? Why or why not? 4 a) Return to the data set used in Exercise 1 (systolic blood pressure). Using Excel and the procedure described below, find P10 and P84. - Select the column of the data, click the Formulas tab and click Insert Function fx 2 - In the Insert Function box, select Statistical - In the Select a function box, click PERCENTILE.INC and click Ok - In the Functions Arguments box, under Array, enter the range of data (i.e. A2:A41) - Click the K box and enter the percentile in decimal form, i.e., the 25th percentile is entered as 0.25, then click Ok. b) In which percentile does the data value 140.2 approximately fall? How would you interpret this percentile in the context of systolic blood pressure? c) Use EXCEL and the procedure described below to find the 5-Number Summary for this data set. - Select the column of the data, click the Formulas tab and click Insert Function fx - In the Insert Function box, select Statistical - In the Select a function box, click QUARTILE.INC and click Ok - In the Functions Arguments box, under Array, enter the range of data (i.e. A2:A41) - In the Quart box, enter 0 for the minimum, 1 for the 1st quartile, and so on to find the 5-number summary. Include the output in your solutions. Use the procedure found on page 82 to construct a regular boxplot by hand for this data set. On your boxplot, label the values from the 5-Number Summary. 5) Obtain a sample of Suncor Energys financial data for a one-year period by the following procedure: - Go to the website: https://ca.finance.yahoo.com/ - In the top left corner, in the box where it says Enter Symbol, type in SU.TO. Click on Look Up. You will be directed to the summary page for Suncor Energy Inc. - In the left side menu, click on the link for Historical Prices. You can now select a date range to look up. Start with January 1, and end with December 31. Select any year between 2000 and 2013. For example, if you wanted to look up the price information for the year 2007, you would enter January 1, 2007 and December 31, 2007. On the right hand side, select Weekly. Click Get Prices. - Export the data to a spreadsheet by scrolling down to the bottom of the page and clicking on Download to Spreadsheet. - Select one of the columns labeled Open, High, Low, or Close to be your data set (ignore the columns for Date, Volume and Adj. Close). Copy and paste your selected column into a new spreadsheet. - Complete each of the following. Include your output, and remember to show your work where applicable. 3 a) Use EXCEL to find the mean, median, and standard deviation. b) By hand, calculate the interquartile range. Show your work. c) By hand, use the procedure for Outliers described below to determine whether your data set contains any outliers. Show your work. Find quartiles Q1 , Q2 and Q3 Find the interquartile range (IQR), where IQR = Q3 Q1 Find IQR 1.5. A data value is an outlier if it is above Q3 by an amount greater than 1.5 IQR, or below Q1 by an amount greater than 1.5 IQR. d) Copy your summary statistics onto the same spreadsheet as your data. Print this page and include it in your solutions. (For the sake of saving paper, once you have finished your calculations you may wish to condense your data into two shorter columns). 4
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started