Question
The manufacturing process at a factory produces ball bearings that are sold to automotive manufacturers. The factory wants to estimate the average diameter of a
The manufacturing process at a factory produces ball bearings that are sold to automotive manufacturers. The factory wants to estimate the average diameter of a ball bearing that is in demand to ensure that it is manufactured within the specifications. Suppose they plan to collect a sample of 50 ball bearings and measure their diameters to construct a 90% and 99% confidence interval for the average diameter of ball bearings produced from this manufacturing process.
The sample of size 50 was generated using Python's numpy module. This data set will be unique to you, and therefore your answers will be unique as well. Run Step 1 in the Python script to generate your unique sample data. Check to make sure your sample data is shown in your attachment.
In your initial post, address the following items. Be sure to answer the questions about both confidence intervals and hypothesis testing.
- In the Python script, you calculated the sample data to construct a 90% and 99% confidence interval for the average diameter of ball bearings produced from this manufacturing process. These confidence intervals were created using the Normal distribution based on the assumption that the population standard deviation is known and the sample size is sufficiently large. Report these confidence intervals rounded to two decimal places. See Step 2 in the Python script.
- Interpret both confidence intervals. Make sure to be detailed and precise in your interpretation.
It has been claimed from previous studies that the average diameter of ball bearings from this manufacturing process is 2.30 cm. Based on the sample of 50 that you collected, is there evidence to suggest that the average diameter is greater than 2.30 cm? Perform a hypothesis test for the population mean at alpha = 0.01.
In your initial post, address the following items:
- Define the null and alternative hypothesis for this test in mathematical terms and in words.
- Report the level of significance.
- Include the test statistic and the P-value. See Step 3 in the Python script. (Note that Python methods return two tailed P-values. You must report the correct P-value based on the alternative hypothesis.)
- Provide your conclusion and interpretation of the results. Should the null hypothesis be rejected? Why or why not?
Step 1: Generating sample data
This block of Python code will generate a unique sample of size 50 that you will use in this discussion. Note that your sample will be unique and therefore your answers will be unique as well. The numpy module in Python allows you to create a data set using a Normal distribution. Note that the mean and standard deviation were chosen for you. The data set will be saved in a Python dataframe that will be used in later calculations.
Click the block of code below and hit theRunbutton above.
In[1]:
import pandas as pd import numpy as np import math import scipy.stats as st # create 50 randomly chosen values from a Normal distribution. (arbitrarily using mean=2.48 and standard deviation=0.50). diameters = np.random.normal(2.4800,0.500,50) # convert the array into a dataframe with the column name "diameters" using pandas library. diameters_df = pd.DataFrame(diameters, columns=['diameters']) diameters_df = diameters_df.round(2) # print the dataframe (note that the index of dataframe starts at 0). print("Diameters data frame ") print(diameters_df) Diameters data frame diameters 0 1.94 1 1.99 2 3.22 3 3.58 4 2.03 5 3.01 6 3.13 7 2.51 8 2.57 9 1.86 10 3.72 11 3.19 12 2.26 13 3.15 14 2.00 15 2.39 16 2.76 17 3.29 18 2.42 19 2.32 20 2.65 21 2.55 22 2.58 23 2.30 24 1.78 25 1.62 26 2.54 27 2.62 28 2.60 29 2.45 30 2.59 31 2.58 32 1.90 33 2.18 34 1.74 35 1.89 36 2.90 37 2.82 38 2.53 39 2.89 40 1.56 41 2.77 42 2.70 43 1.38 44 3.42 45 2.33 46 2.60 47 2.09 48 2.72 49 2.58
Step 2: Constructing confidence intervals
You will assume that the population standard deviation is known and that the sample size is sufficiently large. Then you will use the Normal distribution to construct these confidence intervals. You will use the submodule scipy.stats to construct confidence intervals using your sample data.
Click the block of code below and hit theRunbutton above.
In[2]:
# Python methods that calculate confidence intervals require the sample mean and the standard error as inputs. # calculate the sample mean mean = diameters_df['diameters'].mean() # input the population standard deviation, which was given in Step 1. std_deviation = 0.5000 # calculate standard error = standard deviation / sqrt(n) where n is the sample size. stderr = std_deviation/math.sqrt(len(diameters_df['diameters'])) # construct a 90% confidence interval. conf_int_90 = st.norm.interval(0.90, mean, stderr) print("90% confidence interval (unrounded) =", conf_int_90) print("90% confidence interval (rounded) = (", round(conf_int_90[0], 2), ",", round(conf_int_90[1], 2), ")") print("") # construct a 99% confidence interval. conf_int_99 = st.norm.interval(0.99, mean, stderr) print("99% confidence interval (unrounded) =", conf_int_99) print("99% confidence interval (rounded) = (", round(conf_int_99[0], 2), ",", round(conf_int_99[1], 2), ")") 90% confidence interval (unrounded) = (2.3876912846323326, 2.6203087153676674) 90% confidence interval (rounded) = ( 2.39 , 2.62 ) 99% confidence interval (unrounded) = (2.3218613632281553, 2.6861386367718447) 99% confidence interval (rounded) = ( 2.32 , 2.69 )
Step 3: Performing hypothesis testing for the population mean
Since you were given the population standard deviation in Step 1 and the sample size is sufficiently large, you can use the z-test for population means. The z-test method in statsmodels.stats.weightstats submodule runs the z-test. The input to this method is the sample dataframe and the value under the null hypothesis. The output is the test-statistic and the two-tailed P-value.
Click the block of code below and hit theRunbutton above.
In[3]:
from statsmodels.stats.weightstats import ztest # run z-test hypothesis test for population mean. The value under the null hypothesis is 2.30. test_statistic, p_value = ztest(x1 = diameters_df['diameters'], value = 2.30) print("z-test hypothesis test for population mean") print("test-statistic =", round(test_statistic,2)) print("two tailed p-value =",round(p_value,4)) z-test hypothesis test for population mean test-statistic = 2.74 two tailed p-value = 0.0061
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started