Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 22, 2024

Python Code PLEASE FIX THE CODE and ADD the code. Note: The following information and instructions apply to students that have been assigned to Scenario

Python Code

PLEASE FIX THE CODE and ADD the code.

Note: The following information and instructions apply to students that have

been assigned to Scenario C by the instructor. You can find your scenario assignment in the Final Project Review and Scenario Assignments item in the Module One area of the course. Reach out to your instructor if you have questions about your scenario assignment. If you run into issues and would like to start over with a clean copy of the provided script file (e.g., Milestone1.py), you can click on the gear button, and select Restore current files from the drop-down menu.

Imagine you are a quantitative analyst working in an investment fund that is looking to invest in an agricultural companies that is involved in rice production. Your primary job is to analyze the production levels of four candidate companies. Your analysis will help identify companies with rice production levels that are significantly different than others. This will help your fund make a data based decision on their investment.

You are now given a task to analyze historical data on rice production for four agricultural companies. The data consist of monthly rice production in cwt (Centum Weight - measure of weight) for four companies for the past 21 years. You will be using this data to finish this task.

Use the following table as a reference guide for the variables of interest for this project.

Variable	Definition
Company1	Total rice production, in cwt, for Company 1.
Company2	Total rice production, in cwt, for Company 2.
Company3	Total rice production, in cwt, for Company 3.
Company4	Total rice production, in cwt, for Company 4.
Month	The Month of the year reported as integers (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12).

Step 1: Import your data set

You will be working with the Rice Production data set. To ensure that you analyze the correct data set, make the following edits to the function underneath Step 1: Import your data set in the Tool Panel on the left side of the screen.

Uncomment line 11 Replace '???FILE_NAME???' with 'Rice_Production.csv' Replace ???DATASET_NAME??? with production

Step 2: Calculate descriptive statistics for rice production in Company 1 (Company1)

You will need to calculate the following descriptive statistics for the variable Company1:

Mean

Median

Variance

Standard Deviation

Minimum

Maximum

25th Percentile

75th Percentile

In order to perform these functions, you need to make the appropriate modifications to correct Python functions within the provided script. In other words, you should:

Uncomment lines 17 - 52 Replace ???DATASET_NAME??? with production Replace ???VARIABLE_NAME??? with the variable 'Company1'

for each of the required Python functions listed above.

Step 3: Calculate descriptive statistics for rice production in Company 2 (Company2)

You will need to calculate the following descriptive statistics for the variable Company2:

Mean

Median

Variance

Standard Deviation

Minimum

Maximum

25th Percentile

75th Percentile

In order to perform these functions, you need to make the appropriate modifications to associated Python functions listed within the provided script. In other words, you should:

Uncomment lines 56 - 89 Replace ???DATASET_NAME??? with production Replace '???VARIABLE_NAME???' with the variable 'Company2'

Step 4: Construct confidence interval for population proportion

Construct a 99% confidence interval for the proportion of months with rice production above 307200 cwt (Company1>307200.0). In order to perform this function, you need to make the appropriate modifications to the provided script. In other words, you should:

Comment lines 17 - 52 Comment lines 56 - 89 Uncomment lines 94 - 100 Replace ???DATASET_NAME??? with production Replace '???VARIABLE_NAME???' with the variable 'Company1' Replace ???Xvalue??? with the appropriate value

Step 5: Construct confidence interval for population mean

Construct a 95% confidence interval for average rice production. In order to perform this function, you need to make the appropriate modifications to the provided script. In other words, you should:

Uncomment lines 106 - 113 Replace ???DATASET_NAME??? with production

Replace '???VARIABLE_NAME???' with the variable 'Company2'####### Milestone 1 Python Script

Step 6: Perform hypothesis test for population proportion

It is known that 29% of the months have rice production by Company1 above 307200 cwt (company1>307200.0). Is there sufficient evidence to suggest that the proportion is less than 29%? Test at 5% level of significance. In order to perform this function, you need to make the appropriate modifications to the provided script. In other words, you should:

Comment lines 94 - 100 Comment lines 106 - 113 Uncomment lines 119 - 125 Replace ???DATASET_NAME??? with production Replace '???VARIABLE_NAME???' with the variable 'Company1' Replace ???Xvalue??? with the appropriate value Replace ???NULL_VALUE??? with the appropriate value for the null hypothesis Replace '???TEST_TYPE???' with the appropriate alternative hypothesis

Step 7: Perform hypothesis test for population mean It is claimed that average rice production by Company2 is 209,500 cwt (Company2 = 209500.0). Test this claim using a hypothesis test at 1% level of significance. In order to perform this function, you need to make the appropriate modifications to the provided script. In other words, you should:

Uncomment lines 131 - 139 Replace ???DATASET_NAME??? with production Replace '???VARIABLE_NAME???' with the variable 'Company2' Replace ???NULL_VALUE??? with the appropriate value for the null hypothesis

Replace '???TEST_TYPE???' with the appropriate alternative hypothesis

Step 8: Get your results

Once you have confirmed that your code is correct by passing each of the self-assessments, it is time to get the results of your statistical tests. First, you must uncomment all of the lines of your scripting file that are not instructional, as indicated below:

Uncomment lines 17 - 52 Uncomment lines 56 - 89 Uncomment lines 94 - 100 Uncomment lines 106 - 113 Uncomment lines 119 - 125 Uncomment lines 131 - 139

Then, open a new terminal window in which to execute your script.

image text in transcribed

Finally, insert the command illustrated below and execute the function to see the output of your statistical tests.

image text in transcribed

Step 9: Submit your Milestone1.py

Now that you have completed all of the above steps for this part of the final project, follow the directions in the rubric document to download and submit your final scripting file to the correct item in Brightspace. Note: Submitting your files to the self-assessment Check Its in Codio is not enough to satisfy the submission requirements for this part of your final project.You must also download and submit a final version of the file to the associated item in Brightspace to earn the points for this assignment.

Step 10: Create Summary Report

Review the prompt in your final project documentation to create your summary report in which you interpret and communicate the results of your statistical analyses in Part I. Follow the directions in your rubric for formatting and submitting this component of your final project.THIS IS THE CODE:import pandas as pdimport scipy.stats as st from snhu_MAT243 import prop_1samp_ztest from snhu_MAT243 import means_1samp_ttest ##Step 1: Import your data set ##----------------------------------------------------------------------------- # ???DATASET_NAME??? = pd.read_csv('???FILENAME???') ##Step 2: Calculate descriptive statistics ##----------------------------------------------------------------------------- # print ('Descriptive Statistics - Step 2') # print ('') # print ('Mean') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].mean()) # print ('') # print ('Median') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].median()) # print ('') # print ('Mode') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].mode()) # print ('') # print ('Minimum') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].min()) # print ('') # print ('Maximum') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].max()) # print ('') # print ('Mean') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].var()) # print ('') # print ('Standard Deviation') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].std()) # print ('') # print ('Describe') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].describe()) # print ('') # print ('') ##Step 3: Calculate descriptive statistics ##----------------------------------------------------------------------------- # print ('Descriptive Statistics - Step 3') # print ('') # print ('Mean') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].mean()) # print ('') # print ('Median') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].median()) # print ('') # print ('Mode') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].mode()) # print ('') # print ('Minimum') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].min()) # print ('') # print ('Maximum') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].max()) # print ('') # print ('Mean') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].var()) # print ('') # print ('Standard Deviation') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].std()) # print ('') # print ('Describe') # print (???DATASET_NAME???[['???VARIABLE_NAME???']].describe()) # print ('') ##Step 4: Construct confidence interval for population proportion ##----------------------------------------------------------------------------- # print ('Confidence Interval - Step 4') # n = ???DATASET_NAME???[['???VARIABLE_NAME???']].count() # x = (???DATASET_NAME???[['???VARIABLE_NAME???']] > ???Xvalue???).values.sum() # p = x*1.0 # stderror = (p * (1 - p))**0.5 # print (st.norm.interval(0.99, p, stderror)) # print ('') ##Step 5: Construct confidence interval for population mean ##----------------------------------------------------------------------------- # print ('Confidence Interval - Step 5') # n = ???DATASET_NAME???[['???VARIABLE_NAME???']].count() # df = n - 1 # mean = ???DATASET_NAME???[['???VARIABLE_NAME???']].mean() # stdev = ???DATASET_NAME???[['???VARIABLE_NAME???']].std() # stderror = stdev/(n**0.5) # print (st.t.interval(0.95, df, mean, stderror)) # print ('') ##Step 6: Perform hypothesis test for population proportion ##----------------------------------------------------------------------------- # print ('Hypothesis Test - Step 6') # n = ???DATASET_NAME???[['???VARIABLE_NAME???']].count() # x = (???DATASET_NAME???[['???VARIABLE_NAME???']] > ???Xvalue???).values.sum() # null_value = ???NULL_VALUE??? # alternative = '???TEST_TYPE???' # print (prop_1samp_ztest(x, n, null_value, alternative)) # print ('') ##Step 7: Perform hypothesis test for population mean ##----------------------------------------------------------------------------- # print ('Hypothesis Test - Step 7') # n = ???DATASET_NAME???[['???VARIABLE_NAME???']].count() # df = n - 1 # mean = ???DATASET_NAME???[['???VARIABLE_NAME???']].mean() # std_dev = ???DATASET_NAME???[['???VARIABLE_NAME???']].std() # null_value = ???NULL_VALUE??? # alternative = '???TEST_TYPE???' # print (means_1samp_ttest(mean, std_dev, n, null_value, alternative)) # print ('')

File Edit Find View Tools Education Help one: Terminal estone z PyScr nstall Software as as pd y.stats as st AT243 import prop 1sa JUnit (Beta) Debugger Settings (Beta) File Edit Find View Tools Education Help one: Terminal estone z PyScr nstall Software as as pd y.stats as st AT243 import prop 1sa JUnit (Beta) Debugger Settings (Beta)