Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Hi... I need answers for my tasks and assignemnt that is due. The programming language is R and I will attach all the data sets

Hi...

I need answers for my tasks and assignemnt that is due. The programming language is R and I will attach all the data sets needed to answer the questions.

Thank you!

#Task 2

Test to see if New York (nytorns.txt) has more than 8 tornadoes in a year (=0.05).

#Task 3

Determine if the average number of tornadoes in New York has changed between the first two decades of the dataset and the second two decades. (=0.05)

#Task 4

Determine if the variability in the number of tornadoes in New York has changed between the first two decades of the dataset and the second two decades. (=0.05)

#Task 5

Determine if Ithaca low temperature data follow a normal distribution. Use 7 bins.

#Task 6

Compute confidence intervals for the Ithaca and Canadaigua high temperature data. Compare the two and determine if they are statistically significantly different. Assume normality (=0.05)

#Task 7

Are the average low temperatures equal between Canadaigua and Ithaca? Compare the results of a t-test with a Mann-Whitney test. (=0.05)

#Task 8

Set your seed to 100. Compare the actual calculated mean of a random set of 1000 gamma distributed data points with alpha = 1 and beta = 2 to the bootstrap mean and median of these same data. That is, compute the median and mean of the bootstrap replicates.

#Task 9

Use bootstrap confidence intervals to determine if climate change caused a change in Ithaca rainfall between the first 25 years and second 25 years of the Ithaca precipitation dataset. Plot your CIs using the plot.ci function to visually assess.

#Task 10

Perform a similar permutation test on Ithaca high temperature and precipitation data, comparing it to Canadaigua. Which, if any, are statistically significantly different?

For this assignment, we will be working with two new datasets. They are similar in structure, one is a series of observations for Chicago, IL (KORD) while the other is the same observation time period but for Pittsburgh, PA (KPIT). Observations span from December 29, 2012 to March 31, 2013. Both files (called KORD_spring.txt and KPIT_spring.txt) contain 4 columns, with the first being the date, the second the daily high temperature (in F), the third the daily low temperature (in F), and the fourth the daily peak wind gust (in mph). Use these files to answer the questions that follow. Note the section headers that state the types of tests to be used to answer these questions.

Parametric testing

1) Does either site have an average high temperature exceeding 45F, on average? (=0.05)

2) Was March statistically significantly warmer than January at each site? (=0.01)

3) Was the variability in KPIT high temperatures different than KORD? Discuss those results in the context of (2). (=0.1)

4) Are the wind data for KORD gamma distributed? What about KPIT? (=0.05)

Non-parametric testing

5) Use a Mann-Whitney U-test to determine if the low temperatures in the two locations are equal. Compare those results with a t-test on the lows in the two locations. (=0.1)

6) Use bootstrapping to plot 95% confidence intervals on the mean of the high temperature data, low temperature data, and wind speed data comparing the two sites. Comment on the statistical significance in the results you see. Provide boxplots (one figure with both sites for each variable) of the bootstrap replicates showing the differences you discuss.

7) Are the mean low temperatures of the two sites the same? Verify the results of (5) using a permutation test. (=0.1)

Canadaigua_data.txt

1 0.00 34 28

2 0.04 36 28

3 0.84 30 26

4 0.00 29 19

5 0.00 30 16

6 0.00 35 24

7 0.02 44 26

8 0.05 38 24

9 0.01 31 24

10 0.09 33 29

11 0.18 39 29

12 0.04 33 27

13 0.00 34 31

14 0.00 39 26

15 0.06 51 38

16 0.03 44 23

17 0.04 25 13

18 0.00 34 14

19 0.00 36 28

20 0.35 29 19

21 0.02 27 19

22 0.01 29 17

23 0.35 27 22

24 0.08 24 2

25 0.00 11 4

26 0.00 21 5

27 0.00 19 7

28 0.00 26 8

29 0.01 28 14

30 0.01 31 14

31 0.13 38 23

Ithaca_data

1 0.00 33 19

2 0.07 32 25

3 1.11 30 22

4 0.00 29 -1

5 0.00 25 4

6 0.00 30 14

7 0.00 37 31

8 0.04 37 22

9 0.02 29 23

10 0.05 30 27

11 0.34 36 29

12 0.06 32 25

13 0.18 33 29

14 0.02 34 15

15 0.02 53 29

16 0.00 45 24

17 0.00 25 0

18 0.00 28 2

19 0.00 32 26

20 0.45 27 17

21 0.00 26 19

22 0.00 28 9

23 0.70 24 20

24 0.00 26 -6

25 0.00 9 -13

26 0.00 2 -13

nytorns

3

4

5

1

3

1

5

1

2

2

7

4

5

6

6

6

3

7

5

8

3

4

3

3

8

6

7

9

6

5

5

10

5

8

14

8

4

7

4

6

Kord_spring

20130320 24.98 14 10.3

20130321 33.08 14 6.7

20130322 41 19.94 5.4

20130323 41 24.98 7.2

20130324 37.04 32 13.4

20130325 37.94 32 11.6

20130326 42.98 28.94 7.6

20130327 42.08 30.92 5.8

20130328 51.08 28.04 5.4

20130329 55.04 32 5.8

20130330 59 28.04 9.8

20130331 59 35.96 12.5

KPIT_spring

20130320 39.02 24.08 10.7

20130321 28.94 17.06 8

20130322 33.98 24.98 8

20130323 46.04 19.04 7.2

20130324 42.98 26.96 9.8

20130325 35.96 32 7.6

20130326 41 32 9.8

20130327 44.06 32 10.3

20130328 42.08 35.06 8

20130329 53.06 33.98 7.6

20130330 55.04 26.06 5.8

20130331 50 42.08 4

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Concepts

Authors: David M Kroenke, David J Auer

6th Edition

0132742926, 978-0132742924

More Books

Students also viewed these Databases questions

Question

Describe the factors influencing of performance appraisal.

Answered: 1 week ago

Question

What is quality of work life ?

Answered: 1 week ago

Question

What is meant by Career Planning and development ?

Answered: 1 week ago