You are designing a cohort study to examine whether exposure to high levels of air population increases obesity in children You will recruit non obese children 5 7 years of age and follow them for three years There are few studies on this topic so you don't have a good estimate of the risk ratio you expect to find you think it may be as low as 1 3 and as high as 2 5 Epidemiologic data indicate that 15 of US children become obese Conduct an analysis to figure out how many participants you need to have 80 power to detect these different effect sizes (RR) Fill in the power calculations for the table below using SAS Set alpha (type 1 error) to 05 and use a two sided Fisher's exact test Assume you will have a 1 1 ratio for exposed and unexposed Hint in your calculations, you will be allowing the sample size and minimum detectable RR to vary Fill in Table 1 with sample size per group (exposed and unexposed) (do not use the total Ns) (2 points) Table 1 Power estimates for different sample sizes and RRs Detectable RR Sample Size Per Group (exposed or unexposed) 100 200 300 400 500 1 3 1 6 1 9 2 2 2 5 Now enter this into a new database using the below syntax and plot the power curves for each RR (i e plot the association between sample size and power for each RR) using code similar to what we used in lab (see answer key) (Replace the ' xxx' with the actual power calculations from your table) SAS coding notes values with or signs, i e 999 , are not considered a number in SAS and must be changed to actual numbers, i e 0 999 in the code below in order for SAS to be able to interpret it as a number and include it in the plot Also for the plot, you will need to use this code plot power n RR instead of the code from the lab ( plot power ratio ) data rrnpower input rr n power cards 1 3 100 xxx 1 3 200 xxx 1 3 300 xxx 1 3 400 xxx 1 3 500 xxx 1 6 100 xxx 1 6 200 xxx 1 6 300 xxx 1 6 400 xxx 1 6 500 xxx 1 9 100 xxx 1 9 200 xxx 1 9 300 xxx 1 9 400 xxx 1 9 500 xxx 2 2 100 xxx 2 2 200 xxx 2 2 300 xxx 2 2 400 xxx 2 2 500 xxx 2 5 100 xxx 2 5 200 xxx 2 5 300 xxx 2 5 400 xxx 2 5 500 xxx run Describe the relationship between statistical power, sample size and detectable differences Which parameter (n or RR) is relatively more important in terms of increasing statistical power Support your answer

The Answer is in the image, click to view ...

Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Jun 20, 2024

You are designing a cohort study to examine whether exposure to high levels of air population increases obesity in children. You will recruit non-obese children

You are designing a cohort study to examine whether exposure to high levels of air population increases obesity in children. You will recruit non-obese children 5-7 years of age and follow them for three years.There are few studies on this topic so you don't have a good estimate of the risk ratio you expect to find; you think it may be as low as 1.3 and as high as 2.5. Epidemiologic data indicate that 15% of US children become obese. Conduct an analysis to figure out how many participants you need to have 80% power to detect these different effect sizes (RR). Fill in the power calculations for the table below using SAS. Set alpha (type 1 error) to .05 and use a two-sided Fisher's exact test. Assume you will have a 1:1 ratio for exposed and unexposed. Hint: in your calculations, you will be allowing the sample size and minimum detectable RR to vary.

Fill in Table 1 with sample size per group (exposed and unexposed) (do not use the total Ns) (2 points)

Table 1. Power estimates for different sample sizes and RRs
Detectable RR	Sample Size Per Group (exposed or unexposed)
100	200	300	400	500
1.3
1.6
1.9
2.2
2.5

Now enter this into a new database using the below syntax and plot the power curves for each RR (i.e. plot the association between sample size and power for each RR) using code similar to what we used in lab (see answer key). (Replace the '.xxx' with the actual power calculations from your table).

SAS coding notes: values with > or < signs, i.e. ">.999", are not considered a number in SAS and must be changed to actual numbers, i.e. 0.999 in the code below in order for SAS to be able to interpret it as a number and include it in the plot. Also for the plot, you will need to use this code "plot power*n=RR;" instead of the code from the lab ("plot power*ratio;").

data rrnpower;

input rr n power;

cards;

1.3 100 .xxx

1.3 200 .xxx

1.3 300 .xxx

1.3 400 .xxx

1.3 500 .xxx

1.6 100 .xxx

1.6 200 .xxx

1.6 300 .xxx

1.6 400 .xxx

1.6 500 .xxx

1.9 100 .xxx

1.9 200 .xxx

1.9 300 .xxx

1.9 400 .xxx

1.9 500 .xxx

2.2 100 .xxx

2.2 200 .xxx

2.2 300 .xxx

2.2 400 .xxx

2.2 500 .xxx

2.5 100 .xxx

2.5 200 .xxx

2.5 300 .xxx

2.5 400 .xxx

2.5 500 .xxx

;

run;

Describe the relationship between statistical power, sample size and detectable differences. Which parameter (n or RR) is relatively more important in terms of increasing statistical power? Support your answer.