This exercise is based on results in McNamee (2003) on the use of two-phase sampling to estimate
Question:
We wish to estimate p = U = C2+/N from the two-phase sample; p1 = C21/N1 and p2 = C22/N2 are the proportions with the disease in strata 1 and 2, respectively. a. Epidemiologists often use the concepts of specificity and sensitivity to assess a test for a disease, with S1 = Specificity = P (test is negative | disease absent) = C11 / C1+ and S2 = Sensitivity = P (test is positive | disease present) = C22 / C2+.
Show that
b. Suppose that the optimal allocation is used (see Section 12.5.1) and that 0
Where R is the population Pearson correlation coefficient between x and y, given in (4.1). For the second term, first show that RSy = p (S2 W2)/W1W2.
c. Calculate the ratio of variances in (b) when S1 = S2 and R = min {S1 + S2
0.9, 0.95}, for S1 {0.5, 0.6, 0.7, 0.8, 0.9, 0.95} and c (1) / c (2) {0.0001, 0.01, 0.1, 0.5,1}. Display your results in a table. For which settings would you recommend two-phase sampling to estimate disease prevalence?
Step by Step Answer: