Answered step by step
Verified Expert Solution
Question
1 Approved Answer
4.5 Cancer deaths: Suppose for a set of counties 9'} E {1, . . . ,n} we have infor mation on the population size X,-
4.5 Cancer deaths: Suppose for a set of counties 9'} E {1, . . . ,n} we have infor mation on the population size X,- = number of people in 10,000s, and Y, = number of cancer fatalities. One model for the distribution of cancer fa- talities is that, given the cancer rate 3, they are independently distributed with Y, N Poisson(6'X,>). a) Identify the posterior distribution of 6 given data (Y1, X1), . . . , (Yn, Xn) and a gamma(a., b) prior distribution. The le cancer_react.dat contains 1990 population sizes (in 10,0005) and number of cancer fatalities for 10 counties in a Midwestern state that are near nuclear reactors. The le cancer_noreact .dat contains the same data on counties in the same state that are not near nuclear reactors. Consider these data as samples from two populations of counties: one is the population of counties with no neighboring reactors and a fatality rate of 91 deaths per 10,000, and the other is a population of counties having nearby reactors and a fatality rate of 62. In this exercise we will model beliefs about the rates as independent and such that 61 N gamma(a1, bl) and 62 ~ gamma(a2, 5;). b) Using the numerical values of the data, identify the posterior distri butions for 61 and 62 for any values of ((11, 51,112, b2). c) Suppose cancer rates from previous years have been roughly 0 = 2.2 per 10,000 (and note that most counties are not near reactors). For each of the following three prior opinions, compute E[61|data], E[92|data], 95% quantilebased posterior intervals for 31 and 92, and Pr(92 > 91 Idata). Also plot the posterior densities (try to put p(31'data) and p(62|data) on the same plot). Comment on the differences across posterior opinions. i. Opinion 1: (a1 = a2 = 2.2 X 100, bl = bg = 100). Cancer rates for both types of counties are similar to the average rates across all counties from previous years. ii. Opinion 2: ((11 = 2.2 X 100,51 = 100,0.2 = 2.2,b1 = 1). Cancer rates in this year for nonreactor counties are similar to rates in previous years in nonreactor counties. We don't have much in formation on reactor counties, but perhaps the rates are close to those observed previously in nonreactor counties. iii. Opinion 3: (a1 = 0.2 = 2.2, bl = b2 = 1). Cancer rates in this year could be different from rates in previous years, for both reactor and nonreactor counties. (1) In the above analysis we assumed that population size gives no infor mation about fatality rate. Is this reasonable? How would the analysis have to change if this is not reasonable? e) We encoded our beliefs about 91 and 32 such that they gave no in formation about each other (they were a. prio'rz' independent). Think about why and how you might encode beliefs such that they were a. prio'm' dependent. _ " n
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started