A large national survey conducted in 1995 indicated that 18 of American adults had ever been tested for HIV at some point in their life Suppose that in 2016 we take a simple random sample of 100 adults and find that 27 report that they have ever been tested a Assume that proportion of the population that has been tested for HIV has not changed since 1995 What is the probability that 27 or more adults out of 100 have been tested b What assumptions are you making about your data and the study design to compute this probability Clearly define the probability model that you are using c Based on your model, what is the (i) expected number tested for HIV out of a sample of 100 (ii) the variance and standard deviation d Based on this evidence, do you think that the proportion of adults tested for HIV has changed since 1995 Why or why not Unit II Continuous Probability Distributions The Normal Distribution Normal Dist1 Towards the Meaning of Continuous Probability Distribution Functions When we introduced probabilities, we spoke of discrete events S collection of all possible sample points, or elementary outcomes, ei 0 P(ei) 1 Probability of any event is between zero and one P(ei) 1 Probability of all elementary events sum to 1 (something happens) Normal Dist2 1 In particular, for the binomial distribution For the random variable X x stands for a particular value 0 P X x 1 The probability that the random variable X takes the value x is between 0 and 1, inclusive n P X x 1 x 0 The sum of the probabilities over all possible values of x is 1 Normal Dist3 A continuous variable has infinitely many possible values With infinitely many possible values, the probability of observing any one exact value is essentially zero Pr(X x) 0 e g , for x 1 0 vs 1 02 vs 1 0195 vs 1 01947, Pr(X x) is meaningless for an exact value x for a continuous random variable Instead, we consider a range of values for X Pr(a X b) Probability of X in the interval (a,b) We can make this range quite broad Pr(0 X ) or very narrow Pr(1 00 X 1 01) Normal Dist4 2 Comparing Probability Distributions for Discrete vs Continuous Random Variables We need new notation to describe probability distributions for continuous variables Discrete Continuous List all possible sample points, e g , State the range of possible values of X, e g , to S ei , i 1 to k 0 to to 0 Note is the symbol for 'infinity' Normal Dist5 For a continuous Random Variable, X, P(X x) 0 (prob of any exact value is zero) Instead, we use calculus to compute the probability of X within some interval b P a X b f x ( x)dx a This function is called the probability density function of X Don't worry if you don't know or have forgotten calculus, I won't be asking you to work with this notation Normal Dist6 3 Much of statistical inference is based upon a particular choice of a probability density function, fx(x) The Normal distribution This function is a mathematical model describing one particular pattern of variation of values It is appropriate for continuous numeric variables only Normal Dist7 The normal distribution function is appropriate for Many phenomena that occur naturally Special cases of other phenomena, e g , averages of phenomena that individually are not normally distributed For example, the sampling distribution of sample means may follow a normal distribution even when the underlying data are not normally distributed Normal Dist8 4 Notation X N(,2) We read this as X follows a Normal Distribution with mean and variance 2 or X is Normally distributed with mean and variance 2 Note It is the variance, not the standard deviation given in this notation and 2 are parameters of the Normal Distribution Normal Dist11 A Picture of the Normal Distribution fx x x The infamous Bell shaped Curve Normal Dist12 6 There are infinitely many normal distributions, each determined by different values of and 2 The Shape of the Normal Distribution is characteristically Smooth Defined everywhere on the real axis ( to ) Bell shaped Symmetric about the mean (it is defined in terms of deviations about the mean ) Normal Dist13 fx x x The area under the normal curve represents probability The total area under the curve 1 (That is, the total probability of some value across the full range of values is 1) ( x )2 2 1 2 Pr X e dx 1 2 Normal Dist14 7 If X follows a Normal Distribution Then 95 of the values of X are in the interval 1 96 99 of the values of X are in the interval 2 576 Normal Dist21 Why is the Normal Distribution so important There are two types of data that tend to follow a normal distribution 1 A number of naturally occurring phenomena For example heights of men (or women) total blood cholesterol of adults 2 Special functions of some non normally distributed phenomena, in particular sums and averages The sampling distribution of sample means tends to be Normal (Sample means are averages) Normal Dist22 11 1 Naturally occurring phenomena Phenomena that are subject to a wide range of causative factors tend to follow a normal distribution For example, heights of adult men are influenced by a large number of both genetic and environmental factors All together, across a population we observe a normal distribution of heights Normal Dist23 2 Special functions of some non normally distributed phenomena, in particular sums and averages Research often focuses on sample means Example Blood pressure can vary with time of day, stress, food, illness, etc One reading may not be a good representation of typical Distribution of a single reading of blood pressure for an individual tends to be right skewed, with a few high values Normal Dist24 12 To have a better gauge of an individual's BP, we might use the average of 5 readings The Sampling Distribution of the mean of 5 readings for an individual tends to be Normal, even when the original (or parent) distribution is not Normal Dist25 Towards the Central Limit Theorem Define an experiment Shake a pair of die On each roll, note the total of the two die faces This total can range from 2 to 12 Create a sample space listing all possible pairs of rolls (elementary outcomes) and assign probability to each outcome Define composite events as E1 Die sum to 2 E2 Die sum to 3, The most likely total is 7 (Why ) Normal Dist26 13 A Statement of the Central Limit Theorem For any population with mean and finite variance 2, the sampling distribution of means, xn, from samples of size n from this population, will be approximately normally distributed with mean , (same as population mean) and variance 2 n, for n large That is, for n large, and X (, 2) then Xn N (, 2 n) Normal Dist29 The Central Limit Theorem (CLT) is a key reason for our interest in the normal distribution Regardless of the underlying population distribution (normal or far from normal) If we take a large enough sample we can make probability statements about means from such samples based upon the normal distribution This is true, even when the underlying distribution is discrete Normal Dist30 15 Now, let a and b Then 1 X Z aX b X For X N(,2) Z N( , ) z a b 1 0 2 1 a 2 1 2 z Or 2 2 Z N(0,1) Normal Dist55 X N ( , 2 ) Z X N (0,1) We have transformed the original scale to units measured in multiples of standard deviations centered around zero A value of z 1 means the corresponding value of x is 1 standard deviation below the mean A value of z 2 5 means the corresponding value of x is 2 5 standard deviations above the mean Normal Dist56 28 This transformation is also important, because, for X N(,) if we want to know the probability of X in any range Pr(a X b) we can convert it to an equivalent calculation in terms of a standard normal a X b Pr(a X b) Pr b a Pr Z Normal Dist57 Word Problem The profit from the Massachusetts state lottery on any given week is distributed Normally with mean 10 0 million and variance 6 25 million dollars2 What is the probability that this week's profit is between 8 and 10 5 million Let X weekly profit in millions Then X N(,2) where 10 and 2 6 25 ( 2 5 ) What is Pr(8 X 10 5) Normal Dist58 29 What is Pr(8 X 10 5) Translate to Standard Normal 8 X 10 5 Pr(8 X 10 5) Pr 10 5 10 8 10 Pr Z 2 5 2 5 Pr 0 8 Z 0 2 8 8 z scale (std dev units) x scale (millions of $) 0 2 10 10 5 8 2 Pr(Z 0 2) Normal Dist59 Pr(Z

Question

A large national survey conducted in 1995 indicated that 18  of American adults had ever been tested for HIV at some point in their life  Suppose that in 2016 we take a simple random sample of 100 adults and find that 27 report that they have ever been tested  a  Assume that proportion of the population that has been tested for HIV has not changed since 1995  What is the probability that 27 or more adults out of 100 have been tested  b  What assumptions are you making about your data and the study design to compute this probability  Clearly define the probability model that you are using  c  Based on your model, what is the (i) expected number tested for HIV out of a sample of 100  (ii) the variance and standard deviation  d  Based on this evidence, do you think that the proportion of adults tested for HIV has changed since 1995  Why or why not  Unit II Continuous Probability Distributions  The Normal Distribution Normal Dist1 Towards the Meaning of Continuous Probability Distribution Functions  When we introduced probabilities, we spoke of discrete events  S   collection of all possible sample points, or elementary outcomes, ei 0 P(ei) 1 Probability of any event is between zero and one P(ei)   1 Probability of all elementary events sum to 1 (something happens) Normal Dist2 1 In particular, for the binomial distribution  For the random variable X  x stands for a particular value 0 P  X x  1 The probability that the random variable X takes the value x is between 0 and 1, inclusive  n P  X x  1 x 0 The sum of the probabilities over all possible values of x is 1  Normal Dist3 A continuous variable has infinitely many possible values  With infinitely many possible values, the probability of observing any one exact value is essentially zero   Pr(X x)    0 e g , for x 1 0 vs 1 02 vs 1 0195 vs 1 01947,     Pr(X x) is meaningless for an exact value x for a continuous random variable   Instead, we consider a range of values for X  Pr(a X b)   Probability of X in the interval (a,b)   We can make this range quite broad  Pr(0 X ) or very narrow  Pr(1 00 X 1 01) Normal Dist4 2 Comparing Probability Distributions for Discrete vs Continuous Random Variables We need new notation to describe probability distributions for continuous variables  Discrete Continuous List all possible sample points, e g , State the range of possible values of X, e g , to S  ei , i  1 to k  0 to to 0 Note  is the symbol for 'infinity' Normal Dist5 For a continuous Random Variable, X, P(X x)   0 (prob of any exact value is zero) Instead, we use calculus to compute the probability of X within some interval  b P a X b  f x ( x)dx a This function is called the probability density function of X  Don't worry   if you don't know or have forgotten calculus, I won't be asking you to work with this notation  Normal Dist6 3 Much of statistical inference is based upon a particular choice of a probability density function, fx(x)   The Normal distribution  This function is a mathematical model describing one particular pattern of variation of values  It is appropriate for continuous numeric variables only  Normal Dist7 The normal distribution function is appropriate for  Many phenomena that occur naturally  Special cases of other phenomena, e g , averages of phenomena that individually are not normally distributed  For example, the sampling distribution of sample means may follow a normal distribution even when the underlying data are not normally distributed  Normal Dist8 4 Notation  X   N(,2) We read this as    X follows a Normal Distribution with mean and variance 2    or   X is Normally distributed with mean and variance 2    Note  It is the variance, not the standard deviation given in this notation  and 2 are parameters of the Normal Distribution Normal Dist11 A Picture of the Normal Distribution fx x x The infamous   Bell shaped Curve   Normal Dist12 6 There are infinitely many normal distributions, each determined by different values of and 2  The Shape of the Normal Distribution is characteristically Smooth Defined everywhere on the real axis (  to ) Bell shaped Symmetric about the mean (it is defined in terms of deviations about the mean ) Normal Dist13 fx x x The area under the normal curve represents probability The total area under the curve   1 (That is, the total probability of some value across the full range of values is 1) ( x )2 2 1 2 Pr  X   e dx 1 2 Normal Dist14 7 If X follows a Normal Distribution Then   95  of the values of X are in the interval 1 96  99  of the values of X are in the interval 2 576 Normal Dist21 Why is the Normal Distribution so important  There are two types of data that tend to follow a normal distribution  1  A number of naturally occurring phenomena  For example   heights of men (or women) total blood cholesterol of adults 2  Special functions of some non normally distributed phenomena, in particular sums and averages  The sampling distribution of sample means tends to be   Normal  (Sample means are averages)  Normal Dist22 11 1  Naturally occurring phenomena  Phenomena that are subject to a wide range of causative factors tend to follow a normal distribution  For example, heights of adult men are influenced by a large number of both genetic and environmental factors  All together, across a population we observe a normal distribution of heights  Normal Dist23 2  Special functions of some non normally distributed phenomena, in particular sums and averages  Research often focuses on sample means Example  Blood pressure can vary with time of day, stress, food, illness, etc  One reading may not be a good representation of   typical   Distribution of a single reading of blood pressure for an individual   tends to be right skewed, with a few high values Normal Dist24 12 To have a better gauge of an individual's BP, we might use the average of 5 readings  The Sampling Distribution of the mean of 5 readings for an individual   tends to be   Normal, even when the original (or parent) distribution is not Normal Dist25 Towards the Central Limit Theorem Define an experiment  Shake a pair of die  On each roll, note the total of the two die faces  This total can range from 2 to 12  Create a sample space listing all possible pairs of rolls (elementary outcomes) and assign probability to each outcome Define composite events as E1  Die sum to 2 E2  Die sum to 3,     The most likely total is 7  (Why ) Normal Dist26 13 A Statement of the Central Limit Theorem For any population with mean and finite variance 2, the sampling distribution of means, xn, from samples of size n from this population, will be approximately normally distributed with mean , (same as population mean) and variance 2 n, for n large  That is, for n large, and X     (, 2) then Xn   N (, 2 n) Normal Dist29 The Central Limit Theorem (CLT) is a key reason for our interest in the normal distribution  Regardless of the underlying population distribution (normal or far from normal) If we take a large enough sample we can make probability statements about means from such samples based upon the normal distribution  This is true, even when the underlying distribution is discrete  Normal Dist30 15 Now, let a and b Then 1 X Z aX b X For X N(,2) Z   N( , ) z a b 1 0 2 1 a 2 1 2 z Or 2 2 Z   N(0,1) Normal Dist55 X   N ( , 2 ) Z X   N (0,1) We have transformed the original scale to units measured in multiples of standard deviations centered around zero  A value of z   1 means the corresponding value of x is 1 standard deviation below the mean A value of z 2 5 means the corresponding value of x is 2 5 standard deviations above the mean Normal Dist56 28 This transformation is also important, because, for X   N(,) if we want to know the probability of X in any range  Pr(a X b) we can convert it to an equivalent calculation in terms of a standard normal  a X b Pr(a X b) Pr b a Pr Z Normal Dist57 Word Problem The profit from the Massachusetts state lottery on any given week is distributed Normally with mean   10 0 million and variance   6 25 million dollars2  What is the probability that this week's profit is between 8 and 10 5 million  Let X   weekly profit in millions Then X   N(,2) where  10 and 2 6 25 (  2 5 ) What is Pr(8 X 10 5)   Normal Dist58 29 What is Pr(8 X 10 5)   Translate to Standard Normal  8 X 10 5 Pr(8 X 10 5) Pr 10 5 10 8 10 Pr Z 2 5 2 5 Pr 0 8 Z 0 2   8 8 z scale (std dev units) x scale (millions of $) 0  2 10 10 5   8  2 Pr(Z  0 2) Normal Dist59   Pr(Z

Accepted Answer

The Answer is in the image, click to view ...

Question

A large national survey conducted in 1995 indicated that 18% of American adults had ever been tested for HIV at some point in their life.

Step by Step Solution

Step: 1

Get Instant Access to Expert-Tailored Solutions

Step: 2

Step: 3

Ace Your Homework with AI

Recommended Textbook for

Algebra 1

Students also viewed these Mathematics questions

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question