By the Central
Limit Theorem (for “large” n), some statistics (not all) have an
approximate normal distribution. Thus, the normal distribution (a distribution
we have studied in detail) can be used to approximate certain probabilities.
Using the normal
approximation:
Step 1:
Is it a
binomial setting or not? Consider the variable being measured. Does it only
take two values? Or does it take a range of values? If it takes a range of
values, then it is not the binomial setting. If it only takes two values
(and the other binomial characteristics are present), then it is the binomial
setting (or approximate binomial setting—for example, we have approximate
independence if the sampling is done without replacement, but the population
size is 20 times the sample size).
Step 2:
Is it
appropriate to use the normal approximation? That is, will the approximation be
good? Note that it is always possible for the approximation to be used,
but it will not always provide a good approximation—this is an important
distinction for a statistician to make.
|
Setting |
Check |
|
Binomial |
|
|
Not binomial |
|
Recall if the original distribution is normal,
then the sample average and sample total have exact (not approximate) normal distributions, regardless of sample size.
Step
3:
For
what variable is the probability being calculated? Is it a total or an average?
|
Binomial Setting |
Not Binomial Setting |
|
Asking about a sample proportion or
percentage? Then use
|
Asking about a sample mean or average? Then use
|
|
Asking about a sample count or number? Then use X and standardize with
the appropriate mean and standard error:
|
Asking about a sample total or sum? Then use
|
Step
4:
Draw
the appropriate normal curve picture and standardize to obtain the probability
of interest (or use reverse look-up to determine the value of interest).
Motivation for the Rule-of-Thumb Check
when Using the Normal Approximation in the Binomial Setting
If
we use a normal distribution to approximate a binomial distribution, the
“tails” of the distribution must be cut off (the possible values for a binomial
are 0 to n, yet for a normal distribution the possible values extend along the
entire number line). This is not a big deal if the center of the distribution
is far enough from 0 and n that the lost tails have negligible area.
One possible rule-of-thumb:
Ensure
the mean is more than 3 standard deviations from both 0 and n:
![]()
![]()
This
gives us the rule-of-thumb:
. (This is probably the most
commonly-used rule-of-thumb, but is not the one used by our textbook.)
Another possible rule-of-thumb:
Ensure
the mean is more than 2 standard deviations from both 0 and n:
![]()
![]()
This
gives us the rule-of-thumb:
.
The textbook’s rule-of-thumb is in between these two (closer to the
second):
The normal distribution should be a good
approximation to binomial probabilities if
.