Math 207—Summary of
Large-Sample Significance Tests
As I mentioned
when we discussed confidence intervals, in some sense I want you to become
“bored” with the material, as this means you completely understand the bigger
picture (with confidence intervals the bigger picture was estimate
multiplier
(standard error of the estimate)).
In
significance testing, the bigger picture is 1) Determine the type of situation (e.g., one- or two-sample, mean
or proportion) and define the appropriate hypotheses; 2) Decide whether a large-sample test is appropriate; 3) Calculate the appropriate test
statistic (in the large-sample cases the test statistic is always a z-value); 4) Determine the p-value (remember this depends on the alternative hypothesis); and 5) Define and interpret the p-value in the context of the problem
and provide a conclusion (which might depend on a given significance level,
).
And it’s important to draw and carefully
label a picture (a normal curve) as part of the solution process.
Furthermore, there is a direct relationship between confidence
intervals and two-sided significance tests, which is included in the summaries
below. And there is an important issue called practical significance that should be addressed.
Important Note: The textbook describes a “critical-value
approach” to significance testing. This method is out-dated and limits your
conclusion to a certain significance level. The p-value approach allows you to make conclusions for any
significance level (and it provides more useful information). You must know the p-value approach.
Practical
Significance versus Statistical Significance
It’s possible for test results to be statistically significant,
yet not practically significant. For example, you might find a statistically
significant difference in means (i.e.,
you can reject the null hypothesis that the means are the same), yet in the
context for the problem the difference might not be practically important. (For
example, perhaps you find a significant difference in average decrease in
cholesterol for patients taking a drug versus patients taking a placebo, but
the magnitude of the difference in only 5 mg/dL. Doctors
probably won’t find this practically important—certainly not important enough
to put their patients on that drug.)
Hence, if you find statistically significant test results, it’s
a good idea to accompany your results with a corresponding confidence interval
(to assess the practical significance—but realize it’s an expert in the field,
not necessarily a statistician, who should assess the practical importance).
Large-Sample
Significance Test for a Population Mean
Suppose we have a large (
),
random sample from a population with unknown mean,
.
Furthermore, suppose we want to test the null
hypothesis,
.
Then we first calculate the test
statistic,
(you can estimate the population standard
deviation with the sample standard deviation), which tells us the number of
standard errors our particular sample average is from the null-hypothesized
population mean. Then by the Central Limit Theorem, we can use the standard normal
distribution to determine the approximate p-value
(recall the p-value depends on the
direction of the alternative hypothesis):
Finally, we define and
interpret the p-value in the context of the problem and provide a conclusion
(which might depend on a given significance level,
).
Relationship between Confidence Interval and Significance Test:
A level
, two-sided significance test rejects the hypothesis
exactly
when the value
falls outside
the
confidence interval for
.
(Put another way, the significance test does not reject the hypothesis if the
value
falls inside the corresponding confidence
interval.)
More
Explanation on the Relationship between Confidence Interval and Significance
Test
For a more mathematical explanation of the previous result,
consider a two-sided test using significance level
.
Then the “acceptance region” (really the “do-not-reject region”) for the test
statistic is between -1.96 and 1.96. Then (via simple algebra),
![]()
which says the value of
is inside the 95% confidence interval.
Large-Sample
Significance Test for a Population Proportion
The textbook discusses a large-sample test for a population
proportion. We do not often use this test in practice, because it’s uncommon to
have a situation where there is a precise value of p we want to test. Hence, when doing inference about a single
population proportion, we’ll stick with a confidence
interval. (That is, you can skip textbook section 9.5, with the exception
of practical significance, which you
need to know.)
Large-Sample
Significance Test for a Difference in Population Means
Suppose we have two distinct populations with unknown means,
and
.
Furthermore, suppose we have large
independent, random samples from each
population. Lastly, suppose we want to test the null hypothesis,
(this will always be the default null
hypothesis). Then we first calculate the
test statistic,
(subtracting 0 clearly does nothing to the
numerical value, but I include it to emphasize this is a z-value), which tells
us the number of standard errors our particular difference in sample averages
is from the null-hypothesized difference in population means. Then by the
Central Limit Theorem, we can use the
standard normal distribution to determine the p-value (recall the p-value depends on the direction of the
alternative hypothesis—in the same way described in detail for the one-sample
test).
Finally, we define and
interpret the p-value in the context of the problem and provide a conclusion
(which might depend on a given significance level,
).
Relationship between Confidence Interval and Significance Test:
A level
, two-sided significance test rejects the hypothesis
exactly
when the value 0 falls outside the
confidence interval
for
.
(Put another way, the significance test does not reject the hypothesis if the value 0 falls inside the corresponding
confidence interval.)
Large-Sample
Significance Test for a Difference in Population Proportions
Suppose we have two distinct populations with unknown “success”
proportions,
and
.
Furthermore, suppose we have large (
),
independent, random samples from each population. Lastly, suppose we want to
test the null hypothesis,
(this will always be the default null
hypothesis).
Brief
Remark (needed to better understand the test statistic)
Unlike the confidence-interval situation, there is information
in the null hypothesis we can use to better estimate the standard error of
.
Recall the standard error of
is
.
Since all our calculations are done assuming the null hypothesis is true, for
our test statistic we can assume
(let’s call this common success probability, p). Then the standard error reduces to
. To
estimate the unknown value of p , we can pool our information from both samples:
, where
is the number of successes in the first sample
and
is the number of successes in the second
sample.
Back
to the significance test…
Then we first calculate the
test statistic,
, where
(subtracting 0 clearly does nothing to the
numerical value, but I include it to emphasize this is a z-value), which tells
us the number of standard errors our particular difference in sample
proportions is from the null-hypothesized difference in population proportions.
Then by the Central Limit Theorem, we can use
the standard normal distribution to determine the p-value (recall the p-value depends on the direction of the
alternative hypothesis—in the same way described in detail for the one-sample
test).
Finally, we define and
interpret the p-value in the context of the problem and provide a conclusion
(which might depend on a given significance level,
).
Relationship between Confidence Interval and Significance Test:
When comparing population proportions the confidence interval
and test statistic have slightly different forms. Hence, there isn’t an exact
relationship between the interval and test, but it’s still a closely
approximate relationship: A level
, two-sided significance test rejects the hypothesis
approximately
when the value 0 falls outside the
confidence interval for
.
(Put another way, the significance test does not reject the hypothesis if the
value 0 falls inside the corresponding confidence interval.)