Elementary Statistics—More Information about Significance Testing

 

In some sense I want you to become “bored” with the material, as this means you completely understand the bigger picture (with confidence intervals the bigger picture was  estimate  multiplier (standard error of the estimate)).

 

In significance testing, the bigger picture is 1) Determine the type of situation (e.g., one- or two-sample or paired-data) and define the appropriate hypotheses; 2) Decide whether a z-test or t-test is appropriate (and check conditions of the appropriate test); 3) Calculate the appropriate test statistic (in all cases this is some sort of standardization); 4) Determine the p-value (remember this depends on the alternative hypothesis); and 5) Define and interpret the p-value in the context of the problem and provide a conclusion (which might depend on a given significance level, ). And it’s important to draw and carefully label a picture (a normal curve or t-distribution curve) as part of the solution process.

 

Furthermore, there is a direct relationship between confidence intervals and two-sided significance tests, which is included in the summaries below. And there is an important issue called practical significance that should be addressed.

 

Relationship between Confidence Interval and Significance Test for a Single Population Mean:

A level , two-sided significance test rejects the hypothesis exactly when the value falls outside the confidence interval for. (Put another way, the significance test does not reject the hypothesis if the value  falls inside the corresponding confidence interval.) This holds whether the population standard deviation is known (and a z-test is used) or if the population standard deviation is not know (and a t-test is used).

 

More Explanation of the Relationship between Confidence Interval and Significance Test

For a more mathematical explanation of the previous result, consider a two-sided test using significance level (assuming the population standard deviation is known). Then the “acceptance region” (really the “do-not-reject region”) for the test statistic is between -1.96 and 1.96. Then,

which says the value of  is inside the 95% confidence interval.

 

Relationship between Confidence Interval and Significance Test for a Difference in Population Means:

A level , two-sided significance test rejects the hypothesis exactly when the value 0 falls outside the confidence interval for. (Put another way, the significance test does not reject the hypothesis if the value 0 falls inside the corresponding confidence interval.)

 

Practical Significance versus Statistical Significance

It’s possible for test results to be statistically significant, yet not practically significant. For example, you might find a statistically significant difference in means (i.e., you can reject the null hypothesis that the means are the same), yet in the context for the problem the difference might not be practically important. (For example, perhaps you find a significant difference in average decrease in cholesterol for patients taking a drug versus patients taking a placebo, but the magnitude of the difference in only 5 mg/dL. Doctors probably won’t find this practically important—certainly not important enough to put their patients on that drug.)

 

Hence, if you find statistically significant test results, it’s a good idea to accompany your results with a corresponding confidence interval (to assess the practical significance—but realize it’s an expert in the field, not necessarily a statistician, who should assess the practical importance).