Math 207—Power of a Statistical Test (for a population mean, based on a “large” sample)

 

Recall there are two types of error in a significance test: 1) Rejecting the null hypothesis when, in fact, it is true (Type I error), and 2) accepting the null hypothesis when, in fact, it is false (Type II error). The probability of a Type I error is denoted by , and the probability of a Type II error is denoted by . [These are actually conditional probabilities:  and .

 

The power of a test is defined as . That is, the power is the probability we reject the null hypothesis when, in fact, it is false (this is a good thing). Hence, we’d like the power to be high. (The gold standard for power is 0.8)

__________________________________________________________________________________________

To find the power:

  1. At the appropriate significance level, find the “acceptance” region—that is, the “do-not-reject” region—for the test statistic (pay attention to the alternative hypothesis—the acceptance region depends on whether the alternative is one- or two-sided). [Note for this part you must consider both the significance level, , and the direction of the alternative hypothesis.]

 

  1. Rewrite the acceptance region in terms of . (At this point you’ve defined what it means to “accept .”)

 

  1. Given that the specified value of  (in the alternative hypothesis) is true, calculate the probability that  is in the “acceptance” region. That is, determine .

 

  1. Subtract this probability from 1 to determine the power.

__________________________________________________________________________________________

 

Application of these steps to a previous class example:

The mean yield of corn in the United States last year was about 120 bushels per acre. A farmer is interested in if the mean yield is different this year. A random sample of 40 farmers is taken and their yields are recorded. In this sample, s = 10.0 bushels/acre. Using a 0.05 significance level, find the power of the test if this year’s true mean corn yield is 124 bushels.

 

  1. Since it is a two-sided alternative hypothesis, we will reject the null hypothesis if the test statistic is too small or too big. We want to test at the 0.05 level, which means we want . Thus, we need to find the z-value that has area 0.975 to the left: z = 1.96 (or, 2, by the Empirical Rule).

 

We will reject the null hypothesis if the test statistic is smaller than –1.96 or greater than 1.96. Therefore, the “acceptance” region is , where z is the test statistic.

 

 

 

  1. Rewriting the acceptance region in terms of :

 

  1. Calculating:

 

 

 

 

 

  1. Since  = 0.2843, the power of the test is  

Example 1

What is the normal body temperature? A 1992 JAMA article suggests that the average body temperature may be less than 98.6 degrees Fahrenheit. A doctor plans to collect a random sample of 40 body temperatures. (From past experience, she thinks the standard deviation of body temperatures is  degrees.) She wants to test at the 0.05 significance level. Determine the power of the test of detecting a true mean of 98.4 degrees.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

What are the powers of detecting true means of 98.3, 98.2, and 98.1, respectively? We can use these values to sketch the power curve. (Note that, not surprisingly, the power increases as the specified value of the mean moves farther from the null hypothesis.) What two other ways (besides considering an average body temperature farther into the alternative hypothesis) are there to increase the power of a test?

 

 

 

 

 

 

 

Example 2 (Sample-size determination)

Suppose the average weekly salary for women in managerial and professional positions is $670. A researcher thinks that men in the same types of positions have average weekly salaries that are higher than $670. The researcher plans to randomly sample men in managerial and professional positions and record their weekly earnings. He plans to test at the 0.01 significance level, and with power 0.85 he’d like to be able to detect a true mean of $685. How large of a sample should he take? (Assume the standard deviation of the salaries for all men in managerial and professional positions is $100.)