Math 207 Solutions – Assignment 7

 

9.8

Note that , so it is appropriate to use the large-sample significance test for a population mean, since the distribution of the sample average is approximately normal. (Also note this example is completely free of context—a contrived problem with “floating numbers in space.” Not very exciting, but at least you get practice calculating the power.)

 

 

a.      Recall the probability of a Type II error is . Then, for a 0.05 significance level and a one-sided (greater than) alternative hypothesis, what does it mean to “accept ”? At the 0.05 significance level, the “acceptance” (that is, “do not reject”) region for the test statistic, z, is  (since it is a one-sided alternative). In terms of  the acceptance region is

 

(Although I do not include a picture with this solution—only because of the limitations of Word—I strongly encourage you to draw one, and I might require an appropriate normal-curve graph on the exam.)

 

So the acceptance region is and the rejection region is .

 

b.      If  then  

 

 

c.       If  then  (this makes sense, given that 2.3 is the null-hypothesized value of the mean, and we defined our test to have a 0.05 Type I error rate)

 

If  then

 

If  then  (technically, this value greater than zero, but it’s not included on Table 3, and practically it’s very close to zero)

 

 

d.      Since power =  we have the following power values:

power

2.3

1 – 0.9484 = 0.0516

2.4

1 – 0.3409 = 0.6591

2.5

1 – 0.0071 = 0.9929

2.6

1 – 0 = 1

The power curve is shown below. (As we discussed in class, the farther you move into the alternative hypothesis, the higher the power of detecting that particular value.) Note: It’s fine if you connect the dots with a smooth curve.

 

 

9.23

Note that both sample sizes are larger than 30, so it is appropriate to use the large-sample inference procedures (the difference in sample averages will follow an approximate normal distribution).

a.      Let be the mean lead level in all of Section 1, and let  be the mean lead level in all of Section 2. Then, the hypotheses are  and . The test statistic is . (That is, our difference in sample proportions is 2.26 standard deviations below the null-hypothesized value.) Then the p-value is . (The p-value is doubled because the alternative hypothesis is two-sided.) Although a normal curve picture is not included with these solutions, it is an important part of the solution process—I haven’t included it simply because I can’t draw them in Word.

 

If the population lead level means are really the same, there is only a 2.38% chance that we would obtain our sample data or data more extreme. This is very strong evidence against the null hypothesis (and strong evidence that there is a difference in average lead levels for these different sections of the city). The difference is significant at the 0.05 level (since 0.0238 < 0.05).

 

b.      The 95% confidence interval is  =  (-3.55 parts per million, -0.25 parts per million). Aside: note the null hypothesized value of 0 parts/million is not included in this confidence interval. Therefore, the difference is statistically significant at the 0.05 level. This agrees with our answer in part a.

 

c.       In practice, there is only concern if the mean difference is more than 5 parts per million. Although we found statistically significant evidence that the mean lead levels are different, our confidence interval does not include the value –5 parts/million. Hence, the results are not practically significant (that is, not of practical importance to the environmental engineers).

 

 

9.46

a.      Let  be the proportion of adults with children that are frequent moviegoers and let  be the proportion adults without children that are frequent moviegoers. We want to test the hypotheses  and .

 

Since , , , and , we can use the large-sample significance test for two proportions (the difference in sample proportions will follow an approximate normal distribution).

 

For the sample of adults with children, . For the sample of adults without children, . Furthermore, the pooled estimate of the common moviegoer proportion (under the null hypothesis) is .

 

Then the test statistic is . (That is, our difference in sample proportions is 0.74 standard deviations away from the null-hypothesized difference.)

 

And the p-value is . (The p-value is doubled because the alternative hypothesis is two-sided.) Although a normal curve picture is not included with these solutions, it is an important part of the solution process—I haven’t included it simply because I can’t draw them in Word.

 

If the moviegoer proportions are really the same, then the probability of obtaining our data or data more extreme is 0.4592. This is not at all unusual and provides no evidence against the null hypothesis. The results are not statistically significant at any reasonable significance level. That is, we have no evidence that the proportion of moviegoer parents is different from the proportion of moviegoer non-parents.

 

a.      If a significant difference were found (say, that the population of adults without children has a higher proportion of moviegoers), then appropriate advertising strategies could be employed (e.g., no advertising of children’s movies or toys). We’d need to make sure, though, that the significant difference was actually practically significant (this would be a question for the marketing team).

 

 

Extra Problem

Recall the definition of the probability of a Type II error:  . In this problem, we must determine the sample size, n. Let  be the mean systolic blood pressure (in mm Hg) for the company’s young male executives. Then we want to test the hypotheses

 

(Although I do not include pictures with this solution—only because of the limitations of Word—I strongly encourage you to draw them, and I might require appropriate normal-curve graphs on the exam.)

 

Step 1

For 0.05 significance level and a one-sided (greater than) alternative hypothesis, what does it mean to “accept ”? At the 0.05 significance level, the “acceptance” (that is, “do not reject”) region for the test statistic, z, is  (since it is a one-sided alternative).

 

Step 2

In terms of  the acceptance region is

 

So we “accept ” if  .

 

Step 3

Then, because we’re interested in the specific average value of 133 mm Hg in the alternative,

 Now we standardize with the 133 mm Hg value of the mean (keeping the standard deviation the same):

.

 

Step 4

We want a power of 0.8, which means  Hence, we need to find n such that

 . Since  is the 20th percentile of the standard normal distribution (reverse look-up in Table 3), we have:

. Since we always round up on sample-size-determination problems, the director should take a sample of size 56. (Note that this sample is “large”, at least 30, so the method we used—the large-sample test—is appropriate.)

 

More explanation of power:

The medical director decided (based on his expert opinion) the value in the alternative that he wants the test to detect with a high probability. Based on his hypotheses and a 0.05 significance test, if samples of size n=56 are repeatedly taken, then 80% of the tests will reject  when the true mean is really 133 mm Hg (that is, the testing method makes the correct decision—when 80% of the time).

 

 

Solutions to Additional Problems (not to be turned in, but necessary for exam preparation)

 

10.9 (only part a)

Let  be the mean carbon monoxide diffusing capacity for all smokers.

We want to test

Note that the sample size is small (n = 20 < 30), so we need to use the t-procedures (rather than z-procedures). The distribution of the sample diffusing capacities is mound-shaped, so the condition of a normal population is met.

 

The test statistic is .

 

The p-value is , where T has a t-distribution with 19 degrees of freedom. Because of symmetry of the t-distribution, from Table 4 we can say that the p-value is less than 0.005 (if you say the p-value is essentially 0, then that’s fine, too). (Although a picture is not included with these solutions, it is an important part of the solution process—I haven’t included it simply because I can’t draw them in Word.)

 

If the mean carbon monoxide diffusing capacity for all smokers is really 100 DL, then there is less than a 0.5% chance that we would obtain our particular sample mean or a smaller mean. This is very surprising; therefore, we have strong evidence against the mean equaling 100 DL. Furthermore, since p-value < 0.005 < 0.01, the results are statistically significant at the a = 0.01 level.

 

 

10.26

a.      Let  be the mean compartment pressure for the population of runners at rest and let be the mean compartment pressure for the population of cyclists at rest. Then the hypotheses are:

 

Note that the sample sizes are small (n = 10 < 30), so we should use the t-procedures (rather than the z-procedures). We are not given graphical information about the samples, so we cannot be sure that they look mound-shaped. Once we complete our analysis, we should state that it is contingent on the original populations being close to normal. Furthermore, we need to assume that the data are from random samples of people.

 

Since by our rule of thumb we can reasonably assume the population variances are equal. Then the pooled estimate of the population variance is:

The test statistic is

The p-value is , where T has a t distribution with (10 + 10 – 2) = 18 degrees of freedom. From Table 4, we can say that the p-value is between 0.05 and 0.10 (Minitab gives the exact value: 0.0708). Hence, the results are not statistically significant at the 0.05 level. (Although a picture is not included with these solutions, it is an important part of the solution process—I haven’t included it simply because I can’t draw them in Word.)

 

b.      Let  be the mean compartment pressure for the population of runners at 80% maximal and let be the mean compartment pressure for the population of cyclists at 80% maximal.

 

Note that the sample sizes are small, so we should use the t-procedures (rather than the z-procedures). We are not given graphical information about the samples, so we cannot be sure that they look mound-shaped. Once we complete our analysis, we should state that it is contingent on the original populations being close to normal. Furthermore, we need to assume that the data is from a random sample of people.

 

Since by our rule of thumb we can reasonably assume the population variances are equal. Then the pooled estimate of the population variance is:

 

There are 18 degrees of freedom and we want 95% confidence, so  Then a 95% confidence interval for (-) is: (-3.32, 4.72)

 

We are 95% confident that the difference in mean compartment pressures at 80% maximal between runners and cyclists is between –3.32 mm Hg and 4.72 mm Hg. (Per usual, our confidence is in the method we use to create the interval—the method is correct 95% of the time.) Note that 0 is included in the interval, so the difference is not statistically significant at the 0.05 level.

 

c.       At maximal oxygen consumption, so by our rule of thumb, we should not use the pooled t-test. (We should use the two-sample t-test that does not assume equal variances. This has a grungy formula for calculating the degrees of freedom, but Minitab can easily do this calculation for us.)

 

 

10.41

a.      Each subject was presented with both signs in random order. If the subject’s reaction time in general is high, both responses will be high, and vice versa. If a matched pairs design was not used, the large variability from subject to subject might mask the variability due to the difference in sign types. The matched pairs design eliminates the subject-to-subject variability.

 

b.      Let  be the mean for the population of (prohibitive sign – permissive sign) reaction differences. Then, the hypotheses are: .

 

Since the sample size is small (n = 10 < 30), the t-procedures should be used (rather than the z-procedures). Note that the sample distribution of differences looks fairly skewed (this is easy to see in a histogram from Minitab), therefore we should be leery of using the t-procedures (I will carry out the test for the homework, but in practice, the t-procedures perhaps should not be used—but it turns out the data are very, very significant, so a non-parametric test will almost surely agree with the t-test results, so we needn’t worry).

The test statistic is . The p-value is where T has a t-distribution with 9 degrees of freedom. If the population mean difference in reaction times is really 0 milliseconds, there is essentially no chance that we would obtain our particular sample average or a more extreme average. This is very strong evidence that the mean difference is different from 0 milliseconds (and, in fact, greater than 0). The difference is significant at the 0.05 level. (Although a picture is not included with these solutions, it is an important part of the solution process—I haven’t included it simply because I can’t draw them in Word.)

 

c.       There are 9 degrees of freedom and we want 95% confidence, so  Then the 95% confidence interval is: = (80.51 milliseconds, 133.29 milliseconds). Note that 0 is not included in the interval, showing the results are statistically significant at the 0.05 level (as we already mentioned in part b). Furthermore, we can see that this result is probably practically significant as well as statistically significant.

 

10.88

Let  be the mean weight loss (initial weight – final weight) for the population of obese people on the diet. Since the sample size is small (n = 8 <30), the t-procedures should be used (rather than the z-procedures). Furthermore, a paired t-test should be used. Note that the sample distribution of weight losses is slightly skewed right (although it is difficult to determine with only 8 observations), but the t-procedures should be robust enough to handle this (since it’s only a slight violation of the normality condition). Also, we must assume that the obese people were randomly selected.

 

There are 7 degrees of freedom and if we want 95% confidence,  Then the 95% confidence interval for  is: = (35.845 pounds, 48.405 pounds). We are 95% confident that the mean weight loss for all obese people on the diet is between 35.845 pounds and 48.405 pounds. Our confidence is in the method we use—if the sampling were (hypothetically) done repeatedly, then 95% of the intervals created would contain the true mean weight loss. Note that 0 pounds is not included in the confidence interval (i.e., it is not among the likely values at the 95% confidence level), so the results are statistically significant at the 0.05 level. Furthermore, based on the confidence interval endpoints, it seems these results are not only statistically significant, but also of practical importance (that’s a lot of pounds lost!).

 

 

10.100

Let  be the mean weight for all cans. Since the sample size is small (n = 9 < 30), t-procedures should be used (rather than z-procedures). We are not given graphical information about the sample, so we cannot be sure that it looks mound-shaped. Once we complete our analysis, we should state that it is contingent on the original population being close to normal.

 

We want to test

 

The test statistic is .

 

The p-value is , where T has a t-distribution with 8 degrees of freedom. From Table 4, we can say that the p-value is between 0.05 and 0.10 (Minitab gives the exact value: 0.055). Hence, the results are not statistically significant at the 0.05 level (that is, we do not have sufficient evidence to indicate the mean weight is less than that claimed on the label).