Math 217—One-Factor ANOVA Example

 

 

Many studies have suggested that there is a link between exercise and healthy bones. Exercise stresses the bones and this causes them to get stronger. One study (done at Purdue University) examined the effect of jumping on the bone density of growing rats. There were three treatments: a control with no jumping, a low-jump condition (the jump height was 30 cm), and a high-jump condition (60 centimeters). After 8 weeks of 10 jumps per day, 5 days per week, the bone density of the rats (expressed in mg/cm3) was measured. Comparative boxplots and separate numerical summaries for the treatment groups are shown below.

 

 

 

 

Variable               Treatment        N     Mean   StDev   Minimum       Q1   Median       Q3   Maximum

Bone Density (mg/cm3)  1 - Control     10   601.10   27.36    554.00   587.00   601.50   615.75    653.00

                       2 - Low Jump    10   612.50   19.33    588.00   595.50   606.00   632.75    638.00

                       3 - High Jump   10   638.70   16.59    622.00   625.00   637.00   650.00    674.00

 

Per usual, the first step in any statistical analysis is to summarize the variables graphically and numerically (this gives you a first look descriptively that allows you, among other things, to see if there are any issues with the data—for example, outliers or mis-recorded observations).

 

We’d like to test  against at least two of the are different. First let’s assess the equal-variances condition of the ANOVA test. From the boxplots, the variability in bone densities looks somewhat similar for the three treatment groups (not exactly the same, but not wildly different). Using our rule of thumb, the largest standard deviation (27.36 mg/cm3) is less than twice the smallest standard deviation (16.59 mg/cm3), so it seems feasible that the equal-variance condition on the errors is met).

 

From Minitab we get the following information, as well as graphs (histogram and normal probability plot) of the residuals.

 

One-way ANOVA: Bone Density (mg/cm3) versus Treatment

Source     DF     SS    MS     F      P

Treatment   2   7434  3717  7.98  0.002

Error      27  12580   466

Total      29  20013

 

S = 21.58   R-Sq = 37.14%   R-Sq(adj) = 32.49%

 

 

 

Recall the residual plots allow us to assess the normality assumption (since checking the normality separately for each treatment group can be difficult if there are few observations). The residuals seem to roughly follow a normal distribution, so this assumption seems reasonable. Note the P-value for our test is 0.002. That is, assuming the average bone density is the same for all rats regardless of treatment, there is only a 0.002 chance of getting our particular sample results or more extreme results. This gives strong evidence that there is a difference in average bone density for at least two of the treatment groups. (Although the experimental treatment only explains 37% of the variation in bone densities.) This begs the question: between which treatment groups is there a significant difference in average bone density?


Multiple Comparisons

Since we found a significant difference between at least two means, it’s appropriate to now compare the means pairwise. Included below is the Minitab output for Tukey’s method (which adjusts for multiple comparisons, and has a family error rate of 5%) and for Fisher’s method (which doesn’t adjust for multiple comparisons). Not adjusting for multiple comparisons (Fisher’s method) is only appropriate if you’re simply exploring the data looking for interesting effects to investigate in another experiment.

 

Notice that, in this case, regardless of whether we adjust for multiple comparisons we see a significant difference in average bone density between the control group and the high-jump group and between the low-jump group and high-jump group (but not between the control group and the low-jump group). We can also use these intervals to assess the practical significance of the differences.

 

Tukey 95% Simultaneous Confidence Intervals

All Pairwise Comparisons among Levels of Treatment

 

Individual confidence level = 98.04%

 

Treatment = 1-Control subtracted from:

 

Treatment       Lower  Center  Upper  -------+---------+---------+---------+--

2-Low Jump     -12.56   11.40  35.36               (-------*-------)

3-High Jump     13.64   37.60  61.56                        (-------*-------)

                                      -------+---------+---------+---------+--

                                           -30         0        30        60

 

Treatment = 2-Low Jump subtracted from:

 

Treatment      Lower  Center  Upper  -------+---------+---------+---------+--

3-High Jump     2.24   26.20  50.16                    (-------*-------)

                                     -------+---------+---------+---------+--

                                          -30         0        30        60

 

 

Fisher 95% Individual Confidence Intervals

All Pairwise Comparisons among Levels of Treatment

 

Simultaneous confidence level = 88.07%

 

Treatment = 1-Control subtracted from:

 

Treatment      Lower  Center  Upper  -----+---------+---------+---------+----

2-Low Jump     -8.41   11.40  31.21              (------*-----)

3-High Jump    17.79   37.60  57.41                       (------*-----)

                                     -----+---------+---------+---------+----

                                        -30         0        30        60

 

Treatment = 2-Low Jump subtracted from:

 

Treatment      Lower  Center  Upper  -----+---------+---------+---------+----

3-High Jump     6.39   26.20  46.01                   (------*-----)

                                     -----+---------+---------+---------+----

                                        -30         0        30        60

 

Relationship to Regression

The ANOVA table for a one-way analysis of variance is exactly the same as the ANOVA table in a regression analysis when using indicator variables (i.e., “dummy” variables). If indicator variables are created for the first two levels of the experimental treatment (control and low-jump), and then a regression is run with bone density as the response and control-treatment and low-jump-treatment as the predictors, the Minitab ANOVA information is exactly the same as that from the one-way ANOVA analysis:

 

 

Analysis of Variance

Source          DF       SS      MS     F      P

Regression       2   7433.9  3716.9  7.98  0.002

Residual Error  27  12579.5   465.9

Total           29  20013.4

 

S = 21.5849   R-Sq = 37.1%   R-Sq(adj) = 32.5%