Math 207 – Important Additions
to Chapters 1 and 2
·
Throughout
the course we will talk about the distribution of a variable or an estimator.
The distribution is defined as what values a variable takes on and how often.
In any graphical display of a distribution, look for the overall pattern and
for striking deviations from that pattern. The overall pattern of a
distribution can be described by its shape, center, and spread (i.e.,
variability).
·
The
textbook only discusses the range and the standard deviation in its section on
“measures of variability.” The interquartile range
(mentioned later in Chapter 2) is also a measure of variability. (Also note
that the standard deviation is affected by extreme values, whereas the interquartile range is not.)
·
The
mean is affected by outliers, while the median is not. Hence, for skewed
distributions, the median is a better measure of typical value. Now recall the
standard deviation measures the spread of a distribution around its mean. So if
the mean is not a good measure of center (e.g.,
for a skewed distribution), then the standard deviation shouldn’t be used as a
measure of spread. Therefore, for skewed distributions, the five-number summary
is a more appropriate and informative summary than simply reporting the mean and
standard deviation (for a symmetric, mound-shaped distribution, the mean and
standard deviation provide all the necessary information).
The next two bullet points address the
importance of writing well about statistics
·
Always put your analysis in context. That is, include information on the
individuals being measured, the variable being measured, and the units of
measurement. After you have written your analysis, ask yourself whether a
stranger could read your words and completely understand the topic and the
context.
Here
is an example progression from poor statistical writing to strong statistical
writing:
The
median is 75.
The
median is 75 beats per minute.
The
median resting heart rate is 75 beats per minute.
The
median resting heart rate of patients is 75 beats per minute.
The
median resting heart rate of recent patients at the Makemewell
Clinic is 75 beats per minute.
·
In a similar vein, be specific (rather
than general) in your descriptions, linking them to the setting of the problem.
Here
is an example progression from general (and non-informative) statistical
writing to specific statistical writing:
The
data are skewed.
The
data are skewed right.
The
distribution of resting heart rates is skewed right.
The
distribution of resting heart rates of recent patients at the Makemewell Clinic is skewed right.