Chapter 11 Language of Descriptive Statistics
Section 11.3 Statistical Measures11.3.3 Measures of Dispersion
Means and quantiles are measures of position, i.e. they give information on the absolute position of the qualitative values . If we add a constant to every value , then the position measures also increase by . In contrast, measures of dispersion are measures that give information on the dispersion or relative distribution of the data values independent of their absolute position. Consider a sample of size of a quantitative property . Let the original list be given by .
The sample variance is a measure of dispersion that describes the variability of the observation sample. The smaller the variance the "closer" the data values lie to each other. A variance is only possible if all data values are equal. Typically, it strongly increases with increasing . The standard deviation is a more appropriate measure for the "broadness" of the distribution of data values. The two formulas given above have a few pitfalls:
- Before the variance can be calculated the mean must already be known.
- The fact that in the definition of is divided by and not by is for deeper mathematical reasons that can only be discussed in a statistics lecture.
- The notation is a little misleading. You must not cancel the square by the square root, since the sum must be calculated (and this value is not defined as a single square) to determine .
- Be careful using a scientific calculator with statistical functions: the sample variance is available via the key. The key, however, provides the sum with denominator instead of . This is not the sample standard deviation.
Example 11.3.16
The data sequence has the mean and the sample standard deviation
Adding further zeros to the data sequence does not change the position measure , but the measure of deviation ,does change since the data values here are more strongly concentrated at the mean. In contrast, shifting all data values by a constant does not change the variance. For example, the data sequence has also variance .
Adding further zeros to the data sequence does not change the position measure , but the measure of deviation ,does change since the data values here are more strongly concentrated at the mean. In contrast, shifting all data values by a constant does not change the variance. For example, the data sequence has also variance .
Exercise 11.3.17
A data sequence (with an unknown number of values) has the measures , , and the median . Suppose the values of a second data sequence satisfy the equation for every . What are its measures?
Answer: the measures are
,
, and
.
Hint: recall the definitions of the mean, the sample variance, and the median consider how multiplying all -values by a factor of influences the entire expression.
Answer: the measures are
,
, and
.
Hint: recall the definitions of the mean, the sample variance, and the median consider how multiplying all -values by a factor of influences the entire expression.