Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Range: This is the simplest measure of dispersion, calculated as the difference between the maximum and minimum values. Visually, if data points are plotted on a number line, the range represents the total horizontal span or 'width' that the data covers from the leftmost point to the rightmost point.
Quartiles (): These values divide an ordered data set into four equal parts. The first quartile () is the percentile, the second quartile () is the median ( percentile), and the third quartile () is the percentile. In a box-and-whisker plot, these correspond to the start of the box, the line inside the box, and the end of the box respectively.
Interquartile Range (IQR): The IQR measures the spread of the middle of the data by calculating . Visually, the IQR is represented by the length of the central 'box' in a box plot. It is more robust than the range because it is not affected by extreme outliers at the ends of the distribution.
Standard Deviation (): This measure indicates the average distance of each data point from the mean (). A low standard deviation means the data points are clustered closely around the mean, while a high standard deviation indicates the data is spread out over a wider range. On a frequency curve, a smaller standard deviation creates a taller, narrower peak, while a larger one creates a flatter, wider curve.
Variance (): This is the square of the standard deviation. It provides a measure of how far a set of numbers are spread out from their average value. While standard deviation is in the same units as the data, variance is in squared units.
Outliers: These are extreme values that fall significantly outside the rest of the data. A common mathematical boundary for outliers is any value less than or greater than . Visually, outliers are often marked with individual dots or asterisks beyond the 'whiskers' of a box-and-whisker plot.
Effect of Transformations: If a constant is added to every data value (translation), the measures of dispersion (Range, IQR, SD) remain unchanged. However, if every data value is multiplied by a constant (scaling), the Range, IQR, and Standard Deviation are all multiplied by .
📐Formulae
💡Examples
Problem 1:
Find the Range and Interquartile Range (IQR) for the following set of test scores: .
Solution:
- Organize data: The data is already in ascending order: .
- Calculate Range: .
- Find Median (): , so median is the average of the and terms: .
- Find : The lower half is . .
- Find : The upper half is . .
- Calculate IQR: .
Explanation:
To find the spread of the middle half of the data, we split the ordered list into quartiles. Since there are 8 items, the lower and upper halves each contain 4 items, and we find the median of those subsets to get and .
Problem 2:
Calculate the standard deviation for the data set: .
Solution:
- Find the mean: .
- Find deviations from the mean: , , .
- Square the deviations: , , .
- Find the mean of the squared deviations (Variance): .
- Take the square root: .
Explanation:
Standard deviation measures the 'typical' distance from the mean. We square the differences to remove negative signs, average them (variance), and then return to the original units by taking the square root.