Statistics - Measures of Central Tendency (Mean, Median, Mode for grouped and ungrouped data)
Review the key concepts, formulae, and examples before starting your quiz.
πConcepts
Mean (Arithmetic Average): This is the sum of all observations divided by the total number of observations. Visually, the mean can be thought of as the 'balance point' of a data set; if you placed the data points on a seesaw, the mean is where the fulcrum would be to keep it perfectly level.
Median: The median is the middle-most value when data is arranged in ascending or descending order. If you visualize a line of students ordered by height, the student standing exactly in the center is the median height. For grouped data, the median is often located using an 'Ogive' or cumulative frequency curve, which is an S-shaped graph where the median corresponds to the -value at the position on the -axis.
Mode: The mode is the value that occurs most frequently in a data set. In a histogram (a bar graph where the area of bars represents frequency), the mode is found within the tallest bar, known as the modal class. A distribution can be unimodal (one mode), bimodal (two modes), or multimodal.
Grouped vs. Ungrouped Data: Ungrouped data is a simple list of numbers, while grouped data is organized into class intervals (e.g., ) with corresponding frequencies. To calculate the mean for grouped data, we use the 'class mark' (the midpoint of each interval) as the representative value for that group.
Assumed Mean Method: This is a technique to simplify mean calculations for large numbers. You pick a central value from the data (the 'assumed mean' ) and calculate deviations () from it. This shifts the entire data set toward zero on the number line, making the arithmetic easier.
Cumulative Frequency and the Ogive: The cumulative frequency is the running total of frequencies. When plotted against the upper class limits, it forms a 'Less Than Ogive'. To find the median visually, you draw a horizontal line from on the cumulative frequency axis to the curve, then drop a vertical line to the -axis; that -intercept is the median.
Empirical Relationship: In a moderately asymmetrical distribution, there is a fixed numerical relationship between the three measures. This can be visualized as a rule of thumb where the distance between the mean and the mode is roughly three times the distance between the mean and the median: .
πFormulae
Mean for ungrouped data:
Mean for grouped data (Direct Method):
Mean for grouped data (Assumed Mean Method): , where
Mean for grouped data (Step-deviation Method): , where and is class size
Median for ungrouped data ( is odd):
Median for ungrouped data ( is even):
Empirical Formula:
π‘Examples
Problem 1:
Find the mean of the following frequency distribution using the Assumed Mean Method: Class intervals with frequencies respectively.
Solution:
- Find Class Marks (): .
- Choose Assumed Mean .
- Calculate deviations : .
- Calculate : .
- Find sums: , .
- Apply formula: .
Explanation:
We use the Assumed Mean Method to reduce the size of the numbers being multiplied. By subtracting 25 from every class mark, we work with smaller integers, then add the average deviation back at the end.
Problem 2:
The marks obtained by 7 students in a test are: . Find the median marks.
Solution:
- Arrange the data in ascending order: .
- Count the number of observations: (which is odd).
- Use the formula for odd : .
- Calculate position: .
- Identify the 4th term: .
- The Median is .
Explanation:
Since the number of observations is odd, the median is the single value located exactly in the middle of the ordered list. There are 3 values smaller than 18 and 3 values larger than 18.