Statistics and Probability - Measures of central tendency for grouped and ungrouped data (mean, median, mode)
Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Measures of Central Tendency provide a single value that represents the 'center' or 'typical' value of a dataset. In a visual representation like a dot plot, the central tendency indicates where most of the data points cluster together.
The Mean (Arithmetic Average) is calculated by summing all data values and dividing by the total count. For grouped data, we estimate the mean by assuming all values in an interval occur at the mid-interval value, which acts as the balance point for that specific group.
The Median is the middle value when data is arranged in ascending order. If there is an even number of observations, it is the average of the two middle terms. In a histogram, the median is the value that divides the total area of the bars into two equal halves.
The Mode is the most frequently occurring value in a dataset. In a frequency table or bar chart, the mode is easily identified as the category with the highest frequency or the tallest bar. A dataset can be bimodal (two modes) or have no mode if all values appear with equal frequency.
Grouped Data involves organizing data into intervals or classes (e.g., ). Because individual data points are lost, we use the 'Modal Class' (the interval with the highest frequency) and the 'Mid-interval value' to perform calculations.
Mid-interval values are essential for calculating the mean of grouped data. You find the midpoint by calculating the average of the lower and upper boundaries of the class. Visually, this is the center point on the x-axis for each bar in a frequency histogram.
The choice of measure depends on the data distribution. The mean is sensitive to outliers (extreme values that sit far away from the main cluster on a number line), whereas the median is more robust and provides a better 'middle' representation for skewed data.
Cumulative Frequency is often used to find the median for grouped data. By plotting a cumulative frequency curve (an S-shaped curve or ogive), the median is the x-value corresponding to the 50% mark on the y-axis.
📐Formulae
Mean for ungrouped data:
Mean for grouped data: (where is the mid-interval value and is the frequency)
Position of the median in a sorted list of items:
Mid-interval value:
Total frequency:
💡Examples
Problem 1:
Find the mean, median, and mode for the following set of test scores: .
Solution:
- Mean: Sum the values: . Divide by the count (): . \n2. Median: Sort the data: . The middle value (3rd position) is . \n3. Mode: The value appears twice, while others appear once. Mode = .
Explanation:
This example demonstrates central tendency for a small, ungrouped dataset. The mean provides the average, the median shows the central score after ordering, and the mode identifies the most common score.
Problem 2:
Calculate the estimated mean for the following grouped frequency table: \n- Weight : Frequency \n- Weight : Frequency \n- Weight : Frequency
Solution:
- Find Mid-intervals (): \n - For : \n - For : \n - For : \n2. Calculate : \n - \n - \n - \n3. Sum the totals: \n - \n - \n4. Calculate Mean: \n - kg.
Explanation:
For grouped data, we use the midpoint of each class as a representative value for that group to estimate the sum of all values before dividing by the total frequency.