krit.club logo

Statistics and Probability - Measures of Dispersion (Mean Deviation, Variance, Standard Deviation)

Grade 11ICSE

Review the key concepts, formulae, and examples before starting your quiz.

🔑Concepts

Dispersion refers to the extent to which numerical data is spread out or scattered around a central value (like the mean or median). Visually, if you plot data points on a number line, a low dispersion results in a dense cluster of points near the center, while high dispersion shows points widely distributed across the line.

Mean Deviation (M.D.) is the arithmetic mean of the absolute values of the deviations of individual observations from a central value. By taking the absolute value xiA|x_i - A|, we ignore the sign of the deviation. Visually, this is equivalent to measuring the distance of each point from the center and treating all distances as positive, regardless of whether the point lies to the left or right of the center.

Variance (σ2\sigma^2) is the average of the squares of the deviations from the mean. Squaring the deviations ensures all terms are non-negative and gives greater weight to outliers (points far from the mean). In a graph of squared deviations, larger distances from the mean result in disproportionately larger areas, contributing to a higher variance.

Standard Deviation (σ\sigma) is the positive square root of the variance. It is preferred over variance because it is expressed in the same units as the original data. On a Normal Distribution curve (bell curve), the standard deviation defines the 'width' of the curve; a smaller σ\sigma creates a steep, narrow peak, whereas a larger σ\sigma creates a flatter, wider curve.

Coefficient of Variation (C.V.) is a relative measure of dispersion used to compare the variability of two or more sets of data, even if they have different units or means. It is calculated as the ratio of standard deviation to the mean, expressed as a percentage. A distribution with a lower C.V. is considered more 'consistent' or 'stable'.

Short-cut and Step-deviation Methods are techniques used to simplify calculations for Variance and Standard Deviation, especially when dealing with large numbers or class intervals. These methods involve shifting the origin (Assumed Mean) and scaling the data (Class Width), which visually shifts and compresses the data distribution on the x-axis to make the arithmetic easier.

📐Formulae

Mean: xˉ=xin\bar{x} = \frac{\sum x_i}{n} (for raw data) or xˉ=fixiN\bar{x} = \frac{\sum f_i x_i}{N} (for frequency distribution)

Mean Deviation about Mean: M.D.(xˉ)=xixˉnM.D. (\bar{x}) = \frac{\sum |x_i - \bar{x}|}{n}

Mean Deviation about Median: M.D.(M)=xiMnM.D. (M) = \frac{\sum |x_i - M|}{n}

Variance for Raw Data: σ2=(xixˉ)2n=xi2n(xˉ)2\sigma^2 = \frac{\sum (x_i - \bar{x})^2}{n} = \frac{\sum x_i^2}{n} - (\bar{x})^2

Standard Deviation (Discrete Frequency): σ=fi(xixˉ)2N\sigma = \sqrt{\frac{\sum f_i (x_i - \bar{x})^2}{N}}

Standard Deviation (Short-cut method): σ=fidi2N(fidiN)2\sigma = \sqrt{\frac{\sum f_i d_i^2}{N} - (\frac{\sum f_i d_i}{N})^2}, where di=xiAd_i = x_i - A

Standard Deviation (Step-deviation method): σ=h×fiui2N(fiuiN)2\sigma = h \times \sqrt{\frac{\sum f_i u_i^2}{N} - (\frac{\sum f_i u_i}{N})^2}, where ui=xiAhu_i = \frac{x_i - A}{h}

Coefficient of Variation: C.V.=σxˉ×100C.V. = \frac{\sigma}{\bar{x}} \times 100

💡Examples

Problem 1:

Calculate the Mean Deviation about the mean for the following data: 12,3,18,17,4,9,17,19,20,15,8,17,2,3,16,11,3,1,0,512, 3, 18, 17, 4, 9, 17, 19, 20, 15, 8, 17, 2, 3, 16, 11, 3, 1, 0, 5.

Solution:

Step 1: Find the mean (xˉ\bar{x}). n=20n = 20 xi=12+3+18+17+4+9+17+19+20+15+8+17+2+3+16+11+3+1+0+5=200\sum x_i = 12+3+18+17+4+9+17+19+20+15+8+17+2+3+16+11+3+1+0+5 = 200 xˉ=20020=10\bar{x} = \frac{200}{20} = 10

Step 2: Find absolute deviations xixˉ|x_i - \bar{x}|. 1210=2,310=7,1810=8,1710=7,410=6,910=1,1710=7,1910=9,2010=10,1510=5,810=2,1710=7,210=8,310=7,1610=6,1110=1,310=7,110=9,010=10,510=5|12-10|=2, |3-10|=7, |18-10|=8, |17-10|=7, |4-10|=6, |9-10|=1, |17-10|=7, |19-10|=9, |20-10|=10, |15-10|=5, |8-10|=2, |17-10|=7, |2-10|=8, |3-10|=7, |16-10|=6, |11-10|=1, |3-10|=7, |1-10|=9, |0-10|=10, |5-10|=5.

Step 3: Sum the absolute deviations. xixˉ=2+7+8+7+6+1+7+9+10+5+2+7+8+7+6+1+7+9+10+5=120\sum |x_i - \bar{x}| = 2+7+8+7+6+1+7+9+10+5+2+7+8+7+6+1+7+9+10+5 = 120

Step 4: Calculate M.D. M.D.(xˉ)=12020=6M.D.(\bar{x}) = \frac{120}{20} = 6.

Explanation:

To find Mean Deviation, we first determine the central point (mean), calculate how far each point is from that center using absolute values to remove negative signs, and then find the average of those distances.

Problem 2:

Find the variance and standard deviation for the following frequency distribution: x:4,8,11,17,20,24,32x: 4, 8, 11, 17, 20, 24, 32 f:3,5,9,5,4,3,1f: 3, 5, 9, 5, 4, 3, 1

Solution:

Step 1: Create a table for calculations where N=fi=30N = \sum f_i = 30.

xix_ifif_ifixif_i x_ixi2x_i^2fixi2f_i x_i^2
43121648
854064320
119991211089
175852891445
204804001600
243725761728
3213210241024
Total304206254

Step 2: Find the mean xˉ\bar{x}. xˉ=fixiN=42030=14\bar{x} = \frac{\sum f_i x_i}{N} = \frac{420}{30} = 14

Step 3: Calculate Variance (σ2\sigma^2). σ2=fixi2N(xˉ)2=625430(14)2\sigma^2 = \frac{\sum f_i x_i^2}{N} - (\bar{x})^2 = \frac{6254}{30} - (14)^2 σ2=208.47196=12.47\sigma^2 = 208.47 - 196 = 12.47

Step 4: Calculate Standard Deviation (σ\sigma). σ=12.473.53\sigma = \sqrt{12.47} \approx 3.53.

Explanation:

This solution uses the computational formula for variance. We calculate the sum of frequencies (NN), the sum of products (fixif_i x_i) to find the mean, and the sum of fixi2f_i x_i^2 to find the raw second moment. Variance is the difference between the average of the squares and the square of the average.