Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Data and Variables: Data is a collection of observations. In statistics, variables are classified as Discrete (values that can be counted, like the number of children in a family) or Continuous (values that can be measured, like height or temperature, which can take any value within a range).
Frequency and Tally Marks: Frequency refers to the number of times a particular observation occurs in a data set. To organize raw data visually, we use tally marks where vertical bars represent counts of 1 to 4, and the fifth count is shown by a diagonal line crossing the four bars to form a bundle of five.
Class Intervals (Grouped Data): When dealing with large datasets, data is organized into groups called class intervals. There are two types: Exclusive (Continuous) method, where the upper limit of one class is the lower limit of the next (e.g., ), and Inclusive (Discontinuous) method, where the limits are inclusive (e.g., ).
Class Limits and Class Boundaries: In a class interval like , 10 is the Lower Limit and 20 is the Upper Limit. For inclusive classes, we convert limits into boundaries to ensure continuity; this is visualized by closing the gap between the end of one class and the start of the next.
Class Mark (Mid-value): The class mark is the central value of a class interval. It represents the entire class in various calculations and is the point that would be plotted in the middle of a class interval on the horizontal axis of a frequency polygon.
Class Size (Width): The class size is the difference between the actual upper boundary and the lower boundary of a class. In a visual distribution like a histogram, this represents the uniform width of the rectangular bars.
Cumulative Frequency: This is the sum of the frequency of a class and all previous classes. A 'Less than' cumulative frequency table shows the total number of observations below the upper limit of each class, representing a running total that grows towards the total number of observations ().
📐Formulae
💡Examples
Problem 1:
Given the following marks of 15 students: . Prepare a frequency distribution table.
Solution:
- Identify the unique observations: .
- Count the occurrences (frequency) using tally marks:
- For 12: (Frequency = )
- For 15: (Frequency = )
- For 18: (Frequency = )
- For 20: (Frequency = )
- Construct the table: \begin{array}{|c|c|c|} \hline Marks & Tally & Frequency \ \hline 12 & |||| \setminus & 5 \ 15 & |||| & 4 \ 18 & ||| & 3 \ 20 & ||| & 3 \ \hline Total & & 15 \ \hline \end{array}
Explanation:
We list the unique data points in ascending order, use tally marks to count how many times each mark appears, and then write the final counts as frequencies.
Problem 2:
Convert the following inclusive class intervals into exclusive (continuous) class intervals and find the class mark for the first class: .
Solution:
- Find the gap between classes: .
- Calculate the correction factor: .
- Adjust boundaries:
- For : Lower Boundary = ; Upper Boundary = . Result: .
- For : Lower Boundary = ; Upper Boundary = . Result: .
- Find the Class Mark for : .
Explanation:
Inclusive classes have gaps. We calculate a correction factor (usually 0.5) to subtract from lower limits and add to upper limits to make the intervals continuous. The class mark calculation remains the same using original limits or new boundaries.