Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Statistics is the branch of mathematics dealing with the collection, organization, analysis, interpretation, and presentation of data. Data can be categorized as Primary Data (collected directly by the investigator for a specific purpose) or Secondary Data (collected from an existing source like newspapers or the internet).
Raw data is an unorganized collection of observations. To make it meaningful, it is arranged in ascending or descending order (an Array) or presented in a Frequency Distribution Table. Visually, this table uses tally marks where four vertical lines are crossed by a fifth diagonal line to represent a count of five, making it easy to read frequencies.
Grouped Frequency Distribution is used for large data sets by grouping values into 'Class Intervals' such as . The lower value is the Lower Class Limit and the higher value is the Upper Class Limit. In 'Exclusive' or continuous form, the upper limit of one class is the lower limit of the next; if a value equals the upper limit, it is visually placed in the next higher class interval.
A Bar Graph is a visual representation consisting of vertical or horizontal rectangular bars of uniform width with equal spacing between them. The height of each bar represents the frequency of the specific category, making it easy to compare different discrete variables at a glance.
A Histogram is a graphical representation of a continuous grouped frequency distribution. Unlike bar graphs, there are no gaps between the rectangular bars. The class intervals are plotted on the horizontal -axis and frequencies on the vertical -axis. Visually, if the first class doesn't start at zero, a 'kink' or 'zigzag' line is drawn on the -axis to show the break in the scale.
A Frequency Polygon is a visual alternative to a histogram. It is formed by joining the mid-points (class marks) of the tops of the rectangles in a histogram with straight lines. To complete the polygon, it is extended to the -axis by joining it to the mid-points of imaginary classes with zero frequency at both ends, creating a closed geometric shape.
Range is the simplest measure of dispersion, representing the spread of data. Visually, it is the distance on a number line from the smallest observation to the largest observation in the data set.
📐Formulae
💡Examples
Problem 1:
The marks obtained by 10 students in a mathematics test are: . Construct a frequency distribution table and find the range.
Solution:
- Arrange the data in ascending order: . \n2. Count the frequency of each observation: \n- : times \n- : times \n- : times \n- : time \n- : time \n3. Create the table with columns 'Marks' and 'Frequency'. \n4. Calculate Range: \n \n
Explanation:
To organize raw data, we first count how many times each value appears (frequency). The range shows the difference between the highest and lowest scores.
Problem 2:
In a grouped frequency distribution, the class intervals are . Find the class mark for the interval and the class size.
Solution:
- To find the Class Mark (): \n \n \n2. To find the Class Size (): \n \n
Explanation:
The class mark is the central value of a class interval, used for plotting frequency polygons. The class size is the uniform width of the intervals.