krit.club logo

Data Handling - Collection and Organisation of Data

Grade 7CBSE

Review the key concepts, formulae, and examples before starting your quiz.

🔑Concepts

•

Data Collection: Data is a collection of numbers gathered to give some information. Before collecting data, we must clearly define the purpose of the study. For example, to find the average height of students, we specifically collect height measurements in centimeters.

•

Organization of Data: Raw data is usually unorganized and hard to understand. To make sense of it, we use a Frequency Distribution Table. This table consists of columns for the observation, Tally Marks, and Frequency. Tally marks are vertical lines ∣| for numbers 11 to 44, and the fifth mark is a diagonal line across the first four ∣∣∣∣\cancel{||||} to represent a group of five.

•

Arithmetic Mean: The mean (or average) is the most common representative value of a data set. It is the central value that represents the whole group. Visually, it can be thought of as the 'leveling out' point of all numerical values in the set.

•

Range: The range is the measure of the spread of the data. It is calculated by subtracting the lowest observation from the highest observation. A large range indicates that the data points are spread widely apart, while a small range indicates they are clustered closely.

•

Mode: The mode is the value that occurs most frequently in the data set. In a frequency distribution table, the observation with the highest frequency is the mode. Visually, on a bar graph, the mode is represented by the tallest bar.

•

Median: The median refers to the value which lies in the middle of the data when the observations are arranged in increasing or decreasing order. Exactly half of the data points lie above the median, and half lie below it.

•

Bar Graphs: A bar graph is a visual representation of data using rectangular bars of uniform width. The height of each bar is proportional to the frequency it represents. In a bar graph, bars are drawn with equal spacing between them on the horizontal axis (X-axis), while the numerical values are plotted on the vertical axis (Y-axis).

📐Formulae

Arithmetic Mean=Sum of all observationsTotal number of observations\text{Arithmetic Mean} = \frac{\text{Sum of all observations}}{\text{Total number of observations}}

Range=Highest Observation−Lowest Observation\text{Range} = \text{Highest Observation} - \text{Lowest Observation}

Median (for odd number of observations n)=Value of the (n+12)th observation\text{Median (for odd number of observations } n) = \text{Value of the } (\frac{n+1}{2})^{th} \text{ observation}

💡Examples

Problem 1:

The weights (in kg) of 7 students are: 42,38,45,42,50,35,4242, 38, 45, 42, 50, 35, 42. Find the Mean and Range of this data.

Solution:

Step 1: Calculate the Sum of observations. Sum=42+38+45+42+50+35+42=294\text{Sum} = 42 + 38 + 45 + 42 + 50 + 35 + 42 = 294. \nStep 2: Count the total number of observations. n=7n = 7. \nStep 3: Apply the Mean formula. Mean=2947=42\text{Mean} = \frac{294}{7} = 42. \nStep 4: Find the Range. Highest value=50\text{Highest value} = 50, Lowest value=35\text{Lowest value} = 35. Range=50−35=15\text{Range} = 50 - 35 = 15.

Explanation:

To find the mean, we average the total weight across all students. The range shows the difference between the heaviest and lightest student.

Problem 2:

Find the Mode and Median of the following data: 13,16,12,14,19,12,14,13,1413, 16, 12, 14, 19, 12, 14, 13, 14.

Solution:

Step 1: Arrange data in ascending order: 12,12,13,13,14,14,14,16,1912, 12, 13, 13, 14, 14, 14, 16, 19. \nStep 2: Find the Mode. The value 1414 appears 3 times, which is more frequent than any other value. So, Mode=14\text{Mode} = 14. \nStep 3: Find the Median. There are n=9n = 9 observations (odd). \nStep 4: Calculate the position: (9+12)th=5th(\frac{9+1}{2})^{th} = 5^{th} position. \nStep 5: Identify the 5th5^{th} value in the sorted list. The 5th5^{th} value is 1414. So, Median=14\text{Median} = 14.

Explanation:

The data is first sorted to easily identify the middle value and count frequencies. Here, both the most frequent value (Mode) and the middle value (Median) happen to be the same.