Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Bivariate Data: Data involving two variables, often represented as coordinates to investigate potential relationships.
Positive Correlation: As the independent variable () increases, the dependent variable () also increases.
Negative Correlation: As the independent variable () increases, the dependent variable () decreases.
Zero/No Correlation: No apparent relationship exists between the variables; points are randomly scattered.
Strength of Correlation: Described as 'Strong' if points lie close to a straight line, or 'Weak' if they are widely spread.
Line of Best Fit: A straight line drawn through the center of the data points, used to model the relationship.
Mean Point: The point representing the average of all and values. The Line of Best Fit must pass through this point.
Interpolation: Estimating a value within the range of the given data set (usually reliable).
Extrapolation: Estimating a value outside the range of the given data set (often unreliable as the trend may not continue).
Correlation vs. Causation: A relationship between two variables does not necessarily mean one causes the other.
📐Formulae
Mean of :
Mean of :
Equation of Line of Best Fit:
Gradient (): (calculated using two points on the line of best fit)
💡Examples
Problem 1:
A student records the number of hours spent studying () and the test scores () for 5 students: (2, 40), (4, 55), (6, 65), (8, 75), (10, 90). (a) Identify the type of correlation. (b) Calculate the mean point .
Solution:
(a) Positive Correlation. (b) . . Mean point = .
Explanation:
Since test scores increase as study hours increase, the correlation is positive. The mean point is found by averaging all values and all values respectively.
Problem 2:
A scatter diagram shows a strong negative correlation between the age of a car ( years) and its value ( dollars). The line of best fit passes through (2, 20000) and (8, 8000). Predict the value of a car that is 5 years old.
Solution:
- Find gradient: . 2. Use with point (2, 20000): . 3. Equation: . 4. For : . Predicted value: $14,000.
Explanation:
We first determine the linear equation that represents the line of best fit and then substitute the target value (age) to find the predicted value (price). Since 5 years is within the range of 2 to 8, this is an example of interpolation.