Review the key concepts, formulae, and examples before starting your quiz.
🔑Concepts
Regression Lines and the Point of Intersection: There are two regression lines, the line of on and the line of on . These lines represent the best linear fit for predicting one variable given the other. Visually, these two lines always intersect at the point , which represents the arithmetic means of the two variables.
Nature of Regression Coefficients: The regression coefficients, and , are the slopes of the regression lines of on and on respectively. measures the change in for a unit change in , while measures the change in for a unit change in . On a coordinate plane, the steeper the regression line of on , the larger the absolute value of .
Sign Consistency Property: The correlation coefficient () and both regression coefficients ( and ) must always have the same sign. If is positive, both regression lines slope upwards from left to right; if is negative, both lines slope downwards. It is impossible for one coefficient to be positive and the other negative.
Geometric Mean Property: The correlation coefficient is the geometric mean of the two regression coefficients. This is mathematically expressed as . When calculating , the sign of is determined by the sign of the regression coefficients.
The Magnitude Property: The product of the two regression coefficients cannot exceed 1 (). This implies that if one regression coefficient is greater than 1 in absolute value, the other must be less than 1. This ensures that the correlation coefficient never exceeds 1.
Angle Between the Lines: The angle between the two regression lines indicates the strength of the correlation between the variables. If , the angle is and the lines coincide. If , the angle is and the lines are perpendicular. As moves from 0 toward 1, the visual 'gap' between the two lines closes as they rotate toward each other.
Property of Origin and Scale: Regression coefficients are independent of the change of origin but are dependent on the change of scale. If we transform to and to , the new coefficient is related to the old one by the ratio of the scales: .
📐Formulae
Regression Line of on :
Regression Line of on :
Regression Coefficient
Regression Coefficient
Correlation Coefficient:
Direct Calculation of :
Direct Calculation of :
Covariance Relationship: and
💡Examples
Problem 1:
Given the two regression lines and , find the mean values of and , and the correlation coefficient .
Solution:
-
To find the means , we solve the equations simultaneously: Multiply (ii) by 3: . Subtract (i): . Substitute into (ii): .
-
To find , we first identify the coefficients. Assume (i) is on : . Thus . Assume (ii) is on : . Thus . Check validity: . Since , the assumption is correct. (negative sign because both coefficients are negative).
Explanation:
We use the property that regression lines intersect at the means to find and . To find , we must correctly assign which equation is on such that the product of the slopes is .
Problem 2:
If the regression coefficient of on is , the regression coefficient of on is , and the variance of is , find the variance of .
Solution:
- We are given , , and (so ).
- We know the relationship: and .
- Dividing the two gives: .
- Substitute the values: .
- .
- The variance of is .
Explanation:
This solution utilizes the algebraic relationship between the two regression coefficients and the ratio of the standard deviations of the variables.