๐Ÿ“– What Is Two-Variable Data?

Two-variable data examines the relationship between two variables. We use scatterplots and lines of best fit to analyze these relationships.

Scatterplot
A graph showing the relationship between two variables. Each point represents one data pair.
Positive Correlation
As x increases, y increases.
๐Ÿ“ˆ Height โ†’ Weight
Negative Correlation
As x increases, y decreases.
๐Ÿ“‰ Study Time โ†’ Errors
No Correlation
No clear relationship between variables.
๐Ÿ“Š Shoe Size โ†’ IQ

1. Scatterplots

๐Ÿ“Œ RULE: Reading Scatterplots
Step 1: Look at the axes โ€” what do x and y represent?
Step 2: Look at the overall pattern โ€” is there a trend?
Step 3: Identify the direction (positive/negative) and strength (strong/weak).
Step 4: Look for outliers (points far from the pattern).

2. Correlation

๐Ÿ“Œ RULE: Types of Correlation
Positive Correlation: Points go up from left to right.
\(x \uparrow \rightarrow y \uparrow\)

Negative Correlation: Points go down from left to right.
\(x \uparrow \rightarrow y \downarrow\)

No Correlation: Points are scattered randomly.

Strong vs. Weak: The closer the points are to a straight line, the stronger the correlation.
๐Ÿ’ก Strategy โ€” Correlation

1. Look for the overall direction (up or down).
2. Check how tightly the points cluster around a line.
3. Correlation does NOT imply causation!

3. Line of Best Fit

๐Ÿ“Œ RULE: Line of Best Fit
The line of best fit (or trend line) is a straight line that best represents the data on a scatterplot.

\[ y = mx + b \]

Use the line to make predictions. Substitute the x-value into the equation to find the predicted y-value.
๐Ÿ“Œ RULE: Residuals
A residual is the difference between the actual y-value and the predicted y-value from the line of best fit.

\[ \text{Residual} = \text{Actual} - \text{Predicted} \]

Positive residual: Actual is above the line.
Negative residual: Actual is below the line.
๐Ÿ“Œ Key Concept

The line of best fit minimizes the sum of the squared residuals. You don't need to calculate the line yourself, but you need to interpret it.

๐Ÿ“ Solved Examples

Study these examples carefully. Each shows the step-by-step solution process.

๐Ÿ“ SOLVED EXAMPLE 1 โ€” Reading a Scatterplot
A scatterplot shows the relationship between hours studied and test scores. As study hours increase, test scores also increase. What type of correlation is this?
Step 1: As x (study hours) increases, y (test scores) increases
Step 2: This is a positive correlation
โœ… Positive correlation
๐Ÿ’ก Tip: More study time โ†’ higher scores โ†’ positive relationship.
๐Ÿ“ SOLVED EXAMPLE 2 โ€” Line of Best Fit
A line of best fit is \(y = 2x + 10\). What is the predicted y-value when \(x = 5\)?
Step 1: Substitute \(x = 5\) into the equation
Step 2: \(y = 2(5) + 10 = 10 + 10 = \color{var(--math)}{20}\)
โœ… \(y = 20\)
๐Ÿ’ก Tip: Use the line of best fit equation to predict y for any given x.
๐Ÿ“ SOLVED EXAMPLE 3 โ€” Residual
A line of best fit predicts \(y = 18\) for a data point. The actual value is \(21\). What is the residual?
Step 1: Residual = Actual โˆ’ Predicted
Step 2: Residual = \(21 โˆ’ 18 = \color{var(--math)}{3}\)
โœ… Residual = 3 (point is above the line)
๐Ÿ’ก Tip: Positive residual = actual is above the line.
๐Ÿ“ SOLVED EXAMPLE 4 โ€” Interpreting Slope
A line of best fit is \(y = 1.5x + 20\) where x = hours studied and y = test score. What does the slope 1.5 mean?
Step 1: Slope = 1.5 means for every 1 hour increase in study time...
Step 2: The test score increases by 1.5 points
โœ… Each additional hour of study increases the predicted test score by 1.5 points
๐Ÿ’ก Tip: Slope = rate of change. For each unit increase in x, y changes by the slope.
๐Ÿ“ SOLVED EXAMPLE 5 โ€” Interpreting y-Intercept
A line of best fit is \(y = 1.5x + 20\). What does the y-intercept 20 mean?
Step 1: y-intercept = 20 is the predicted value when \(x = 0\)
Step 2: When \(x = 0\) hours studied, the predicted test score is 20
โœ… The predicted test score with 0 hours of study is 20
๐Ÿ’ก Tip: y-intercept is the starting value when x = 0.

๐Ÿงช Practice Questions

Solve each problem using the rules above. Click "Show Answer" to see the full solution.

Question 1
A scatterplot shows points going up from left to right. What type of correlation is this?
A) Negative correlation
B) Positive correlation
C) No correlation
D) Strong correlation
โœ“ Answer: B
Points going up from left to right indicate a positive correlation.
๐Ÿ“ Solution: Positive correlation
Question 2
A line of best fit is \(y = 3x + 5\). What is the predicted y-value when \(x = 4\)?
A) 12
B) 15
C) 17
D) 20
โœ“ Answer: C
\(y = 3(4) + 5 = 12 + 5 = 17\)
๐Ÿ“ Solution: \(3(4) + 5 = 17\)
Question 3
A line of best fit predicts \(y = 25\) for a data point. The actual value is 28. What is the residual?
A) -3
B) 0
C) 2
D) 3
โœ“ Answer: D
Residual = Actual - Predicted = 28 - 25 = 3
๐Ÿ“ Solution: \(28 - 25 = 3\)
Question 4
A line of best fit is \(y = 2.5x + 10\), where x = hours worked and y = money earned. What does the slope 2.5 mean?
A) The starting pay is $10
B) Each hour worked earns $10
C) Each hour worked earns $2.50
D) You need to work 2.5 hours
โœ“ Answer: C
Slope = 2.5 means for each additional hour worked, money earned increases by $2.50.
๐Ÿ“ Solution: Slope = rate of change = $2.50 per hour.
Question 5
A scatterplot shows points going down from left to right. What type of correlation is this?
A) Negative correlation
B) Positive correlation
C) No correlation
D) Strong correlation
โœ“ Answer: A
Points going down from left to right indicate a negative correlation.
๐Ÿ“ Solution: Negative correlation
Question 6
A line of best fit is \(y = 0.8x + 15\). What is the predicted y-value when \(x = 10\)?
A) 15
B) 18
C) 20
D) 23
โœ“ Answer: D
\(y = 0.8(10) + 15 = 8 + 15 = 23\)
๐Ÿ“ Solution: \(0.8(10) + 15 = 23\)
Question 7
A line of best fit is \(y = 4x - 2\), where x = number of items and y = total cost. What is the predicted total cost for 5 items?
A) 18
B) 18
C) 20
D) 22
โœ“ Answer: B
\(y = 4(5) - 2 = 20 - 2 = 18\)
๐Ÿ“ Solution: \(4(5) - 2 = 18\)
Question 8
A scatterplot shows points that are randomly scattered with no clear pattern. What type of correlation is this?
A) No correlation
B) Positive correlation
C) Negative correlation
D) Strong correlation
โœ“ Answer: A
Randomly scattered points with no clear pattern indicate no correlation.
๐Ÿ“ Solution: No correlation
Question 9
A line of best fit predicts \(y = 30\) for a data point. The actual value is 27. What is the residual?
A) -3
B) 0
C) 3
D) 30
โœ“ Answer: A
Residual = Actual - Predicted = 27 - 30 = -3 (point is below the line)
๐Ÿ“ Solution: \(27 - 30 = -3\)
Question 10
A line of best fit is \(y = 1.2x + 8\), where x = years of experience and y = salary in thousands. What does the y-intercept 8 mean?
A) Each year of experience adds $8,000
B) The salary increases by 8% each year
C) The predicted salary with 0 years experience is $8,000
D) The salary is always $8,000
โœ“ Answer: C
y-intercept = 8 means when x = 0 (0 years experience), the predicted salary is 8 (thousand) = $8,000.
๐Ÿ“ Solution: y-intercept = predicted value when x = 0.
๐ŸŽ‰ WELL DONE!

You've completed the Two-Variable Data lesson. You now know how to read scatterplots, identify correlation, use lines of best fit, and interpret residuals.

โ† Back to Topic List ๐Ÿ“– Previous: One-Variable Data ๐Ÿ“– Next: Probability & Conditional Probability โ†’