Learning Materials

Structured explanations, one concept at a time.

Correlation and Causality

When analysing data, it is important to understand that correlation does not imply causality. This means that even if two variables appear to be related, one does not necessarily cause the other.

 

 

What Correlation Means

Correlation describes a relationship between two variables.

 

A scatter diagram may show:
• positive correlation
• negative correlation
• little or no correlation

 

Correlation only tells us that variables change together in some way. It does not explain why this happens.

 

 

What Causality Means

Causality means that one variable directly causes a change in another variable.

 

To claim causality, there must be strong evidence that:
• one variable affects the other
• no other factors are responsible for the change

 

This level of evidence cannot be obtained from a scatter diagram alone.

 

 

Why Correlation Does Not Prove Causation

Two variables may be correlated because:
• one causes the other
• the second causes the first
• a third variable affects both
• the relationship is coincidental

 

For example, ice cream sales and the number of people swimming may show a positive correlation. This does not mean ice cream sales cause people to swim. A third factor, such as hot weather, affects both.

 

An apparent link may be explained by another factor

 

 

Using Scatter Diagrams Carefully

Scatter diagrams are useful for:
• identifying relationships
• suggesting patterns
• supporting or rejecting hypotheses

 

They are not sufficient for:
• proving cause and effect
• making strong claims about reasons

 

Any conclusion should be cautious and based on what the data actually shows.

 

 

Drawing Sensible Conclusions

When interpreting correlation:
• describe the type of correlation
• avoid stating that one variable causes the other
• consider other possible explanations
• use careful language

 

Good conclusions refer to association, not causation.

 

 

Common Errors to Avoid

Common mistakes include:
• saying one variable causes another based only on correlation
• ignoring possible hidden variables
• over interpreting patterns

 

Always question whether another explanation could exist.

 

 

Key Points to Remember

Correlation shows association, not cause.
Causality requires stronger evidence than a scatter diagram.
A third variable may affect both variables.
Scatter diagrams suggest relationships but do not explain them.
Conclusions must be cautious and evidence based.

 

Understanding that correlation does not imply causality is essential for interpreting data responsibly and avoiding misleading conclusions in statistics.