Correlation and Causality
When analysing data, it is important to understand that correlation does not imply causality. This means that even if two variables appear to be related, one does not necessarily cause the other.
What Correlation Means
Correlation describes a relationship between two variables.
A scatter diagram may show:
• positive correlation
• negative correlation
• little or no correlation
Correlation only tells us that variables change together in some way. It does not explain why this happens.
What Causality Means
Causality means that one variable directly causes a change in another variable.
To claim causality, there must be strong evidence that:
• one variable affects the other
• no other factors are responsible for the change
This level of evidence cannot be obtained from a scatter diagram alone.
Why Correlation Does Not Prove Causation
Two variables may be correlated because:
• one causes the other
• the second causes the first
• a third variable affects both
• the relationship is coincidental
For example, ice cream sales and the number of people swimming may show a positive correlation. This does not mean ice cream sales cause people to swim. A third factor, such as hot weather, affects both.
An apparent link may be explained by another factor
Using Scatter Diagrams Carefully
Scatter diagrams are useful for:
• identifying relationships
• suggesting patterns
• supporting or rejecting hypotheses
They are not sufficient for:
• proving cause and effect
• making strong claims about reasons
Any conclusion should be cautious and based on what the data actually shows.
Drawing Sensible Conclusions
When interpreting correlation:
• describe the type of correlation
• avoid stating that one variable causes the other
• consider other possible explanations
• use careful language
Good conclusions refer to association, not causation.
Common Errors to Avoid
Common mistakes include:
• saying one variable causes another based only on correlation
• ignoring possible hidden variables
• over interpreting patterns
Always question whether another explanation could exist.
Key Points to Remember
Correlation shows association, not cause.
Causality requires stronger evidence than a scatter diagram.
A third variable may affect both variables.
Scatter diagrams suggest relationships but do not explain them.
Conclusions must be cautious and evidence based.
Understanding that correlation does not imply causality is essential for interpreting data responsibly and avoiding misleading conclusions in statistics.