This is a post about basics. That's because I think a point needs to be made which is surprisingly not as well-known as its elementary nature would have you guess.
Correlation, in its most used version (due to Pearson), is a measure of how closely two quantities are observed to depend linearly on one another. It is very commonly reported in the results of scientific studies, particularly but not exclusively in the social sciences. Researchers try to establish the presence of a correlation between two phenomena as a preliminary step to investigating whether one can be the cause of the other.
There is of course nothing wrong with measuring correlation. The problem arises when interpreting the results. If I see a tight correlation between chocolate consumption in a country and its rate of Nobel prize recipients, should I conclude that eating chocolate makes one smarter? Or should I rather conclude that winning a Nobel prize makes one eat more chocolate?
Jokes aside, the distinction between correlation and causation should be clear to anybody reading this blog. For instance, those arguing that vaccines cause autism on the basis of vague correlation measurements should have a look at the graph above (courtesy Hank's Facebook account), which would have them conclude that organic foods are rather the cause of autism! (But others might conclude that it is parents of autistic children who buy all the organic foods...)
So, the point about correlation and causation is clear. But there is another point to make which I think is not always clear to everybody. The absence of correlation between two variables is a much weaker condition than their independence. We often use "uncorrelated" as a synonym of, and substitute for, "independent", but this is completely wrong from a mathematical standpoint! Two uncorrelated variables may in fact be completely dependent on one another!
Wikipedia has a nice figure to illustrate the point. It is shown below.
As you can see from the bottom set of graphs, you can have many different interdependence patterns between two variables with a zero correlation coefficient. But what the graph does not show is that you can even have an exact functional dependence between two variables (meaning, in the case of organic foods and autism, that if you told me the dollar sales of organic foods I could tell you exactly how many cases of autism are diagnosed that year) and still get a zero correlation coefficient!
Such is, for instance, the case of y=x^2 when x is drawn from [-1,1] with a distribution symmetric about zero (a uniform one, say). This is a perfect parabolic relationship, yet the covariance of x and y is E[x^3] - E[x]E[x^2], and both terms vanish by symmetry. Sets of points drawn at random from the curve will therefore have a correlation coefficient compatible with zero (in the ensemble sense that a fraction p of such sets will have a correlation compatible with zero at confidence level p).
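For readers who want to check the parabola example numerically, here is a minimal Python sketch. It assumes x is drawn uniformly from [-1,1], which is my choice and not something the argument requires beyond symmetry about zero. The helper names pearson_r and parabola_sample, the sample size and the seed are all made up for illustration. The script computes the sample Pearson coefficient between x and y=x^2, and, for contrast, between |x| and y.

```python
import math
import random


def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of products of deviations; the 1/n factors cancel in the ratio.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def parabola_sample(n, seed=0):
    """Draw n points on y = x^2, with x uniform in [-1, 1] (an assumption)."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    ys = [x * x for x in xs]  # y is an exact function of x
    return xs, ys


xs, ys = parabola_sample(50_000)

# Linear correlation between x and y: expected to be close to zero.
r_xy = pearson_r(xs, ys)

# Folding x onto |x| removes the symmetry, and the dependence shows up.
r_absx_y = pearson_r([abs(x) for x in xs], ys)

print(f"r(x, y)   = {r_xy:+.4f}")
print(f"r(|x|, y) = {r_absx_y:+.4f}")
```

The first number should come out close to zero, shrinking further as the sample grows, while the second should be large: the dependence was there all along, but a linear measure applied to x itself cannot see it.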
This means that, while you must be careful about concluding that there is some cause-effect relationship between two observable quantities based on their correlation, you must be even more careful when attempting to conclude that there is independence of the two from the absence of a significant correlation between them!
Please remember this often overlooked fact!