More on Correlation

Just in case you need more convincing that correlation is not causation, spend some time browsing the Spurious Correlation Generator.

Every day it finds a new (but almost invariably bogus) statistical correlation between two unrelated data sets. Today, for example, we learn that U.S. spending on science and technology is very strongly correlated (0.992) with suicides by hanging.

In the past we’ve learned that the divorce rate in Maine is correlated with per-capita margarine consumption, and that the number of beehives is inversely correlated with arrests for juvenile pot possession.

These correlations are completely bogus, of course. The point is to illustrate that if you compare enough different data sets, you will find lots of spurious statistical relationships. With computers and big data, it’s trivial to generate thousands of correlations with very high statistical significance which also happen to be utterly meaningless.
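
To see how easily this happens, here is a minimal sketch in Python. It is not how the Spurious Correlation Generator works (the post doesn't say); it simply assumes 200 made-up series of 10 points each, built from pure random noise, and then searches every pair for the strongest correlation. The names `random_walk` and `strongest_pair`, the series count, the series length, and the seed are all illustrative choices.

```python
import itertools
import math
import random


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def random_walk(rng, length):
    """A series with no real signal: a running sum of Gaussian noise."""
    value, series = 0.0, []
    for _ in range(length):
        value += rng.gauss(0, 1)
        series.append(value)
    return series


def strongest_pair(num_series=200, length=10, seed=0):
    """Return (|r|, i, j) for the most correlated pair of unrelated series.

    All parameters are assumptions made for this illustration.
    """
    rng = random.Random(seed)
    data = [random_walk(rng, length) for _ in range(num_series)]
    # Brute-force search over every distinct pair of series.
    return max(
        (abs(pearson(data[i], data[j])), i, j)
        for i, j in itertools.combinations(range(num_series), 2)
    )


if __name__ == "__main__":
    r, i, j = strongest_pair()
    print(f"Strongest pair: series {i} vs {j}, |r| = {r:.3f}")
```

With 200 series there are 19,900 pairs to choose from, so the best match tends to look impressively strong even though every series is noise. Short, trending series (like annual figures over a decade) make the effect worse, which is why the sketch uses random walks rather than independent draws.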

Republished with author's permission from original post.

Peter Leppik
Peter U. Leppik is president and CEO of Vocalabs. He founded Vocal Laboratories Inc. in 2001 to apply scientific principles of data collection and analysis to the problem of improving customer service. Leppik has led efforts to measure, compare and publish customer service quality through third party, independent research. At Vocalabs, Leppik has assembled a team of professionals with deep expertise in survey methodology, data communications and data visualization to provide clients with best-in-class tools for improving customer service through real-time customer feedback.
