These days I am preparing a three-hour course of statistics for particle physicists which I will give at a winter school in a couple of months. This stimulating task forces me to find nice and simple examples of good and bad applications of basic statistics. Stuff with high didactical value, and hopefully also entertaining.
For today, I am happy with a simple illustration of why to be a physicist you need to know basic Statistics. The example is of course based on a real analysis in particle physics. It is based on a claim made in 1969 by McCusker and Cairns that they had observed the track of a special charged particle in a bubble chamber exposed to energetic air showers. The track appeared to ionize the gas of the chamber less than half as much as what it should have: 110 droplets along a unit-length path instead of 229.
Ionization is (well, was) measured in bubble chambers by counting with a microscope the number of droplets along the particle path. These droplets form by condensation around the points where the incident particle scatters with the gas of the detector.
The figure on the right, taken from PRL 23 (1969), page 658, shows a bunch of parallel tracks caused by a shower of charged particles from a energetic cosmic ray. Among the tracks there is one which is much fainter than the others (I leave you to guess which one it is): that is the one which McCuster and Cairns claimed to be due to a fractionally charged particle.
The question one must answer, in order to start fiddling with the idea that the track in question is due to a free quark or some other exotic thing produced in the very high-energy interaction in the atmosphere, is how likely it is that one observed, among 55,000 tracks, one track with 110 or fewer droplets per unit path length, when the average expected number is 229 (the latter determined by studying the total sample of tracks).
I can hear some of you radiating confidence as you think: "Ah! A simple problem in Statistics... The number of droplets is a Poisson variable, since the Poisson distribution describes the probability of finding n events in a given time, if these occur independently from a constant rate process". Very good then: what you have in mind is the formula
P(n) = [mu^n e^(-mu)]/n!
where mu is the average number in the given time (or given path length, in our case). If for a track we observe n=110 when mu=229, we may ask "What is the probability that a track produces less than 111 droplets, if the expected number of droplets is 229?". The required probability is the sum of P(n) with n running from 0 to 110, and the result is P(n<=110) = 1.6 x 10^(-18).
Okay, then if we have 55,000 of these, the total probability to observe one or more of these is P(>=1 weird track in 55000) = 1-[1-P(n<=110)]^(55000), which is in the whereabouts of 10^-13. One in ten thousand billions! That must surely be a fractional-charge particle ! We are surpassing the five-sigma "observation-level" significance head, shoulders, and tail here.
Hmmm, not so fast.
If you are a good physicist, you know that a single scattering of a charged particle off a nucleus in the vapour of the chamber produces more than a single droplet. In fact, this number is four, on average. The droplet production is itself a Poisson process. Fine, so we have two separate Poisson processes - particle scattering, and droplet formation. Does it change the picture ? No, if you are a good physicist who does not know squat about Statistics.
If you have at least a good hunch of basic Statistics, however, you know that what you are describing is not a simple Poisson process, but a compound Poisson process. And the two are rather different things. The fact that scatterings yield an average of four droplets in fact dramatically increases the likelihood of tracks with small droplet multiplicity: by using the correct distribution function of the compound Poisson (where now mu=4, lambda*mu=229, and N is the number of scatterings):
If we then ask what is the chance to have seen at least one such low-ionization track in the 55,000 sample, the probability is now 1-(1-P')^(55000), which is a striking 92.5% ! Ooooops! We should have rather been surprised to NOT have observed such a low-ionization track!
The summary of this example is simple to spell: You may know your detector and the underlying physics as well as you know your ***, but if you do not know basic Statistics you are going to be fooled !
The McCusker and Cairns PRL article soon received its due rebuttal [R.Adair and H.Kasha, PRL 23, 1355 (1969)]. For an evidence of quarks, one would have to wait five more years...
- PHYSICAL SCIENCES
- EARTH SCIENCES
- LIFE SCIENCES
- SOCIAL SCIENCES
Subscribe to the newsletter
Stay in touch with the scientific world!
Know Science And Want To Write?
- How Gut Bacteria Ensure A Healthy Brain – and Could Play A Role In Treating Depression
- Researchers Created A Laser Bullet To See What It Would Look Like - And Here It Is
- The Strange Organic Molecules In Titan's Atmosphere
- We're Too Late To Prevent 137,000 More Ebola Cases, Says Epidemiology Paper
- The Quote Of The Week - Shocked And Disappointed
- It Takes More Than Singing To Strike A Chord In Music Education
- Type 1 Diabetes Surges In White Kids
- "On the luminosity leveling : my limited understanding is that at hl-lhc, the beam lifetime is completely..."
- "The “non-negotiable mathematical reasons” for raising the height of flood defences are in a..."
- "For background, look at Paul Bloom’s 2004 book Descartes’ Baby. This has interesting material..."
- " It is possible that survival of Ebola virus victims would be much improved if an artificial fever..."
- "Priceless! I really needed a kick in the pants to get me to laugh at myself and this post did it..."
- Why Chobani reversed course, to make yoghurt only from milk from cows not fed GMO grain
- Evolution is sometimes messy or even outright ridiculous
- Rich and famous embrace Vandana Shiva, wealthy self-described savior of the poor
- US Ebola hysteria and money pit highlight lack of resources to confront diseases that kill far more people
- Addiction can be measured by epigenetics
- Coffee grounds turned biofuel can heat your home
- Global boom in hydropower expected this decade
- For brain hemorrhage, risk of death is lower at high-volume hospitals
- Roman-Britons had less gum disease than modern Britons
- 'Swingers' multiple drug use heightens risk of sexually transmitted diseases
- Were clinical trial practices in East Germany questionable?