BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University Heidelberg April 2013.

Slides:

Advertisements

Similar presentations

DO’S AND DONT’S WITH LIKELIHOODS Louis Lyons Oxford (CDF)

Advertisements

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University LBL, January 2008.

27 th March CERN Higgs searches: CL s W. J. Murray RAL.

Practical Statistics for LHC Physicists Bayesian Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 9 April, 2015.

BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Latin American School March 2015.

Psychology 290 Special Topics Study Course: Advanced Meta-analysis April 7, 2014.

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.

1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.

Bayesian inference Gil McVean, Department of Statistics Monday 17 th November 2008.

1 BAYES versus FREQUENTISM The Return of an Old Controversy The ideologies, with examples Upper limits Systematics Louis Lyons, Oxford University and CERN.

1 Do’s and Dont’s with L ikelihoods Louis Lyons Oxford and IC CDF IC, February 2008.

Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.

1 Do’s and Dont’s with L ikelihoods Louis Lyons Oxford CDF Mexico, November 2006.

Statistics for Particle Physics: Intervals Roger Barlow Karlsruhe: 12 October 2009.

Slide 1 Statistics for HEP Roger Barlow Manchester University Lecture 1: Probability.

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University CERN, October 2006 Lecture 4.

1 Practical Statistics for Physicists Dresden March 2010 Louis Lyons Imperial College and Oxford CDF experiment at FNAL CMS expt at LHC

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University SLAC, January 2007.

1 Do’s and Dont’s with L ikelihoods Louis Lyons Oxford CDF CERN, October 2006.

Credibility of confidence intervals Dean Karlen / Carleton University Advanced Statistical Techniques in Particle Physics Durham, March 2002.

G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.

1 Do’s and Dont’s with L ikelihoods Louis Lyons Oxford CDF Manchester, 16 th November 2005.

I The meaning of chance Axiomatization. E Plurbus Unum.

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Vienna May 2011.

G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.

G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.

1 Statistical Inference Problems in High Energy Physics and Astronomy Louis Lyons Particle Physics, Oxford BIRS Workshop Banff.

G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.

Everything You Always Wanted To Know About Limits* Roger Barlow Manchester University YETI06 *But were afraid to ask.

1 Practical Statistics for Physicists Stockholm Lectures Sept 2008 Louis Lyons Imperial College and Oxford CDF experiment at FNAL CMS expt at LHC

Lecture 9: p-value functions and intro to Bayesian thinking Matthew Fox Advanced Epidemiology.

Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.

Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)

1 Probability and Statistics  What is probability?  What is statistics?

Statistical Decision Theory

G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,

Theory of Probability Statistics for Business and Economics.

1 Is there evidence for a peak in this data?. 2 “Observation of an Exotic S=+1 Baryon in Exclusive Photoproduction from the Deuteron” S. Stepanyan et.

G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;

Statistics Lectures: Questions for discussion Louis Lyons Imperial College and Oxford CERN Summer Students July

Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.

Bayesian vs. frequentist inference frequentist: 1) Deductive hypothesis testing of Popper--ruling out alternative explanations Falsification: can prove.

Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.

Frequentistic approaches in physics: Fisher, Neyman-Pearson and beyond Alessandro Palma Dottorato in Fisica XXII ciclo Corso di Probabilità e Incertezza.

1 Is there evidence for a peak in this data?. 2 “Observation of an Exotic S=+1 Baryon in Exclusive Photoproduction from the Deuteron” S. Stepanyan et.

Making sense of randomness

Confidence Intervals First ICFA Instrumentation School/Workshop At Morelia, Mexico, November 18-29, 2002 Harrison B. Prosper Florida State University.

Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #25.

BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014.

Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.

G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.

1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.

G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.

In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.

G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.

G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Amsterdam Oct 2009.

Statistical Issues in Searches for New Physics

BAYES and FREQUENTISM: The Return of an Old Controversy

What is Probability? Bayes and Frequentism

Statistical Issues for Dark Matter Searches

BAYES and FREQUENTISM: The Return of an Old Controversy

Confidence Intervals and Limits

Lecture 4 1 Probability (90 min.)

Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.

Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.

Statistics for HEP Roger Barlow Manchester University

Presentation transcript:

BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University Heidelberg April 2013

2

Topics Who cares? What is probability? Bayesian approach Examples Frequentist approach Summary. Will discuss mainly in context of PARAMETER ESTIMATION. Also important for GOODNESS of FIT and HYPOTHESIS TESTING 3

4 It is possible to spend a lifetime analysing data without realising that there are two very different fundamental approaches to statistics: Bayesianism and Frequentism.

5 How can textbooks not even mention Bayes / Frequentism? For simplest case with no constraint onthen at some probability, for both Bayes and Frequentist (but different interpretations) See Bob Cousins “Why isn’t every physicist a Bayesian?” Amer Jrnl Phys 63(1995)398

6 We need to make a statement about Parameters, Given Data The basic difference between the two: Bayesian : Probability (parameter, given data) (an anathema to a Frequentist!) Frequentist : Probability (data, given parameter) (a likelihood function)

7 PROBABILITY MATHEMATICAL Formal Based on Axioms FREQUENTIST Ratio of frequencies as n  infinity Repeated “identical” trials Not applicable to single event or physical constant BAYESIAN Degree of belief Can be applied to single event or physical constant (even though these have unique truth) Varies from person to person *** Quantified by “fair bet”

8 Bayesian versus Classical Bayesian P(A and B) = P(A;B) x P(B) = P(B;A) x P(A) e.g. A = event contains t quark B = event contains W boson or A = I am in Heidelberg B = I am giving a lecture P(A;B) = P(B;A) x P(A) /P(B) Completely uncontroversial, provided….

9 Bayesian posteriorlikelihoodprior Problems: p(param) Has particular value “Degree of belief” Prior What functional form? Coverage Bayes’ Theorem p(param | data) α p(data | param) * p(param)

10 P(parameter) Has specific value “Degree of Belief” Credible interval Prior: What functional form? Uninformative prior: flat? In which variable? e.g. m, m 2, ln m,….? Even more problematic with more params Unimportant if “data overshadows prior” Important for limits Subjective or Objective prior?

11 Data overshadows prior Mass of Z boson (from LEP)

12 Prior Even more important for UPPER LIMITS

13 Prior = zero in unphysical region Mass-squared of neutrino

Bayes: Specific example Particle decays exponentially: dn/dt = (1/τ) exp(-t/τ) Observe 1 decay at time t 1 : L (τ) = (1/τ) exp(-t 1 /τ) Choose prior π(τ) for τ e.g. constant up to some large τ L Then posterior p(τ) = L (τ) * π(τ) has almost same shape as L (τ) Use p(τ) to choose interval for τ τ in usual way Contrast frequentist method for same situation later. 14

Bayesian posterior  intervals 15 Upper limit Lower limit Central interval Shortest

16 Ilya Narsky, FNAL CLW 2000 Upper Limits from Poisson data Upper Limits important for excluding models

17 P (Data;Theory) P (Theory;Data) HIGGS SEARCH at CERN Is data consistent with Standard Model? or with Standard Model + Higgs? End of Sept 2000: Data not very consistent with S.M. Prob (Data ; S.M.) < 1% valid frequentist statement Turned by the press into: Prob (S.M. ; Data) 99% i.e. “It is almost certain that the Higgs has been seen”

18 P (Data;Theory) P (Theory;Data)

19 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3%

20 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3% but P (female ; pregnant) >>>3%

21 Example 1 : Is coin fair ? Toss coin: 5 consecutive tails What is P(unbiased; data) ? i.e. p = ½ Depends on Prior(p) If village priest: prior ~ δ (p = 1/2 ) If stranger in pub: prior ~ 1 for 0 < p <1 (also needs cost function)

22 Example 2 : Particle Identification Try to separate π’s and protons probability (p tag; real p) = 0.95 probability ( π tag; real p) = 0.05 probability (p tag; real π ) = 0.10 probability ( π tag; real π ) = 0.90 Particle gives proton tag. What is it? If proton beam, very likely If general secondary particles, more even If pure π beam, ~ 0 Depends on prior = fraction of protons

Peasant and Dog 1)Dog d has 50% probability of being 100 m. of Peasant p 2)Peasant p has 50% probability of being within 100m of Dog d ? 23 dp x River x =0River x =1 km

24 Given that: a) Dog d has 50% probability of being 100 m. of Peasant, is it true that: b ) Peasant p has 50% probability of being within 100m of Dog d ? Additional information Rivers at zero & 1 km. Peasant cannot cross them. Dog can swim across river - Statement a) still true If dog at –101 m, Peasant cannot be within 100m of dog Statement b) untrue

25

26 Classical Approach Neyman “confidence interval” avoids pdf for Uses only P( x; ) Confidence interval : P( contains ) = True for any Varying intervals from ensemble of experiments fixed Gives range of for which observed value was “likely” ( ) Contrast Bayes : Degree of belief = is in

27 μ≥0 No prior for μ Classical (Neyman) Confidence Intervals Uses only P(data|theory)

Frequentism: Specific example Particle decays exponentially: dn/dt = (1/τ) exp(-t/τ) Observe 1 decay at time t 1 : L (τ) = (1/τ) exp(-t 1 /τ) Construct 68% central interval t =.17τ dn/dt τ t t = 1.8τ t 1 t 28 68% conf. int. for τ from t 1 /1.8  t 1 /0.17

Bob Cousins, “Why isn’t every physicist a Bayesian?” Amer Jrnl Phys 63(1995) Gaussian example Mass-squared of neutrino

Poisson, n=3 P n = e -µ µ n /n! 30 Frequentist Bayes and Likelihood

Poisson, n=3 P n = e -µ µ n /n! 31 Bayes. Prior =1/µ B and L

32 90% Classical interval for Gaussian σ = 1 μ ≥ 0 e.g. m 2 (ν e ), length of small object Other methods have different behaviour at negative x

33 at 90% confidence and known, but random unknown, but fixed Probability statement about and Frequentist Bayesian and known, and fixed unknown, and random Probability/credible statement about

Coverage Fraction of intervals containing true value Property of method, not of result Can vary with param Frequentist concept. Built in to Neyman construction Some Bayesians reject idea. Coverage not guaranteed Integer data (Poisson)  discontinuities Ideal coverage plot C μ 34

Coverage : L approach (Not frequentist) 35 P(n, μ) = e -μ μ n /n! (Joel Heinrich CDF note 6438) -2 lnλ< 1 λ = P(n,μ)/P(n,μ best ) UNDERCOVERS

Frequentist central intervals, NEVER undercovers (Conservative at both ends) 36

Feldman-Cousins Unified intervals Frequentist, so NEVER undercovers 37

Classical Intervals Problems 38 Hard to understand e.g. d’Agostini Arbitrary choice of interval Possibility of empty range Nuisance parameters (systematic errors ) Advantages Widely applicable Well defined coverage

FELDMAN - COUSINS Wants to avoid empty classical intervals  Uses “ L -ratio ordering principle” to resolve ambiguity about “which 90% region?”  [Neyman + Pearson say L -ratio is best for hypothesis testing] No ‘Flip-Flop’ problem 39

40 X obs = -2 now gives upper limit

41 Black lines Classical 90% central interval Red dashed: Classical 90% upper limit Flip-flop

42

43 Poisson confidence intervals. Background = 3 Standard Frequentist Feldman - Cousins

44

45

46 Importance of Ordering Rule Neyman construction in 1 parameter 2 measurements An aside: Determination of single parameter p via Acceptable level of Range of parameters given by 1)Values of for which data is likely i.e. p( ) is acceptable or 2) 2) is good 1) Range depends on [“Confidence interval coupled to goodness of fit”]

47 * * * Neyman Construction For given, acceptable (, ) satisfy = Defines cylinder in space Experiment gives interval Range depends on Range and goodness of fit are coupled

48 That was using Probability Ordering Now change to Likelihood Ratio Ordering For,no value of gives very good fit For Neyman Construction at fixed, compare: with where giving Cutting on Likelihood Ratio Ordering gives:

49 Therefore, range of is Constant Width Independent of Confidence Range and Goodness of Fit are completely decoupled

50 Standard Frequentist Pros: Coverage Widely applicable Cons: Hard to understand Small or empty intervals Difficult in many variables (e.g. systematics) Needs ensemble

51 Bayesian Pros: Easy to understand Physical interval Cons: Needs prior Coverage not guaranteed Hard to combine

52 Method: Mixed Frequentist - Bayesian Bayesian for nuisance parameters and Frequentist to extract range Philosophical/aesthetic problems? Highland and Cousins (Motivation was paradoxical behaviour of Poisson limit when LA not known exactly)

Basis of method Bayes Theorem  Posterior probability distribution Uses pdf for data, for fixed parameters Meaning of probability Degree of beliefFrequentist definition Prob of parameters? YesAnathema Needs prior?YesNo Choice of interval? YesYes (except F+C) Data considered Only data you have….+ other possible data Likelihood principle? YesNo 53 Bayesian versus Frequentism Bayesian Frequentist

Bayesian versus Frequentism Ensemble of experiment NoYes (but often not explicit) Final statement Posterior probability distribution Parameter values  Data is likely Unphysical/ empty ranges Excluded by priorCan occur SystematicsIntegrate over priorExtend dimensionality of frequentist construction CoverageUnimportantBuilt-in Decision making Yes (uses cost function)Not useful 54 Bayesian Frequentist

55 Bayesianism versus Frequentism “Bayesians address the question everyone is interested in, by using assumptions no-one believes” “Frequentists use impeccable logic to deal with an issue of no interest to anyone”

56 Next time: p-values, Upper Limits, Discoveries Hope: LHC moves us from era of ‘Upper Limits’ to that of even more DISCOVERIES