1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University LBL, January 2008.

Slides:



Advertisements
Similar presentations
TWO STEP EQUATIONS 1. SOLVE FOR X 2. DO THE ADDITION STEP FIRST
Advertisements

You have been given a mission and a code. Use the code to complete the mission and you will save the world from obliteration…
Fundamentals of Probability
Bayes rule, priors and maximum a posteriori
Chapter 1 The Study of Body Function Image PowerPoint
1 Copyright © 2010, Elsevier Inc. All rights Reserved Fig 2.1 Chapter 2.
By D. Fisher Geometric Transformations. Reflection, Rotation, or Translation 1.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination.
Thursday, March 7 Duality 2 – The dual problem, in general – illustrating duality with 2-person 0-sum game theory Handouts: Lecture Notes.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
Title Subtitle.
Jeopardy Q 1 Q 6 Q 11 Q 16 Q 21 Q 2 Q 7 Q 12 Q 17 Q 22 Q 3 Q 8 Q 13
0 - 0.
2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt 2 pt 3 pt 4 pt 5 pt 1 pt ShapesPatterns Counting Number.
DIVIDING INTEGERS 1. IF THE SIGNS ARE THE SAME THE ANSWER IS POSITIVE 2. IF THE SIGNS ARE DIFFERENT THE ANSWER IS NEGATIVE.
FACTORING ax2 + bx + c Think “unfoil” Work down, Show all steps.
Addition Facts
Year 6 mental test 5 second questions
Lecture 2 ANALYSIS OF VARIANCE: AN INTRODUCTION
1 Contact details Colin Gray Room S16 (occasionally) address: Telephone: (27) 2233 Dont hesitate to get in touch.
ZMQS ZMQS
Richmond House, Liverpool (1) 26 th January 2004.
ABC Technology Project
1 Undirected Breadth First Search F A BCG DE H 2 F A BCG DE H Queue: A get Undiscovered Fringe Finished Active 0 distance from A visit(A)
VOORBLAD.
1 Breadth First Search s s Undiscovered Discovered Finished Queue: s Top of queue 2 1 Shortest path from s.
BIOLOGY AUGUST 2013 OPENING ASSIGNMENTS. AUGUST 7, 2013  Question goes here!
Squares and Square Root WALK. Solve each problem REVIEW:
© 2012 National Heart Foundation of Australia. Slide 2.
Understanding Generalist Practice, 5e, Kirst-Ashman/Hull
Chapter 5 Test Review Sections 5-1 through 5-4.
GG Consulting, LLC I-SUITE. Source: TEA SHARS Frequently asked questions 2.
Addition 1’s to 20.
Model and Relationships 6 M 1 M M M M M M M M M M M M M M M M
25 seconds left…...
Januar MDMDFSSMDMDFSSS
Week 1.
1. 2 No lecture on Wed February 8th Thursday 9 th Feb Friday 27 th Jan Friday 10 th Feb Thursday 14: :00 Friday 16:00 – 19:00 HS N.
DO’S AND DONT’S WITH LIKELIHOODS Louis Lyons Oxford (CDF)
We will resume in: 25 Minutes.
CSE 473/573 Computer Vision and Image Processing (CVIP) Ifeoma Nwogu Lecture 27 – Overview of probability concepts 1.
©Brooks/Cole, 2001 Chapter 12 Derived Types-- Enumerated, Structure and Union.
PSSA Preparation.
IP, IST, José Bioucas, Probability The mathematical language to quantify uncertainty  Observation mechanism:  Priors:  Parameters Role in inverse.
1 PART 1 ILLUSTRATION OF DOCUMENTS  Brief introduction to the documents contained in the envelope  Detailed clarification of the documents content.
How Cells Obtain Energy from Food
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Chapter 14 From Randomness to Probability.
Basics of Statistical Estimation
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Latin American School March 2015.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University Heidelberg April 2013.
1 BAYES versus FREQUENTISM The Return of an Old Controversy The ideologies, with examples Upper limits Systematics Louis Lyons, Oxford University and CERN.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University CERN, October 2006 Lecture 4.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University SLAC, January 2007.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Vienna May 2011.
BAYES and FREQUENTISM: The Return of an Old Controversy 1 Louis Lyons Imperial College and Oxford University CERN Summer Students July 2014.
1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Imperial College and Oxford University Amsterdam Oct 2009.
BAYES and FREQUENTISM: The Return of an Old Controversy
What is Probability? Bayes and Frequentism
Statistical Issues for Dark Matter Searches
BAYES and FREQUENTISM: The Return of an Old Controversy
Confidence Intervals and Limits
Mathematical Foundations of BME Reza Shadmehr
Presentation transcript:

1 BAYES and FREQUENTISM: The Return of an Old Controversy Louis Lyons Oxford University LBL, January 2008

2 Commercial break Workshop on Multivariate methods for separating Signal from Background Caltech, Monday Feb 11 th Jan 23rd, 29th, 30 th : p-values and discovery Learning to love the error matrix Upper Limits

3 I am here till end of January Room B Tel

4

5

6

7 Topics Who cares? What is probability? Bayesian approach Examples Frequentist approach Systematics Summary

8 It is possible to spend a lifetime analysing data without realising that there are two very different fundamental approaches to statistics: Bayesianism and Frequentism.

9 How can textbooks not even mention Bayes / Frequentism? For simplest case with no constraint onthen at some probability, for both Bayes and Frequentist (but different interpretations) See Bob Cousins “Why isn’t every physicist a Bayesian?” Amer Jrnl Phys 63(1995)398

10 We need to make a statement about Parameters, Given Data The basic difference between the two: Bayesian : Probability (parameter, given data) (an anathema to a Frequentist!) Frequentist : Probability (data, given parameter) (a likelihood function)

11 PROBABILITY MATHEMATICAL Formal Based on Axioms FREQUENTIST Ratio of frequencies as n  infinity Repeated “identical” trials Not applicable to single event or physical constant BAYESIAN Degree of belief Can be applied to single event or physical constant (even though these have unique truth) Varies from person to person *** Quantified by “fair bet”

12 Bayesian versus Classical Bayesian P(A and B) = P(A;B) x P(B) = P(B;A) x P(A) e.g. A = event contains t quark B = event contains W boson or A = I am in LBL B = I am giving a lecture P(A;B) = P(B;A) x P(A) /P(B) Completely uncontroversial, provided….

13 Bayesian posteriorlikelihoodprior Problems: p(param) Has particular value “Degree of belief” Prior What functional form? Coverage Bayes’ Theorem p(param | data) α p(data | param) * p(param)

14 P(parameter) Has specific value “Degree of Belief” Credible interval Prior: What functional form? Uninformative prior: flat? In which variable? Even more problematic with more params Unimportant if “data overshadows prior” Important for limits Subjective or Objective prior?

15

16 Prior

17 Prior = zero in unphysical region

18 Bayes: Specific example Particle decays exponentially: dn/dt = (1/τ) exp(-t/τ) Observe 1 decay at time t 1 : L (τ) = (1/τ) exp(-t 1 /τ) Choose prior π(τ) for τ e.g. constant up to some large τ L Then posterior p(τ) = L (τ) * π(τ) has almost same shape as L (τ) Use p(τ) to choose interval for τ τ in usual way Contrast frequentist method for same situation later.

19 Bayesian posterior  intervals Upper limit Lower limit Central interval Shortest

20 Ilya Narsky, FNAL CLW 2000

21 P (Data;Theory) P (Theory;Data) HIGGS SEARCH at CERN Is data consistent with Standard Model? or with Standard Model + Higgs? End of Sept 2000: Data not very consistent with S.M. Prob (Data ; S.M.) < 1% valid frequentist statement Turned by the press into: Prob (S.M. ; Data) 99% i.e. “It is almost certain that the Higgs has been seen”

22 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3%

23 P (Data;Theory) P (Theory;Data) Theory = male or female Data = pregnant or not pregnant P (pregnant ; female) ~ 3% but P (female ; pregnant) >>>3%

24 Example 1 : Is coin fair ? Toss coin: 5 consecutive tails What is P(unbiased; data) ? i.e. p = ½ Depends on Prior(p) If village priest: prior ~ δ (p = 1/2 ) If stranger in pub: prior ~ 1 for 0 < p <1 (also needs cost function)

25 Example 2 : Particle Identification Try to separate π’s and protons probability (p tag; real p) = 0.95 probability ( π tag; real p) = 0.05 probability (p tag; real π ) = 0.10 probability ( π tag; real π ) = 0.90 Particle gives proton tag. What is it? If proton beam, very likely If general secondary particles, more even If pure π beam, ~ 0 Depends on prior = fraction of protons

26 Hunter and Dog 1)Dog d has 50% probability of being 100 m. of Hunter h 2)Hunter h has 50% probability of being within 100m of Dog d dh x River x =0River x =1 km

27 Given that: a) Dog d has 50% probability of being 100 m. of Hunter, is it true that: b ) Hunter h has 50% probability of being within 100m of Dog d ? Additional information Rivers at zero & 1 km. Hunter cannot cross them. Dog can swim across river - Statement a) still true If dog at –101 m, hunter cannot be within 100m of dog Statement b) untrue

28

29 Classical Approach Neyman “confidence interval” avoids pdf for Uses only P( x; ) Confidence interval : P( contains ) = True for any Varying intervals from ensemble of experiments fixed Gives range of for which observed value was “likely” ( ) Contrast Bayes : Degree of belief = is in

30 μ≥0 No prior for μ

31 Frequentism: Specific example Particle decays exponentially: dn/dt = (1/τ) exp(-t/τ) Observe 1 decay at time t 1 : L (τ) = (1/τ) exp(-t 1 /τ) Construct 68% central interval t =.17τ dn/dt τ t t = 1.8τ t 1 t

32 90% Classical interval for Gaussian σ = 1 μ ≥ 0 e.g. m 2 (ν e )

33 at 90% confidence and known, but random unknown, but fixed Probability statement about and Frequentist Bayesian and known, and fixed unknown, and random Probability/credible statement about

34 Coverage Fraction of intervals containing true value Property of method, not of result Can vary with param Frequentist concept. Built in to Neyman construction Some Bayesians reject idea. Coverage not guaranteed Integer data (Poisson)  discontinuities Ideal coverage plot C μ

35 Classical Intervals Problems Hard to understand e.g. d’Agostini Arbitrary choice of interval Possibility of empty range Nuisance parameters (systematic errors ) Advantages Widely applicable Well defined coverage

36 FELDMAN - COUSINS Wants to avoid empty classical intervals  Uses “ L -ratio ordering principle” to resolve ambiguity about “which 90% region?”  [Neyman + Pearson say L -ratio is best for hypothesis testing] No ‘Flip-Flop’ problem

37 X obs = -2 now gives upper limit

38 Black lines Classical 90% central interval Red dashed: Classical 90% upper limit Flip-flop

39

40 Poisson confidence intervals. Background = 3 Standard Frequentist Feldman - Cousins

41

42

43 Importance of Ordering Rule Neyman construction in 1 parameter 2 measurements An aside: Determination of single parameter p via Acceptable level of Range of parameters given by 1)Values of for which data is likely i.e. p( ) is acceptable or 2) 2) is good 1) Range depends on [“Confidence interval coupled to goodness of fit”]

44 * * * Neyman Construction For given, acceptable (, ) satisfy = Defines cylinder in space Experiment gives interval Range depends on Range and goodness of fit are coupled

45 That was using Probability Ordering Now change to Likelihood Ratio Ordering For,no value of gives very good fit For Neyman Construction at fixed, compare: with where giving Cutting on Likelihood Ratio Ordering gives:

46 Therefore, range of is Constant Width Independent of Confidence Range and Goodness of Fit are completely decoupled

47 Standard Frequentist Pros: Coverage Widely applicable Cons: Hard to understand Small or empty intervals Difficult in many variables (e.g. systematics) Needs ensemble

48 Bayesian Pros: Easy to understand Physical interval Cons: Needs prior Coverage not guaranteed Hard to combine

49 SYSTEMATICS For example Observed for statistical errors Physics parameter we need to know these, probably from other measurements (and/or theory) Uncertainties  error in Some are arguably statistical errors Shift Central Value Bayesian Frequentist Mixed

50 Simplest Method Evaluate using and Move nuisance parameters (one at a time) by their errors  If nuisance parameters are uncorrelated, combine these contributions in quadrature  total systematic Shift Nuisance Parameters

51 Bayesian Without systematics prior With systematics Then integrate over LA and b

52 If = constant and = truncated Gaussian TROUBLE! Upper limit on from Significance from likelihood ratio for and

53 Frequentist Full Method Imagine just 2 parameters and LA and 2 measurements N and M PhysicsNuisance Do Neyman construction in 4-D Use observed N and M, to give Confidence Region for LA and LA 68%

54 Then project onto axis This results in OVERCOVERAGE Aim to get better shaped region, by suitable choice of ordering rule Example: Profile likelihood ordering

55 Full frequentist method hard to apply in several dimensions Used in 3 parameters For example: Neutrino oscillations (CHOOZ) Normalisation of data Use approximate frequentist methods that reduce dimensions to just physics parameters e.g. Profile pdf i.e. Contrast Bayes marginalisation Distinguish “profile ordering” See Giovanni Punzi, PHYSTAT05 page 88

56 Talks at FNAL CONFIDENCE LIMITS WORKSHOP (March 2000) by: Gary Feldman Wolfgang Rolke hep-ph/ version 2 Acceptance uncertainty worse than Background uncertainty Limit of C. Lim. as σ  0 Need to check Coverage Lim σ

57 Method: Mixed Frequentist - Bayesian Bayesian for nuisance parameters and Frequentist to extract range Philosophical/aesthetic problems? Highland and Cousins (Motivation was paradoxical behaviour of Poisson limit when LA not known exactly)

58 Bayesian versus Frequentism Basis of method Bayes Theorem  Posterior probability distribution Uses pdf for data, for fixed parameters Meaning of probability Degree of beliefFrequentist definition Prob of parameters? YesAnathema Needs prior?YesNo Choice of interval? YesYes (except F+C) Data considered Only data you have….+ other possible data Likelihood principle? YesNo Bayesian Frequentist

59 Bayesian versus Frequentism Ensemble of experiment NoYes (but often not explicit) Final statement Posterior probability distribution Parameter values  Data is likely Unphysical/ empty ranges Excluded by priorCan occur SystematicsIntegrate over priorExtend dimensionality of frequentist construction CoverageUnimportantBuilt-in Decision making Yes (uses cost function)Not useful Bayesian Frequentist

60 Bayesianism versus Frequentism “Bayesians address the question everyone is interested in, by using assumptions no-one believes” “Frequentists use impeccable logic to deal with an issue of no interest to anyone”