G. Cowan Statistical methods for HEP / Freiburg 27-29 June 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 2: Discovery.

Presentation transcript:

G. Cowan Statistical methods for HEP / Freiburg June 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 2: Discovery Vorlesungen des GK Physik an Hadron-Beschleunigern, Freiburg, June, 2011 Glen Cowan Physics Department Royal Holloway, University of London

2 Outline
Day 1: Introduction and basic formalism. Probability, statistical tests, parameter estimation.
Day 2: Discovery. Quantifying discovery significance and sensitivity; systematic uncertainties (nuisance parameters).
Day 3: Exclusion limits. Frequentist and Bayesian intervals/limits.

3 Outline for Day 2
Large-sample statistical formulae for a search at the LHC (Cowan, Cranmer, Gross, Vitells, arXiv: , EPJC 71 (2011) 1-19): significance test using the profile likelihood ratio; systematics included via nuisance parameters; distributions in the large-sample limit, no MC used.
Progress on related issues (some updates from PHYSTAT2011): the "look elsewhere effect"; the "CLs" problem; combining measurements; improving treatment of systematics.

4 A simple example
For each event we measure two variables, $x = (x_1, x_2)$. Suppose that for background events (hypothesis $H_0$) and for a certain signal model (hypothesis $H_1$) they follow the pdfs $f(x|H_0)$ and $f(x|H_1)$ [equations missing], where $x_1, x_2 \ge 0$ and $C$ is a normalization constant.

5 Likelihood ratio as test statistic
In a real-world problem we usually wouldn't have the pdfs $f(x|H_0)$ and $f(x|H_1)$, so we wouldn't be able to evaluate the likelihood ratio for a given observed $x$; hence the need for multivariate methods to approximate this with some other function. But in this example we can find contours of constant likelihood ratio. [Figure: contours of constant likelihood ratio.]

6 Event selection using the LR
Using Monte Carlo, we can find the distribution of the likelihood ratio, or equivalently of a monotonic function of it [expression missing], for signal ($H_1$) and background ($H_0$) events. From the Neyman-Pearson lemma we know that by cutting on this variable we would select a signal sample with the highest signal efficiency (test power) for a given background efficiency.

7 Search for the signal process
But what if the signal process is not known to exist and we want to search for it? The relevant hypotheses are then
$H_0$: all events are of the background type;
$H_1$: the events are a mixture of signal and background.
Rejecting $H_0$ with $Z > 5$ constitutes "discovering" new physics. Suppose that for a given integrated luminosity, the expected number of signal events is $s$ and the expected number of background events is $b$. The observed number of events $n$ will follow a Poisson distribution: $P(n|b) = \frac{b^n}{n!} e^{-b}$ under $H_0$ and $P(n|s+b) = \frac{(s+b)^n}{n!} e^{-(s+b)}$ under $H_1$.

8 Likelihoods for full experiment
We observe $n$ events, and thus measure $n$ instances of $x = (x_1, x_2)$. The likelihood function for the entire experiment assuming the background-only hypothesis ($H_0$) is
$L_b = \frac{b^n}{n!} e^{-b} \prod_{i=1}^{n} f(x_i|b)$,
and for the "signal plus background" hypothesis ($H_1$) it is
$L_{s+b} = \frac{(s+b)^n}{n!} e^{-(s+b)} \prod_{i=1}^{n} \left[ \pi_s f(x_i|s) + \pi_b f(x_i|b) \right]$,
where $\pi_s = s/(s+b)$ and $\pi_b = b/(s+b)$ are the (prior) probabilities for an event to be signal or background, respectively.

9 Likelihood ratio for full experiment
We can define a test statistic $Q$ monotonic in the likelihood ratio as
$Q = \ln \frac{L_{s+b}}{L_b} = -s + \sum_{i=1}^{n} \ln \left( 1 + \frac{s\, f(x_i|s)}{b\, f(x_i|b)} \right)$.
To compute p-values for the $b$ and $s+b$ hypotheses given an observed value of $Q$ we need the distributions $f(Q|b)$ and $f(Q|s+b)$. Note that the term $-s$ in front is a constant and can be dropped. The rest is a sum of contributions, one for each event, and each term in the sum has the same distribution. One can exploit this to relate the distribution of $Q$ to that of the single-event terms using (Fast) Fourier Transforms (Hu and Nielsen, physics/ ).

10 Distribution of Q
Take e.g. $b = 100$, $s = 20$. [Figure: distributions $f(Q|b)$ and $f(Q|s+b)$, with the observed value of $Q$ in a real experiment marked and the p-value of $b$ only and the p-value of $s+b$ shown as the corresponding tail areas.]

11 Systematic uncertainties
Up to now we assumed all parameters were known exactly. In practice they have some (systematic) uncertainty. Suppose e.g. that the uncertainty in the expected number of background events $b$ is characterized by a (Bayesian) pdf $\pi(b)$. Maybe take a Gaussian, i.e.,
$\pi(b) = \frac{1}{\sqrt{2\pi}\,\sigma_b} \exp\left[ -\frac{(b - b_0)^2}{2\sigma_b^2} \right]$,
where $b_0$ is the nominal (measured) value and $\sigma_b$ is the estimated uncertainty. In fact for many systematics a Gaussian pdf is hard to defend; more on this later.

12 Distribution of Q with systematics
To get the desired p-values we need the pdf $f(Q)$, but this depends on $b$, which we don't know exactly. But we can obtain the Bayesian model average:
$f(Q) = \int f(Q|b)\, \pi(b)\, db$.
With Monte Carlo, sample $b$ from $\pi(b)$, then use this to generate $Q$ from $f(Q|b)$, i.e., a new value of $b$ is used to generate the data for every simulation of the experiment. This broadens the distributions of $Q$ and thus increases the p-value (decreases the significance $Z$) for a given $Q_{\rm obs}$.
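The sampling procedure described here can be sketched for a simplified case. The event pdfs needed to build $Q$ are not available in this transcript, so the sketch below assumes a pure counting experiment and uses the observed count $n$ as the test statistic. The function name, the observed count of 130, and the truncation of negative background means at zero are illustrative assumptions, not part of the lecture.

```python
import numpy as np

def p_value_background_only(n_obs, b0, sigma_b, n_toys=200_000, seed=1):
    """MC estimate of P(n >= n_obs) under the background-only hypothesis."""
    rng = np.random.default_rng(seed)
    # Draw a new background mean for every simulated experiment;
    # sigma_b = 0 reproduces the case of an exactly known b.
    b = rng.normal(b0, sigma_b, size=n_toys)
    b = np.clip(b, 0.0, None)  # assumption: truncate unphysical negative means
    n = rng.poisson(b)         # one simulated count per sampled b
    return float(np.mean(n >= n_obs))

# Hypothetical observation of 130 events with nominal background b0 = 100.
p_known = p_value_background_only(130, b0=100.0, sigma_b=0.0)
p_syst = p_value_background_only(130, b0=100.0, sigma_b=10.0)
print(f"p-value, b known exactly: {p_known:.4f}")
print(f"p-value, sigma_b = 10:    {p_syst:.4f}")
```

Averaging over $\pi(b)$ broadens the distribution of the count, so the second p-value should come out larger than the first, in line with the statement that systematics decrease the significance.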

13 Distribution of Q with systematics (2)
For $s = 20$, $b_0 = 100$, $\sigma_b = 10$ this gives: [Figure: broadened distributions $f(Q|b)$ and $f(Q|s+b)$, with the p-value of $b$ only and the p-value of $s+b$ shown as tail areas.]

14 Using the likelihood ratio $L(s)/L(\hat{s})$
Instead of the likelihood ratio $L_{s+b}/L_b$, suppose we use as a test statistic $\lambda(s) = L(s)/L(\hat{s})$, where $\hat{s}$ maximizes $L(s)$. Intuitively this is a measure of the level of agreement between the data and the hypothesized value of $s$: low $\lambda$ means poor agreement, high $\lambda$ means better agreement, and $0 \le \lambda \le 1$.

15 $L(s)/L(\hat{s})$ for counting experiment
Consider an experiment where we only count $n$ events with $n \sim \mathrm{Poisson}(s + b)$. Then $\hat{s} = n - b$. To establish discovery of signal we test the hypothesis $s = 0$ using $\lambda(0) = L(0)/L(\hat{s})$, whereas previously we had used $L_{s+b}/L_b = L(s)/L(0)$, which is monotonic in $n$ and thus equivalent to using $n$ as the test statistic.

16 $L(s)/L(\hat{s})$ for counting experiment (2)
But if we only consider the possibility of signal being present when $n > b$, then in this range $\lambda(0)$ is also monotonic in $n$, so both likelihood ratios lead to the same test.

17 $L(s)/L(\hat{s})$ for general experiment
If we do not simply count events but also measure for each some set of numbers, then the two likelihood ratios do not necessarily give equivalent tests, but in practice they will be very close. $\lambda(s)$ has the important advantage that for a sufficiently large event sample, its distribution approaches a well-defined form (Wilks' theorem). In practice the approach to the asymptotic form is rapid and one obtains a good approximation even for relatively small data samples (but one needs to check with MC). This remains true even when we have adjustable nuisance parameters in the problem, i.e., parameters that are needed for a correct description of the data but are otherwise not of interest (key to dealing with systematic uncertainties).

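18 Prototype search analysis
Search for signal in a region of phase space; the result is a histogram of some variable $x$ giving numbers $n = (n_1, \ldots, n_N)$. Assume the $n_i$ are Poisson distributed with expectation values $E[n_i] = \mu s_i + b_i$, where $\mu$ is the strength parameter and the signal and background contributions are $s_i = s_{\rm tot} \int_{{\rm bin}\ i} f_s(x; \theta_s)\, dx$ and $b_i = b_{\rm tot} \int_{{\rm bin}\ i} f_b(x; \theta_b)\, dx$.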

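19 Prototype analysis (II)
Often one also has a subsidiary measurement that constrains some of the background and/or shape parameters, giving numbers $m = (m_1, \ldots, m_M)$. Assume the $m_i$ are Poisson distributed with expectation values $E[m_i] = u_i(\theta)$, where $\theta = (\theta_s, \theta_b, b_{\rm tot})$ are the nuisance parameters. The likelihood function is
$L(\mu, \theta) = \prod_{j=1}^{N} \frac{(\mu s_j + b_j)^{n_j}}{n_j!} e^{-(\mu s_j + b_j)} \prod_{k=1}^{M} \frac{u_k^{m_k}}{m_k!} e^{-u_k}$.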

20 The profile likelihood ratio
Base the significance test on the profile likelihood ratio $\lambda(\mu) = L(\mu, \hat{\hat{\theta}}) / L(\hat{\mu}, \hat{\theta})$, where $\hat{\hat{\theta}}$ maximizes $L$ for the specified $\mu$, and $\hat{\mu}$, $\hat{\theta}$ maximize $L$. The likelihood ratio of point hypotheses gives the optimum test (Neyman-Pearson lemma). The profile LR should be near-optimal in the present analysis with variable $\mu$ and nuisance parameters $\theta$.

21 Test statistic for discovery
Try to reject the background-only ($\mu = 0$) hypothesis using
$q_0 = -2 \ln \lambda(0)$ for $\hat{\mu} \ge 0$, and $q_0 = 0$ for $\hat{\mu} < 0$,
i.e. here we only regard an upward fluctuation of the data as evidence against the background-only hypothesis. Note that even though here physically $\mu \ge 0$, we allow $\hat{\mu}$ to be negative. In the large-sample limit its distribution becomes Gaussian, and this will allow us to write down simple expressions for the distributions of our test statistics.

22 p-value for discovery
Large $q_0$ means increasing incompatibility between the data and the hypothesis; therefore the p-value for an observed $q_{0,\rm obs}$ is
$p_0 = \int_{q_{0,\rm obs}}^{\infty} f(q_0|0)\, dq_0$
(a formula for $f(q_0|0)$ is given later). From the p-value get the equivalent significance, $Z = \Phi^{-1}(1 - p)$.

23 Significance from p-value
Often one defines the significance $Z$ as the number of standard deviations that a Gaussian variable would fluctuate in one direction to give the same p-value: $p = 1 - \Phi(Z)$ (in ROOT, 1 - TMath::Freq) and $Z = \Phi^{-1}(1 - p)$ (in ROOT, TMath::NormQuantile).
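As a rough Python counterpart to the ROOT functions named on this slide, the conversion between a one-sided p-value and a significance, $p = 1 - \Phi(Z)$ and $Z = \Phi^{-1}(1 - p)$, can be sketched with the standard library alone. The helper names `p_from_z` and `z_from_p` are illustrative, not taken from the lecture.

```python
import math
from statistics import NormalDist

def p_from_z(z):
    """One-sided Gaussian tail probability, p = 1 - Phi(Z)."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))

def z_from_p(p):
    """Significance Z = Phi^{-1}(1 - p).

    Uses the symmetry Phi^{-1}(1 - p) = -Phi^{-1}(p), which avoids
    the loss of precision in 1 - p for very small p-values.
    """
    return -NormalDist().inv_cdf(p)

p5 = p_from_z(5.0)      # p-value corresponding to a 5 sigma effect
z_back = z_from_p(p5)   # round trip back to the significance
print(p5, z_back)
```

Working with the upper tail directly (`erfc`, and the symmetry of the quantile function) keeps the round trip accurate at the very small p-values relevant for discovery.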

24 Expected (or median) significance / sensitivity
When planning the experiment, we want to quantify how sensitive we are to a potential discovery, e.g., by giving the median significance assuming some nonzero strength parameter $\mu'$. So for the p-value we need $f(q_0|0)$; for the sensitivity we will need $f(q_0|\mu')$. [Figure: distributions $f(q_0|0)$ and $f(q_0|\mu')$.]

25 Test statistic for upper limits
For purposes of setting an upper limit on $\mu$ use
$q_\mu = -2 \ln \lambda(\mu)$ for $\hat{\mu} \le \mu$, and $q_\mu = 0$ for $\hat{\mu} > \mu$,
where $\lambda(\mu)$ is the profile likelihood ratio. Note that for purposes of setting an upper limit, one does not regard an upward fluctuation of the data as representing incompatibility with the hypothesized $\mu$. From the observed $q_\mu$ find the p-value:
$p_\mu = \int_{q_{\mu,\rm obs}}^{\infty} f(q_\mu|\mu)\, dq_\mu$.
The 95% CL upper limit on $\mu$ is the highest value for which the p-value is not less than 0.05.
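The limit-setting recipe can be illustrated for the simplest case, a single count $n \sim \mathrm{Poisson}(\mu s + b)$ with $b$ known and no nuisance parameters. The sketch below is a minimal example under two assumptions that go beyond this slide: the p-value is taken from the large-sample relation $p_\mu = 1 - \Phi(\sqrt{q_\mu})$, and the function names and input numbers ($n = 9$, $s = 6$, $b = 9$) are illustrative.

```python
import math
from statistics import NormalDist

def log_poisson(n, nu):
    """ln of the Poisson likelihood, up to the constant -ln(n!)."""
    return (n * math.log(nu) if n > 0 else 0.0) - nu

def q_mu(mu, n, s, b):
    """q_mu = -2 ln lambda(mu) for mu_hat <= mu, else 0 (no nuisance parameters)."""
    mu_hat = (n - b) / s  # unconstrained ML estimator, may be negative
    if mu_hat > mu:
        return 0.0
    # At mu = mu_hat the fitted mean equals the observed count n.
    return -2.0 * (log_poisson(n, mu * s + b) - log_poisson(n, n))

def upper_limit(n, s, b, cl=0.95, tol=1e-9):
    """Largest mu with p_mu >= 1 - cl, assuming p_mu = 1 - Phi(sqrt(q_mu))."""
    target = NormalDist().inv_cdf(cl) ** 2  # q_mu value at the p-value threshold
    lo = max((n - b) / s, 0.0)
    hi = lo + 1.0
    while q_mu(hi, n, s, b) < target:       # expand until the root is bracketed
        hi *= 2.0
    while hi - lo > tol:                    # bisection: q_mu rises with mu above mu_hat
        mid = 0.5 * (lo + hi)
        if q_mu(mid, n, s, b) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

mu_up = upper_limit(n=9, s=6.0, b=9.0)
print(f"95% CL upper limit on mu: {mu_up:.3f}")
```

With nuisance parameters, each evaluation of $q_\mu$ would require a constrained fit in place of the closed-form likelihood used here.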

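26 Alternative test statistic for upper limits
Assume the physical signal model has $\mu > 0$; therefore if the estimator for $\mu$ comes out negative, the closest physical model has $\mu = 0$. Therefore one could also measure the level of discrepancy between the data and the hypothesized $\mu$ with
$\tilde{q}_\mu = -2 \ln \tilde{\lambda}(\mu)$ for $\hat{\mu} \le \mu$, and $\tilde{q}_\mu = 0$ for $\hat{\mu} > \mu$,
where $\tilde{\lambda}(\mu) = L(\mu, \hat{\hat{\theta}}(\mu)) / L(\hat{\mu}, \hat{\theta})$ for $\hat{\mu} \ge 0$ and $\tilde{\lambda}(\mu) = L(\mu, \hat{\hat{\theta}}(\mu)) / L(0, \hat{\hat{\theta}}(0))$ for $\hat{\mu} < 0$.
Performance is not identical to but very close to that of $q_\mu$ (of the previous slide). $q_\mu$ is simpler in important ways: its asymptotic distribution is independent of nuisance parameters.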

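27 Wald approximation for profile likelihood ratio
To find p-values, we need $f(q_\mu|\mu)$. For the median significance under the alternative, we need $f(q_\mu|\mu')$. Use the approximation due to Wald (1943):
$-2 \ln \lambda(\mu) = \frac{(\mu - \hat{\mu})^2}{\sigma^2} + O(1/\sqrt{N})$,
where $\hat{\mu} \sim \mathrm{Gaussian}(\mu', \sigma)$ and $N$ is the sample size.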

28 Noncentral chi-square for $-2 \ln \lambda(\mu)$
If we can neglect the $O(1/\sqrt{N})$ term, $-2 \ln \lambda(\mu)$ follows a noncentral chi-square distribution for one degree of freedom with noncentrality parameter
$\Lambda = \frac{(\mu - \mu')^2}{\sigma^2}$.
As a special case, if $\mu' = \mu$ then $\Lambda = 0$ and $-2 \ln \lambda(\mu)$ follows a chi-square distribution for one degree of freedom (Wilks).

29 The Asimov data set
To estimate the median value of $-2 \ln \lambda(\mu)$, consider a special data set where all statistical fluctuations are suppressed and $n_i$, $m_i$ are replaced by their expectation values (the "Asimov" data set): $n_i \to \mu' s_i + b_i$, $m_i \to u_i$. The Asimov value of $-2 \ln \lambda(\mu)$ gives the noncentrality parameter $\Lambda$, or equivalently $\sigma$:
$-2 \ln \lambda_{\rm A}(\mu) \approx \frac{(\mu - \mu')^2}{\sigma^2} = \Lambda$.

30 Relation between test statistics and $\hat{\mu}$
[Equations and figure missing.]

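31 Distribution of $q_0$
Assuming the Wald approximation, we can write down the full distribution of $q_0$ as
$f(q_0|\mu') = \left( 1 - \Phi\left( \frac{\mu'}{\sigma} \right) \right) \delta(q_0) + \frac{1}{2} \frac{1}{\sqrt{2\pi}} \frac{1}{\sqrt{q_0}} \exp\left[ -\frac{1}{2} \left( \sqrt{q_0} - \frac{\mu'}{\sigma} \right)^2 \right]$.
The special case $\mu' = 0$ is a "half chi-square" distribution:
$f(q_0|0) = \frac{1}{2} \delta(q_0) + \frac{1}{2} \frac{1}{\sqrt{2\pi}} \frac{1}{\sqrt{q_0}} e^{-q_0/2}$.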

32 Cumulative distribution of $q_0$, significance
From the pdf, the cumulative distribution of $q_0$ is found to be
$F(q_0|\mu') = \Phi\left( \sqrt{q_0} - \frac{\mu'}{\sigma} \right)$.
The special case $\mu' = 0$ is $F(q_0|0) = \Phi(\sqrt{q_0})$. The p-value of the $\mu = 0$ hypothesis is $p_0 = 1 - F(q_0|0)$. Therefore the discovery significance $Z$ is simply $Z_0 = \Phi^{-1}(1 - p_0) = \sqrt{q_0}$.
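A small toy study can make the "half chi-square" behaviour concrete. The sketch below assumes the Wald approximation holds exactly, so that $\hat{\mu}$ is Gaussian with mean $\mu'$ and standard deviation $\sigma$ and $q_0 = \hat{\mu}^2/\sigma^2$ for $\hat{\mu} > 0$ (zero otherwise). It checks that about half of the toys give $q_0 = 0$ under $\mu' = 0$ and that the tail fraction above a threshold matches $1 - \Phi(\sqrt{q_0})$. The function names and the threshold $q_0 = 4$ are illustrative choices.

```python
import math
import random

def simulate_q0(n_toys=200_000, mu_prime=0.0, sigma=1.0, seed=12345):
    """Generate q0 values assuming mu_hat ~ Gaussian(mu_prime, sigma)."""
    rng = random.Random(seed)
    q0 = []
    for _ in range(n_toys):
        mu_hat = rng.gauss(mu_prime, sigma)
        # Only upward fluctuations count as evidence against mu = 0.
        q0.append((mu_hat / sigma) ** 2 if mu_hat > 0 else 0.0)
    return q0

def asymptotic_p0(q0_obs):
    """p_0 = 1 - Phi(sqrt(q0)), the tail of the half chi-square distribution."""
    return 0.5 * math.erfc(math.sqrt(q0_obs) / math.sqrt(2.0))

toys = simulate_q0()
frac_zero = sum(q == 0.0 for q in toys) / len(toys)  # weight of the delta function
mc_tail = sum(q > 4.0 for q in toys) / len(toys)     # MC estimate of P(q0 > 4)
print(frac_zero, mc_tail, asymptotic_p0(4.0))
```

In a real analysis the toys would be generated from the full model rather than from a Gaussian $\hat{\mu}$; the point of the sketch is only the shape of the resulting distribution.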

33 Relation between test statistics and $\hat{\mu}$
Assuming the Wald approximation for $-2 \ln \lambda(\mu)$, $q_\mu$ and $\tilde{q}_\mu$ both have a monotonic relation with $\hat{\mu}$. Therefore quantiles of $q_\mu$ and $\tilde{q}_\mu$ can be obtained directly from those of $\hat{\mu}$ (which is Gaussian).

34 Distribution of $q_\mu$
Similar results hold for $q_\mu$. [Equations missing.]

35 Distribution of $\tilde{q}_\mu$
Similar results hold for $\tilde{q}_\mu$. [Equations missing.]

36 Monte Carlo test of asymptotic formula
Here take $\tau = 1$. The asymptotic formula is a good approximation to the $5\sigma$ level ($q_0 = 25$) already for $b \sim 20$. [Figure: MC distributions of $q_0$ compared with the asymptotic formula.]

37 Monte Carlo test of asymptotic formulae
Significance from the asymptotic formula, here $Z_0 = \sqrt{q_0} = 4$, compared to the MC (true) value. For very low $b$, the asymptotic formula underestimates $Z_0$. Then there is a slight overshoot before rapidly converging to the MC value.

38 Monte Carlo test of asymptotic formulae
The asymptotic $f(q_0|1)$ is good already for fairly small samples. Median$[q_0|1]$ from the Asimov data set is in good agreement with MC.

39 Monte Carlo test of asymptotic formulae
Consider again $n \sim \mathrm{Poisson}(\mu s + b)$, $m \sim \mathrm{Poisson}(\tau b)$. Use $q_\mu$ to find the p-value of hypothesized $\mu$ values, e.g. $f(q_1|1)$ for the p-value of $\mu = 1$. Typically one is interested in 95% CL, i.e., a p-value threshold of 0.05, i.e., $q_1 = 2.69$ or $Z_1 = \sqrt{q_1} = 1.64$. Median$[q_1|0]$ gives the "exclusion sensitivity". Here the asymptotic formulae are good for $s = 6$, $b = 9$.

40 Monte Carlo test of asymptotic formulae
Same message for the test based on $\tilde{q}_\mu$. $q_\mu$ and $\tilde{q}_\mu$ give similar tests to the extent that the asymptotic formulae are valid.

41 Discovery significance for n ~ Poisson(s + b)
Consider again the case where we observe $n$ events, modelled as following a Poisson distribution with mean $s + b$ (assume $b$ is known).
1) For an observed $n$, what is the significance $Z_0$ with which we would reject the $s = 0$ hypothesis?
2) What is the expected (or more precisely, median) $Z_0$ if the true value of the signal rate is $s$?

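42 Gaussian approximation for Poisson significance
For large $s + b$, $n \to x \sim \mathrm{Gaussian}(\mu, \sigma)$, with $\mu = s + b$ and $\sigma = \sqrt{s + b}$. For an observed value $x_{\rm obs}$, the p-value of $s = 0$ is $\mathrm{Prob}(x > x_{\rm obs} | s = 0)$:
$p = 1 - \Phi\left( \frac{x_{\rm obs} - b}{\sqrt{b}} \right)$.
The significance for rejecting $s = 0$ is therefore
$Z_0 = \Phi^{-1}(1 - p) = \frac{x_{\rm obs} - b}{\sqrt{b}}$.
The expected (median) significance assuming signal rate $s$ is
$\mathrm{median}[Z_0 | s + b] = \frac{s}{\sqrt{b}}$.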
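To see how far the Gaussian approximation $Z_0 = (n - b)/\sqrt{b}$ can be from the exact Poisson result, the following sketch computes the exact p-value $P(n \ge n_{\rm obs} | b)$ by direct summation and converts it to a significance. The function names and the example values ($n_{\rm obs} = 130$, $b = 100$) are illustrative assumptions; the simple summation is adequate only for moderate $b$ and p-values well above machine precision.

```python
import math
from statistics import NormalDist

def poisson_p_value(n_obs, b):
    """Exact p-value P(n >= n_obs | b) for a Poisson count with mean b."""
    term = math.exp(-b)     # P(n = 0)
    cdf = 0.0
    for k in range(n_obs):  # accumulate P(n <= n_obs - 1)
        cdf += term
        term *= b / (k + 1)
    return 1.0 - cdf

def z_gauss(n_obs, b):
    """Significance in the Gaussian approximation, (n_obs - b) / sqrt(b)."""
    return (n_obs - b) / math.sqrt(b)

def z_exact(n_obs, b):
    """Significance from the exact Poisson p-value, Z = Phi^{-1}(1 - p)."""
    return -NormalDist().inv_cdf(poisson_p_value(n_obs, b))

print(z_gauss(130, 100.0), z_exact(130, 100.0))
```

Because the Poisson distribution has a longer upper tail than the matching Gaussian, the exact significance comes out somewhat lower than the Gaussian estimate in this example.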

43 Better approximation for Poisson significance
The likelihood function for the parameter $s$ is
$L(s) = \frac{(s + b)^n}{n!} e^{-(s + b)}$,
or equivalently the log-likelihood is
$\ln L(s) = n \ln(s + b) - (s + b) - \ln n!$.
Finding the maximum by setting $\partial \ln L / \partial s = 0$ gives the estimator for $s$: $\hat{s} = n - b$.

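44 Approximate Poisson significance (continued)
The likelihood ratio statistic for testing $s = 0$ is
$q_0 = -2 \ln \frac{L(0)}{L(\hat{s})} = 2 \left( n \ln \frac{n}{b} + b - n \right)$ for $n > b$, and $q_0 = 0$ otherwise.
For sufficiently large $s + b$ (use Wilks' theorem),
$Z_0 \approx \sqrt{q_0} = \sqrt{2 \left( n \ln \frac{n}{b} + b - n \right)}$ for $n > b$.
To find median$[Z_0 | s + b]$, let $n \to s + b$ (i.e., the Asimov data set):
$\mathrm{median}[Z_0 | s + b] \approx \sqrt{2 \left( (s + b) \ln\left( 1 + \frac{s}{b} \right) - s \right)}$.
This reduces to $s/\sqrt{b}$ for $s \ll b$.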
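The Asimov expression for the median discovery significance, $\sqrt{2((s+b)\ln(1+s/b) - s)}$, is easy to compare numerically with the simpler $s/\sqrt{b}$. The short sketch below does this; the function names are illustrative, and the example values of $s$ and $b$ are arbitrary choices for the comparison.

```python
import math

def z_asimov(s, b):
    """Median discovery significance from the Asimov data set (n -> s + b)."""
    return math.sqrt(2.0 * ((s + b) * math.log1p(s / b) - s))

def z_simple(s, b):
    """Gaussian-limit estimate s / sqrt(b), valid only for s << b."""
    return s / math.sqrt(b)

for s, b in [(10.0, 20.0), (5.0, 1.0), (1.0, 1.0e4)]:
    print(f"s = {s:g}, b = {b:g}: "
          f"Asimov = {z_asimov(s, b):.4f}, s/sqrt(b) = {z_simple(s, b):.4f}")
```

When $s$ is not small compared with $b$, $s/\sqrt{b}$ overstates the sensitivity relative to the Asimov formula, while for $s \ll b$ the two agree closely.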

45 n ~ Poisson($\mu s + b$): median significance, assuming $\mu = 1$, of the hypothesis $\mu = 0$
"Exact" values from MC; jumps are due to discrete data. Asimov $\sqrt{q_{0,\rm A}}$ is a good approximation for a broad range of $s$, $b$. $s/\sqrt{b}$ is only good for $s \ll b$. (CCGV, arXiv: )

46 Example 2: Shape analysis
Look for a Gaussian bump sitting on top of: [Figure: background distribution.]

47 Monte Carlo test of asymptotic formulae
Distributions of $q_\mu$, here for the $\mu$ that gave $p_\mu = 0.05$. [Figure.]

48 Using $f(q_\mu|0)$ to get error bands
We are not only interested in the median$[q_\mu|0]$; we want to know how much statistical variation to expect from a real data set. But we have the full $f(q_\mu|0)$, so we can get any desired quantiles.

49 Distribution of upper limit on $\mu$
$\pm 1\sigma$ (green) and $\pm 2\sigma$ (yellow) bands from MC; vertical lines from asymptotic formulae. [Figure.]

50 Limit on $\mu$ versus peak position (mass)
$\pm 1\sigma$ (green) and $\pm 2\sigma$ (yellow) bands from asymptotic formulae; points are from a single arbitrary data set. [Figure.]

51 Using likelihood ratio $L_{s+b}/L_b$
Many searches at the Tevatron have used the statistic
$q = -2 \ln \frac{L_{s+b}}{L_b}$,
where $L_{s+b}$ is the likelihood of the $\mu = 1$ model (s+b) and $L_b$ is the likelihood of the $\mu = 0$ model (bkg only). This can be written
$q = -2 \ln \lambda(1) + 2 \ln \lambda(0)$.

52 Wald approximation for $L_{s+b}/L_b$
Assuming the Wald approximation, $q$ can be written as
$q = \frac{(1 - \hat{\mu})^2}{\sigma^2} - \frac{\hat{\mu}^2}{\sigma^2} = \frac{1 - 2\hat{\mu}}{\sigma^2}$,
i.e. $q$ is Gaussian distributed with mean and variance of
$E[q] = \frac{1 - 2\mu}{\sigma^2}$, $V[q] = \frac{4}{\sigma^2}$.
To get $\sigma^2$ use the 2nd derivatives of $\ln L$ with the Asimov data set.

53 Example with $L_{s+b}/L_b$
Consider again $n \sim \mathrm{Poisson}(\mu s + b)$, $m \sim \mathrm{Poisson}(\tau b)$, with $b = 20$, $s = 10$, $\tau = 1$. So even for a smallish data sample, the Wald approximation can be useful; no MC needed. [Figure.]

54 The Look-Elsewhere Effect
Eilam Gross and Ofer Vitells, arXiv: (→ EPJC)
Suppose a model for a mass distribution allows for a peak at a mass $m$ with amplitude $\mu$. The data show a bump at a mass $m_0$. How consistent is this with the no-bump ($\mu = 0$) hypothesis?

55 p-value for fixed mass
First, suppose the mass $m_0$ of the peak was specified a priori. Test the consistency of the bump with the no-signal ($\mu = 0$) hypothesis with e.g. the likelihood ratio
$t_{\rm fix} = -2 \ln \frac{L(0)}{L(\hat{\mu}, m_0)}$,
where "fix" indicates that the mass of the peak is fixed to $m_0$. The resulting p-value gives the probability to find a value of $t_{\rm fix}$ at least as great as observed at the specific mass $m_0$. (Eilam Gross and Ofer Vitells, arXiv: (→ EPJC))

56 p-value for floating mass
But suppose we did not know where in the distribution to expect a peak. What we want is the probability to find a peak at least as significant as the one observed anywhere in the distribution. Include the mass as an adjustable parameter in the fit, and test the significance of the peak using
$t_{\rm float} = -2 \ln \frac{L(0)}{L(\hat{\mu}, \hat{m})}$.
(Note $m$ does not appear in the $\mu = 0$ model.) (Eilam Gross and Ofer Vitells, arXiv: (→ EPJC))

57 Distributions of $t_{\rm fix}$, $t_{\rm float}$
For a sufficiently large data sample, $t_{\rm fix}$ follows a chi-square distribution for 1 degree of freedom (Wilks' theorem). For $t_{\rm float}$ there are two adjustable parameters, $\mu$ and $m$, and naively Wilks' theorem says $t_{\rm float}$ follows a chi-square distribution for 2 d.o.f. In fact Wilks' theorem does not hold in the floating mass case because one of the parameters ($m$) is not defined in the $\mu = 0$ model. So getting the $t_{\rm float}$ distribution is more difficult. (Eilam Gross and Ofer Vitells, arXiv: (→ EPJC))

58 Trials factor
We would like to be able to relate the p-values for the fixed and floating mass analyses (at least approximately). Gross and Vitells show that the "trials factor" can be approximated by
$\frac{p_{\rm float}}{p_{\rm fix}} \approx 1 + \sqrt{\frac{\pi}{2}}\, \langle N \rangle\, Z_{\rm fix}$,
where $\langle N \rangle$ is the average number of "upcrossings" of $-2 \ln L$ in the fit range and $Z_{\rm fix} = \sqrt{t_{\rm fix}}$ is the significance for the fixed mass case. So we can either carry out the full floating-mass analysis (e.g. use MC to get the p-value), or do the fixed mass analysis and apply a correction factor (much faster than MC). (Eilam Gross and Ofer Vitells, arXiv: (→ EPJC))

59 Upcrossings of $-2 \ln L$
The Gross-Vitells formula for the trials factor requires the mean number of "upcrossings" of $-2 \ln L$ in the fit range based on a fixed threshold; estimate it with MC at a low reference level. [Figure: upcrossings of $-2 \ln L$ above the threshold.] (Eilam Gross and Ofer Vitells, arXiv: (→ EPJC))

60 Multidimensional look-elsewhere effect
Generalization to multiple dimensions: the number of upcrossings is replaced by the expectation of the Euler characteristic [equation missing]. Applications: astrophysics (coordinates on sky), search for a resonance of unknown mass and width, ... (Eilam Gross and Ofer Vitells, PHYSTAT2011)

61 Summary on Look-Elsewhere Effect
Remember the Look-Elsewhere Effect is when we test a single model (e.g., SM) with multiple observations, i.e., in multiple places. Note there is no look-elsewhere effect when considering exclusion limits. There we test specific signal models (typically once) and say whether each is excluded. With exclusion there is, however, the analogous issue of testing many signal models (or parameter values) and thus excluding some even in the absence of signal ("spurious exclusion"). An approximate correction for the LEE should be sufficient, and one should also report the uncorrected significance.
"There's no sense in being precise when you don't even know what you're talking about." (John von Neumann)

62 Extra Slides

63 RooStats
G. Schott, PHYSTAT2011. [Figure.]

64 RooFit Workspaces
Able to construct the full likelihood for a combination of channels (or experiments). (G. Schott, PHYSTAT2011)

65 Combined ATLAS/CMS Higgs search
K. Cranmer, PHYSTAT2011. Given p-values $p_1, \ldots, p_N$ of $H$, what is the combined $p$? Better: given the results of $N$ (usually independent) experiments, what inferences can one draw from their combination? A full combination is difficult but worth the effort for e.g. the Higgs search.