G. Cowan Statistical techniques for systematics page 1 Statistical techniques for incorporating systematic/theory uncertainties Theory/Experiment Interplay.

Slides:



Advertisements
Similar presentations
Using the Profile Likelihood in Searches for New Physics / PHYSTAT 2011 G. Cowan 1 Using the Profile Likelihood in Searches for New Physics arXiv:
Advertisements

1 Bayesian statistical methods for parton analyses Glen Cowan Royal Holloway, University of London DIS 2006.
G. Cowan Statistics for HEP / LAL Orsay, 3-5 January 2012 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
G. Cowan RHUL Physics Profile likelihood for systematic uncertainties page 1 Use of profile likelihood to determine systematic uncertainties ATLAS Top.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 3 1 Statistical Methods for Particle Physics Lecture 3: Limits for Poisson mean: Bayesian.
Search for B     with SemiExclusive reconstruction C.Cartaro, G. De Nardo, F. Fabozzi, L. Lista Università & INFN - Sezione di Napoli.
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
G. Cowan Statistical methods for HEP / Birmingham 9 Nov Recent developments in statistical methods for particle physics Particle Physics Seminar.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
G. Cowan RHUL Physics Statistical Methods for Particle Physics / 2007 CERN-FNAL HCP School page 1 Statistical Methods for Particle Physics (2) CERN-FNAL.
G. Cowan RHUL Physics Higgs combination note status page 1 Status of Higgs Combination Note ATLAS Statistics/Higgs Meeting Phone, 7 April, 2008 Glen Cowan.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
G. Cowan Lectures on Statistical Data Analysis Lecture 14 page 1 Statistical Data Analysis: Lecture 14 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Shandong seminar / 1 September Some Developments in Statistical Methods for Particle Physics Particle Physics Seminar Shandong University.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits Lecture 2: Tests based on likelihood.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 3 1 Statistical Methods for Discovery and Limits Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan CLASHEP 2011 / Topics in Statistical Data Analysis / Lecture 21 Topics in Statistical Data Analysis for HEP Lecture 2: Statistical Tests CERN.
G. Cowan Statistical Methods in Particle Physics1 Statistical Methods in Particle Physics Day 4: Discovery and limits 清华大学高能物理研究中心 2010 年 4 月 12—16 日 Glen.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination based on event counts (follow-up from 11 May 07) ATLAS Statistics Forum.
G. Cowan Weizmann Statistics Workshop, 2015 / GDC Lecture 31 Statistical Methods for Particle Physics Lecture 3: asymptotics I; Asimov data set Statistical.
G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department.
G. Cowan ATLAS Statistics Forum / Minimum Power for PCL 1 Minimum Power for PCL ATLAS Statistics Forum EVO, 10 June, 2011 Glen Cowan* Physics Department.
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
G. Cowan RHUL Physics LR test to determine number of parameters page 1 Likelihood ratio test to determine best number of parameters ATLAS Statistics Forum.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 41 Statistics for the LHC Lecture 4: Bayesian methods and further topics Academic.
G. Cowan RHUL Physics Input from Statistics Forum for Exotics page 1 Input from Statistics Forum for Exotics ATLAS Exotics Meeting CERN/phone, 22 January,
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Easy Limit Statistics Andreas Hoecker CAT Physics, Mar 25, 2011.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan NExT Workshop, 2015 / GDC Lecture 11 Statistical Methods for Particle Physics Lecture 1: introduction & statistical tests Fifth NExT PhD Workshop:
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Statistical methods for particle physics / Cambridge Recent developments in statistical methods for particle physics HEP Phenomenology.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
G. Cowan RHUL Physics Analysis Strategies Workshop -- Statistics Forum Report page 1 Report from the Statistics Forum ATLAS Analysis Strategies Workshop.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
G. Cowan RHUL Physics Status of Higgs combination page 1 Status of Higgs Combination ATLAS Higgs Meeting CERN/phone, 7 November, 2008 Glen Cowan, RHUL.
G. Cowan Systematic uncertainties in statistical data analysis page 1 Systematic uncertainties in statistical data analysis for particle physics DESY Seminar.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Statistical methods for HEP / Freiburg June 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 2: Discovery.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Systematics in Hfitter. Reminder: profiling nuisance parameters Likelihood ratio is the most powerful discriminant between 2 hypotheses What if the hypotheses.
G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 21 Statistics for the LHC Lecture 2: Discovery Academic Training Lectures CERN,
G. Cowan SLAC Statistics Meeting / 4-6 June 2012 / Two Developments 1 Two developments in discovery tests: use of weighted Monte Carlo events and an improved.
G. Cowan CERN Academic Training 2012 / Statistics for HEP / Lecture 21 Statistics for HEP Lecture 2: Discovery and Limits Academic Training Lectures CERN,
Discussion on significance
Status of the Higgs to tau tau
Statistics for the LHC Lecture 3: Setting limits
Comment on Event Quality Variables for Multivariate Analyses
Lecture 4 1 Probability (90 min.)
Recent developments in statistical methods for particle physics
Computing and Statistical Data Analysis / Stat 8
TAE 2018 / Statistics Lecture 3
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Computing and Statistical Data Analysis / Stat 7
Statistical Methods for HEP Lecture 3: Discovery and Limits
Decomposition of Stat/Sys Errors
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Computing and Statistical Data Analysis / Stat 10
Presentation transcript:

G. Cowan Statistical techniques for systematics page 1 Statistical techniques for incorporating systematic/theory uncertainties Theory/Experiment Interplay at the LHC RHUL, 8 April, 2010 Glen Cowan Physics Department, Royal Holloway, University of London

G. Cowan Statistical techniques for systematics page 2 Outline General statistical formalism for a search at the LHC Mainly frequentist Significance test using profile likelihood ratio Distributions of profile likelihood ratio in large sample limit (with E. Gross, O. Vitells, K. Cranmer) General strategy for dealing with systematics Improve model by including additional parameters Example 1: tau hadronic mass distribution Example 2: b → s 

page 3 Prototype analysis Search for signal in a region of phase space; result is histogram of some variable x giving numbers: Assume the n i are Poisson distributed with expectation values G. Cowan Statistical techniques for systematics signal where background strength parameter

page 4 Prototype analysis (II) Often also have a subsidiary measurement that constrains some of the background and/or shape parameters: Assume the m i are Poisson distributed with expectation values G. Cowan Statistical techniques for systematics nuisance parameters (  s,  b,b tot ) Likelihood function is (N.B. here m = number of counts, not mass!)

page 5 The profile likelihood ratio Can base significance test on the profile likelihood ratio: G. Cowan Statistical techniques for systematics maximizes L for specified  maximize L The likelihood ratio gives optimum test between two point hypotheses (Neyman-Pearson lemma). Should be near-optimal in present analysis with variable  and nuisance parameters .

page 6 Test statistic for discovery Try to reject background-only (  = 0) hypothesis using G. Cowan Statistical techniques for systematics Large q 0 means increasing incompatibility between the data and hypothesis, therefore p-value for an observed q 0,obs is will get formula for this later i.e. only regard upward fluctuation of data as evidence against the background-only hypothesis.

page 7 Test statistic for upper limits For purposes of setting an upper limit on  use G. Cowan Statistical techniques for systematics Note for purposes of setting an upper limit, one does not regard an upwards fluctuation of the data as representing incompatibility with the hypothesized .

page 8 G. Cowan Statistical techniques for systematics p-value / significance of hypothesized  Test hypothesized  by giving p-value, probability to see data with ≤ compatibility with  compared to data observed: Equivalently use significance, Z, defined as equivalent number of sigmas for a Gaussian fluctuation in one direction:

page 9 Using p-value for discovery/exclusion Carry out significance test of various hypotheses (background-only, signal plus background, …) Result is p-value. Exclude hypothesis if p-value below threshold: Discovery: test of background-only hypothesis. Exclude if p 5) Limits: test signal (+background) hypothesis. Exclude if p < 0.05 (i.e. 95% CL limit) G. Cowan Statistical techniques for systematics

page 10 Wald approximation for profile likelihood ratio To find p-values, we need: For median significance under alternative, need: G. Cowan Statistical techniques for systematics Use approximation due to Wald (1943) sample size

page 11 Distribution of q 0 Assuming the Wald approximation, we can write down the full distribution of q  as G. Cowan Statistical techniques for systematics The special case  ′ = 0 is a “half chi-square” distribution:

page 12 Cumulative distribution of q 0, significance From the pdf, the cumulative distribution of q  is found to be G. Cowan Statistical techniques for systematics The special case  ′ = 0 is The p-value of the  = 0 hypothesis is Therefore the discovery significance Z is simply

page 13 Distribution of q  Similar results for q  G. Cowan Statistical techniques for systematics

page 14 An example G. Cowan Statistical techniques for systematics

page 15 Error bands G. Cowan Statistical techniques for systematics

G. Cowan Statistical techniques for systematics page 16 Dealing with systematics Suppose one needs to know the shape of a distribution. Initial model (e.g. MC) is available, but known to be imperfect. Q: How can one incorporate the systematic error arising from use of the incorrect model? A: Improve the model. That is, introduce more adjustable parameters into the model so that for some point in the enlarged parameter space it is very close to the truth. Then use profile the likelihood with respect to the additional (nuisance) parameters. The correlations with the nuisance parameters will inflate the errors in the parameters of interest. Difficulty is deciding how to introduce the additional parameters. S. Caron, G. Cowan, S. Horner, J. Sundermann, E. Gross, 2009 JINST 4 P10009

page 17 Example of inserting nuisance parameters G. Cowan Statistical techniques for systematics Fit of hadronic mass distribution from a specific  decay mode. Important uncertainty in background from non-signal  modes. Background rate from other measurements, shape from MC. Want to include uncertainty in rate, mean, width of background component in a parametric fit of the mass distribution. fit from MC

page 18 Step 1: uncertainty in rate G. Cowan Statistical techniques for systematics Scale the predicted background by a factor r: b i → rb i Uncertainty in r is  r Regard r 0 = 1 (“best guess”) as Gaussian (or not, as appropriate) distributed measurement centred about the true value r, which becomes a new “nuisance” parameter in the fit. New likelihood function is: For a least-squares fit, equivalent to

page 19 Dealing with nuisance parameters G. Cowan Statistical techniques for systematics Ways to eliminate the nuisance parameter r from likelihood. 1) Profile likelihood: 2) Bayesian marginal likelihood: (prior) Profile and marginal likelihoods usually very similar. Both are broadened relative to original, reflecting the uncertainty connected with the nuisance parameter.

page 20 Step 2: uncertainty in shape G. Cowan Statistical techniques for systematics Key is to insert additional nuisance parameters into the model. E.g. consider a distribution g(y). Let y → x(y),

page 21 More uncertainty in shape G. Cowan Statistical techniques for systematics The transformation can be applied to a spline of original MC histogram (which has shape uncertainty). Continuous parameter  shifts distribution right/left. Can play similar game with width (or higher moments), e.g.,

page 22 A sample fit (no systematic error) G. Cowan Statistical techniques for systematics Consider a Gaussian signal, polynomial background, and also a peaking background whose form is take from MC: Template from MC True mean/width of signal: True mean/width of back- ground from MC: Fit result:

page 23 Sample fit with systematic error G. Cowan Statistical techniques for systematics Suppose now the MC template for the peaking background was systematically wrong, having Now fitted values of signal parameters wrong, poor goodness-of-fit:

page 24 Sample fit with adjustable mean/width G. Cowan Statistical techniques for systematics Suppose one regards peak position and width of MC template to have systematic uncertainties: Incorporate this by regarding the nominal mean/width of the MC template as measurements, so in LS fit add to  2 a term: orignal mean of MC template altered mean of MC template

page 25 Sample fit with adjustable mean/width (II) G. Cowan Statistical techniques for systematics Result of fit is now “good”: In principle, continue to add nuisance parameters until data are well described by the model.

page 26 Systematic error converted to statistical G. Cowan Statistical techniques for systematics One can regard the quadratic difference between the statistical errors with and without the additional nuisance parameters as the contribution from the systematic uncertainty in the MC template: Formally this part of error has been converted to part of statistical error (because the extended model is ~correct!).

page 27 Systematic error from “shift method” G. Cowan Statistical techniques for systematics Note that the systematic error regarded as part of the new statistical error (previous slide) is much smaller than the change one would find by simply “shifting” the templates plus/minus one standard deviation, holding them constant, and redoing the fit. This gives: This is not necessarily “wrong”, since here we are not improving the model by including new parameters. But in any case it’s best to improve the model!

G. Cowan Statistical techniques for systematics page 28 Issues with finding an improved model Sometimes, e.g., if the data set is very large, the total  2 can be very high (bad), even though the absolute deviation between model and data may be small. It may be that including additional parameters "spoils" the parameter of interest and/or leads to an unphysical fit result well before it succeeds in improving the overall goodness-of-fit. Include new parameters in a clever (physically motivated, local) way, so that it affects only the required regions. Use Bayesian approach -- assign priors to the new nuisance parameters that constrain them from moving too far (or use equivalent frequentist penalty terms in likelihood). Unfortunately these solutions may not be practical and one may be forced to use ad hoc recipes (last resort).

G. Cowan Statistical techniques for systematics page 29 Summary and conclusions Key to covering a systematic uncertainty is to include the appropriate nuisance parameters, constrained by all available info. Enlarge model so that for at least one point in its parameter space, its difference from the truth is negligible. In frequentist approach can use profile likelihood (similar with integrated product of likelihood and prior -- not discussed today). Too many nuisance parameters spoils information about parameter(s) of interest. In Bayesian approach, need to assign priors to (all) parameters. Can provide important flexibility over frequentist methods. Can be difficult to encode uncertainty in priors. Exploit recent progress in Bayesian computation (MCMC). Finally, when the LHC announces a 5 sigma effect, it's important to know precisely what the "sigma" means.

page 30 Extra slides G. Cowan Statistical techniques for systematics

G. Cowan Statistical techniques for systematics page 31 Fit example: b → s  (BaBar) B. Aubert et al. (BaBar), Phys. Rev. D 77, (R) (2008). Decay of one B fully reconstructed (B tag ). Look for high-energy  from rest of event. Signal and background yields from fit to m ES in bins of E . e-e-e-e- D*D*D*D*  e+e+e+e+ B tag B signal Xs  high-energy  "recoil method"

G. Cowan Statistical techniques for systematics page 32 Fitting m ES distribution for b → s  Fit m ES distribution using superposition of Crystal Ball and Argus functions: Crystal Ball Argus shapes rates obs./pred. events in ith bin log-likelihood:

G. Cowan Statistical techniques for systematics page 33 Simultaneous fit of all m ES distributions Need fits of m ES distributions in 14 bins of E  : At high E , not enough events to constrain shape, so combine all E  bins into global fit: Start with no energy dependence, and include one by one more parameters until data well described. Shape parameters could vary (smoothly) with E . So make Ansatz for shape parameters such as

G. Cowan Statistical techniques for systematics page 34 Finding appropriate model flexibility Here for Argus  parameter, linear dependence gives significant improvement; fitted coefficient of linear term  ± . Inclusion of additional free parameters (e.g., quadratic E dependence for parameter  ) do not bring significant improvement. So including the additional energy dependence for the shape parameters converts the systematic uncertainty into a statistical uncertainty on the parameters of interest. D. Hopkins, PhD thesis, RHUL (2007).  2 (1)   2 (2) = 3.48 p-value of (1) = →data want extra par.