PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, 17-19 January, 2011 Glen Cowan Physics Department Royal Holloway, University of London.

Slides:



Advertisements
Similar presentations
Using the Profile Likelihood in Searches for New Physics / PHYSTAT 2011 G. Cowan 1 Using the Profile Likelihood in Searches for New Physics arXiv:
Advertisements

27 th March CERN Higgs searches: CL s W. J. Murray RAL.
Bayesian inference “Very much lies in the posterior distribution” Bayesian definition of sufficiency: A statistic T (x 1, …, x n ) is sufficient for 
Practical Statistics for LHC Physicists Bayesian Inference Harrison B. Prosper Florida State University CERN Academic Training Lectures 9 April, 2015.
G. Cowan Statistics for HEP / LAL Orsay, 3-5 January 2012 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
G. Cowan Statistics for HEP / LAL Orsay, 3-5 January 2012 / Lecture 3 1 Statistical Methods for Particle Physics Lecture 3: More on discovery and limits.
Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
G. Cowan RHUL Physics Profile likelihood for systematic uncertainties page 1 Use of profile likelihood to determine systematic uncertainties ATLAS Top.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 3 1 Statistical Methods for Particle Physics Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan RHUL Physics Bayesian methods for HEP / DESY Terascale School page 1 Bayesian statistical methods for HEP Terascale Statistics School DESY, Hamburg.
1 Nuisance parameters and systematic uncertainties Glen Cowan Royal Holloway, University of London IoP Half.
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
G. Cowan Statistical Data Analysis / Stat 4 1 Statistical Data Analysis Stat 4: confidence intervals, limits, discovery London Postgraduate Lectures on.
G. Cowan Lectures on Statistical Data Analysis Lecture 14 page 1 Statistical Data Analysis: Lecture 14 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits Lecture 2: Tests based on likelihood.
G. Cowan Lectures on Statistical Data Analysis Lecture 13 page 1 Statistical Data Analysis: Lecture 13 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 3 1 Statistical Methods for Discovery and Limits Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Discovery Experience: CMS Giovanni Petrucciani (UCSD)
1 Glen Cowan Statistics Forum News Glen Cowan Eilam Gross ATLAS Statistics Forum CERN, 3 December, 2008.
Statistical Decision Theory
Harrison B. Prosper Workshop on Top Physics, Grenoble Bayesian Statistics in Analysis Harrison B. Prosper Florida State University Workshop on Top Physics:
Results of combination Higgs toy combination, within and across experiments, with RooStats Grégory Schott Institute for Experimental Nuclear Physics of.
G. Cowan Lectures on Statistical Data Analysis Lecture 1 page 1 Lectures on Statistical Data Analysis London Postgraduate Lectures on Particle Physics;
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Anjou, France.
Statistics In HEP Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
G. Cowan Statistical techniques for systematics page 1 Statistical techniques for incorporating systematic/theory uncertainties Theory/Experiment Interplay.
Statistical Decision Theory Bayes’ theorem: For discrete events For probability density functions.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination based on event counts (follow-up from 11 May 07) ATLAS Statistics Forum.
G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department.
G. Cowan ATLAS Statistics Forum / Minimum Power for PCL 1 Minimum Power for PCL ATLAS Statistics Forum EVO, 10 June, 2011 Glen Cowan* Physics Department.
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
G. Cowan RHUL Physics LR test to determine number of parameters page 1 Likelihood ratio test to determine best number of parameters ATLAS Statistics Forum.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 41 Statistics for the LHC Lecture 4: Bayesian methods and further topics Academic.
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
G. Cowan Report from the Statistics Forum / CERN, 23 June Report from the Statistics Forum ATLAS Week Physics Plenary CERN, 23 June, 2011 Glen Cowan,
G. Cowan RHUL Physics Input from Statistics Forum for Exotics page 1 Input from Statistics Forum for Exotics ATLAS Exotics Meeting CERN/phone, 22 January,
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Statistical methods for particle physics / Cambridge Recent developments in statistical methods for particle physics HEP Phenomenology.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
G. Cowan RHUL Physics Analysis Strategies Workshop -- Statistics Forum Report page 1 Report from the Statistics Forum ATLAS Analysis Strategies Workshop.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
G. Cowan RHUL Physics Status of Higgs combination page 1 Status of Higgs Combination ATLAS Higgs Meeting CERN/phone, 7 November, 2008 Glen Cowan, RHUL.
G. Cowan CERN Academic Training 2012 / Statistics for HEP / Lecture 41 Statistics for HEP Lecture 4: Unfolding Academic Training Lectures CERN, 2–5 April,
G. Cowan Cargese 2012 / Statistics for HEP / Lecture 21 Statistics for HEP Lecture 2: Discovery and Limits International School Cargèse August 2012 Glen.
G. Cowan Lectures on Statistical Data Analysis Lecture 6 page 1 Statistical Data Analysis: Lecture 6 1Probability, Bayes’ theorem 2Random variables and.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Statistical methods for HEP / Freiburg June 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 2: Discovery.
1 Summary and Conclusions Kyle Cranmer (New York University) Harrison B. Prosper (Florida State University) Sezen Sekmen (CERN) LPCC Workshop: Likelihoods.
G. Cowan St. Andrews 2012 / Statistics for HEP / Lecture 21 Statistics for HEP Lecture 2: Discovery and Limits 69 th SUSSP LHC Physics St. Andrews
G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 21 Statistics for the LHC Lecture 2: Discovery Academic Training Lectures CERN,
G. Cowan SLAC Statistics Meeting / 4-6 June 2012 / Two Developments 1 Two developments in discovery tests: use of weighted Monte Carlo events and an improved.
Practical Statistics for Particle Physicists Lecture 3 Harrison B. Prosper Florida State University European School of High-Energy Physics Parádfürdő,
Discussion on significance
Status of the Higgs to tau tau
Computing and Statistical Data Analysis / Stat 11
Statistics for HEP Lecture 3: More on Discovery and Limits
Grégory Schott Institute for Experimental Nuclear Physics
Statistical Methods for the LHC
Computing and Statistical Data Analysis / Stat 10
Presentation transcript:

PHYSTAT 2011 Summary G. Cowan 1 PHYSTAT 2011 Workshop Summary CERN, January, 2011 Glen Cowan Physics Department Royal Holloway, University of London

PHYSTAT 2011 Summary G. Cowan 2 Outline Frequentist methods Bayesian methods In practice: Tevatron, CMS, ATLAS, LHCb Combining results Look-elsewhere effect Software tools Applications: partons, gravity, astro Banff Challenge 2a Outlook

PHYSTAT 2011 Summary G. Cowan 3 Frequentist methods Order statistics for discovery (D. Cox) Limits, etc. (L. Demortier, D. van Dyk) Likelihood ratio tests (GDC, C. Roever, J. Conway) More on p-values (F. Beaujean)

PHYSTAT 2011 Summary 4 G. Cowan Order statistics for discovery Can HEP use this to look for a bump in a histogram? Need to use modified version where signal smeared over several bins. D. Cox

PHYSTAT 2011 Summary 5 G. Cowan Limits, etc. L. Demortier PCL, CLs, “unconstrained” limits for measurement of X ~ Gaussian(μ)

PHYSTAT 2011 Summary 6 G. Cowan Limits, etc. (2) D. van Dyk Proposed procedure: But with PCL it should be easy to communicate where the limit (bound) is the observed one, and where it is the power threshold.

PHYSTAT 2011 Summary 7 G. Cowan Improving the model: template morphing J. Conway Often shift in nuisance parameters causes complicated change in a distribution – parametrize with e.g. template morphing: Need to do this e.g. to properly include systematics due to jet energy scale, which affects different distributions in different ways.

PHYSTAT 2011 Summary 8 G. Cowan Marginalize vs. maximize J. Conway, C. Roever The point was raised as to whether it is better in some sense to construct a ratio of marginalized or profile likelihoods. Conway, Roever see little difference:

PHYSTAT 2011 Summary 9 G. Cowan

PHYSTAT 2011 Summary 10 G. Cowan

PHYSTAT 2011 Summary 11 G. Cowan One more comment on Asimov... GDC n ~ Poisson(μs+b), median significance, assuming μ = 1, with which one would reject μ = 0.

PHYSTAT 2011 Summary 12 G. Cowan Bayesian methods Reference priors (L. Demortier, J. Bernardo, M. Pierini) Bayes factors (J. Berger) Application to sparse spectra (A. Caldwell)

PHYSTAT 2011 Summary 13 G. Cowan Bayes factors J. Berger E.g. apply to Poisson counting problem: N ~ Poisson(s+b) (Not easy for HEP applications)(1) (2) → →

PHYSTAT 2011 Summary 14 G. Cowan Bayes factors (2) (3)Lower bound on Bayes factor: make π(s) a delta function at J. Berger (4?) Can we not simply plot the Bayes factor vs. s (i.e. report B 0s, not B 01 )? So even lowest Bayes factors substantially bigger than the p-values.

PHYSTAT 2011 Summary 15 G. Cowan Bayes factors (3) “Bayes factor more intuitive than a p-value.” (“5σ” ↔ B 01 = ?) “Automatic” inclusion of look-elsewhere effect (through prior). Can we use e.g. But, marginal likelihoods can be difficult to compute: (K. Cranmer/R. Trotta)

PHYSTAT 2011 Summary 16 G. Cowan K. Cranmer/R. Trotta

PHYSTAT 2011 Summary 17 G. Cowan Reference priors J. Bernardo, L. Demortier, M. Pierini Maximize the expected Kullback–Leibler divergence of posterior relative to prior: This maximizes the expected posterior information about θ when the prior density is π(θ). Finding reference priors “easy” for one parameter:

PHYSTAT 2011 Summary 18 G. Cowan Reference priors (2) J. Bernardo, L. Demortier, M. Pierini Actual recipe to find reference prior nontrivial; see references from Bernardo’s talk, website of Berger ( and also Demortier, Jain, Prosper, PRD 82:33, arXiv: : Prior depends on order of parameters. (Is order dependence important? Symmetrize? Sample result from different orderings?)

PHYSTAT 2011 Summary 19 G. Cowan L. Demortier

PHYSTAT 2011 Summary 20 G. Cowan Bayesian discovery with sparse data A. Caldwell Meaningful elicitation of prior from community consensus, here: Consensus priors doable in practice? (Committee?)

PHYSTAT 2011 Summary 21 G. Cowan More on p-values (model validation) F. Beaujean Suppose we’re only given p-values – how should this influence a Bayesian’s degree of belief? OK but... not same as P(M 0 |data), rather a justification of p-value. Or, justify by saying that a bizarre result prompts one to comment on (and quantify) just how bizarre it is.

PHYSTAT 2011 Summary 22 G. Cowan Decision theory J. Bernardo, D. van Dyk Statistical methods can be formulated in elegant but (for particle physicists) not fully familiar language of decision theory: Specify loss function, minimize expected loss, etc. How does this map onto the usual ways that physicists view discovery/limits? What are specific benefits for HEP of this approach? “It’s a tool to be aware of.” – D. van Dyk

PHYSTAT 2011 Summary 23 G. Cowan Combining results N. Krasnikov, K. Cranmer Given p-values p 1,..., p N of H, what is combined p? Rather, given the results of N (usually independent) experiments, what inferences can one draw from their combination? Full combination is difficult but worth the effort for e.g. combined ATLAS/CMS Higgs search. Form full likelihood function of the joint experiment; Use to construct statistical tests (e.g. likelihood ratio) Single common parameter of interest: μ = σ/σ SM Also (in principle) common nuisance parameters (e.g., luminosity, parton uncerainties,...) New software: RooStats (RooFit/ROOT).

PHYSTAT 2011 Summary 24 G. Cowan Combined ATLAS/CMS Higgs search K. Cranmer Given p-values p 1,..., p N of H, what is combined p? Better, given the results of N (usually independent) experiments, what inferences can one draw from their combination? Full combination is difficult but worth the effort for e.g. Higgs search.

PHYSTAT 2011 Summary 25 G. Cowan Look-elsewhere effect O. Vitells, G. Ranucci, A. Caldwell

PHYSTAT 2011 Summary 26 G. Cowan Look-elsewhere effect (2) O. Vitells Correction to p-value from theory of random fields; related to mean number of “upcrossings” of likelihood ratio (Davies, 1987). estimate with MC at low reference level

PHYSTAT 2011 Summary 27 G. Cowan Look-elsewhere effect (3) O. Vitells Generalization to multiple dimensions: number of upcrossings replaced by expectation of Euler characteristic: Applications: astrophysics (coordinates on sky), search for resonance of unknown mass and width,...

PHYSTAT 2011 Summary 28 G. Cowan Look-elsewhere effect in time series G. Ranucci

PHYSTAT 2011 Summary 29 G. Cowan Bayesian Analysis Toolkit (BAT) S. Pashapour General framework specifically for Bayesian computation, especially MCMC for marginalizing posterior probabilities. E.g. convergence diagnostics à la Gelman & Rubin, Wish list(?): computation of marginal likelihoods; support for important types of priors (reference,...)

PHYSTAT 2011 Summary 30 G. Cowan RooStats G. Schott

PHYSTAT 2011 Summary 31 G. Cowan RooFit Workspaces G. Schott Able to construct full likelihood for combination of channels (or experiments).

PHYSTAT 2011 Summary 32 G. Cowan RooStats G. Schott Example of tools: Bayesian priors from formal rules:

PHYSTAT 2011 Summary 33 G. Cowan Examples of using RooStats V. Zhukov, K. Cranmer E.g. studying effect of correlated systematics in combined fit using various methods with RooStats (V. Zhukov): And of course the previously mentioned ATLAS/CMS Higgs combination (K. Cranmer).

PHYSTAT 2011 Summary 34 G. Cowan Lessons from the Tevatron H. Prosper Example: search for single top with multivariate analysis. Need accurate understanding of background (mature experiments). Would community accept MVA as readily if this were SUSY?

PHYSTAT 2011 Summary 35 G. Cowan Bayesian analysis for cross section of single top H. Prosper

PHYSTAT 2011 Summary 36 G. Cowan Converted to frequentist p-value for discovery of single top H. Prosper

PHYSTAT 2011 Summary 37 G. Cowan In practice: CMS, ATLAS, LHCb A. Harel, D. Casadei, J. Morata Publications from LHC have started to arrive!!!!! A. Harel (CMS) D. Casadei (ATLAS)

PHYSTAT 2011 Summary 38 G. Cowan Example: CMS W’ search A. Harel

PHYSTAT 2011 Summary 39 G. Cowan Example: ATLAS dijet resonance search D. Casadei

PHYSTAT 2011 Summary 40 G. Cowan Example: LHCb search for B s → μμ J. Morata Multivariate methods used: MLP, “Δχ 2 ”, BDT, etc.

PHYSTAT 2011 Summary 41 G. Cowan Applications: parton densities S. Forte Fits to parton densities(e.g., MSTW, CTEQ) have found “reasonable” χ 2, but the standard procedure for errors, Δχ 2 = 1, leads to unrealistically small variation of parton densities. Could be consequence of: inconsistent data; inadequate model, e.g., parametric functions in fit (MSTW): NNPDF approach: parameterize using neural network; much larger number (37×7 = 259) of parameters compared to MSTW (20) or CTEQ (26); regularize using cross validation.

PHYSTAT 2011 Summary 42 G. Cowan Partons (2) S. Forte NNPDF predictions quote uncertainties using “standard” Bayesian propagation of experimental uncertainties as obtained by CTEQ and MSTW when using very large Δχ 2 (50 – 100). I.e. limited parametric form was important source of the “Δχ 2 =?” problem? Still need to address other systematics (e.g. theory erorrs), data compatibility, etc.

PHYSTAT 2011 Summary 43 G. Cowan Applications: gravitational waves C. Roever, S. Sardy Wavelet transforms to search for blips in time series: Regularization to suppress noisy wavelet coefficients, → bias-variance trade-off; mathematics similar to unfolding.

PHYSTAT 2011 Summary 44 G. Cowan Applications: astrophysics O. Lahav Wide variety of problems – mostly Bayesian approach. Parameters can be: fundamental (e.g., curvature of universe); “geographical” (eccentricity of an orbit). Wide use of Bayes factors for model selection. HEP should look at astro community’s tools for computing these (e.g. MultiNest).

PHYSTAT 2011 Summary 45 G. Cowan Bayesian exoplanet search O. Lahav ← priors

PHYSTAT 2011 Summary 46 G. Cowan The Banff Challenge 2a T. Junk Two problems: The winners:

PHYSTAT 2011 Summary 47 G. Cowan Some thoughts on 5σ (van Dyk) D. van Dyk

PHYSTAT 2011 Summary 48 G. Cowan Some more thoughts D. van Dyk Need to develop more ways to insert nuisance parameters into models in clever, physically motivated, controlled ways. And figure out how to constrain these parameters with control measurements; assign meaningful priors to them.

PHYSTAT 2011 Summary 49 G. Cowan Outlook Great progress in methods/software/sophistication since the first “Confidence Limits” workshop at CERN in And many areas where progress still being made: Spurious exclusion when no sensitivity (CLs, PCL, other?) Bayesian priors (reference and otherwise) Look-elsewhere effect (solved?) Software tools (RooStats, BAT) Ways to improve models... The LHC data floodgates are open – we need to continue to improve and develop our analysis methods, taking full advantage of the lessons from the statistics community and other fields.

PHYSTAT 2011 Summary 50 G. Cowan Thanks Thanks to all the speakers for contributing highly interesting talks. Thanks to the non-particle physicists and especially the professional statisticians for sharing their insights and expertise. Thanks to the organisers, Louis, Albert, Michelangelo, for arranging such a stimulating and productive meeting.