G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department.

Slides:



Advertisements
Similar presentations
Using the Profile Likelihood in Searches for New Physics / PHYSTAT 2011 G. Cowan 1 Using the Profile Likelihood in Searches for New Physics arXiv:
Advertisements

27 th March CERN Higgs searches: CL s W. J. Murray RAL.
G. Cowan Statistics for HEP / LAL Orsay, 3-5 January 2012 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
G. Cowan RHUL Physics Profile likelihood for systematic uncertainties page 1 Use of profile likelihood to determine systematic uncertainties ATLAS Top.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 3 1 Statistical Methods for Particle Physics Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan RHUL Physics Comment on use of LR for limits page 1 Comment on definition of likelihood ratio for limits ATLAS Statistics Forum CERN, 2 September,
G. Cowan Statistical methods for HEP / Birmingham 9 Nov Recent developments in statistical methods for particle physics Particle Physics Seminar.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan / RHUL Discovery Potential for the SM Higgs at ATLAS / La Thuile, 1-7 March Discovery Potential for the Standard Model Higgs at ATLAS La.
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
G. Cowan RHUL Physics Statistical Methods for Particle Physics / 2007 CERN-FNAL HCP School page 1 Statistical Methods for Particle Physics (2) CERN-FNAL.
G. Cowan RHUL Physics Higgs combination note status page 1 Status of Higgs Combination Note ATLAS Statistics/Higgs Meeting Phone, 7 April, 2008 Glen Cowan.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 2 1 Statistical Methods for Particle Physics Lecture 2: Tests based on likelihood ratios.
G. Cowan Statistical Data Analysis / Stat 4 1 Statistical Data Analysis Stat 4: confidence intervals, limits, discovery London Postgraduate Lectures on.
G. Cowan Shandong seminar / 1 September Some Developments in Statistical Methods for Particle Physics Particle Physics Seminar Shandong University.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits Lecture 2: Tests based on likelihood.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan Discovery and limits / DESY, 4-7 October 2011 / Lecture 3 1 Statistical Methods for Discovery and Limits Lecture 3: Limits for Poisson mean: Bayesian.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
G. Cowan Statistics for HEP / NIKHEF, December 2011 / Lecture 4 1 Statistical Methods for Particle Physics Lecture 4: More on discovery and limits.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Discovery Experience: CMS Giovanni Petrucciani (UCSD)
1 Glen Cowan Statistics Forum News Glen Cowan Eilam Gross ATLAS Statistics Forum CERN, 3 December, 2008.
G. Cowan SUSSP65, St Andrews, August 2009 / Statistical Methods 2 page 1 Statistical Methods in Particle Physics Lecture 2: Limits and Discovery.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
Results of combination Higgs toy combination, within and across experiments, with RooStats Grégory Schott Institute for Experimental Nuclear Physics of.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #23.
G. Cowan CLASHEP 2011 / Topics in Statistical Data Analysis / Lecture 21 Topics in Statistical Data Analysis for HEP Lecture 2: Statistical Tests CERN.
G. Cowan Statistical techniques for systematics page 1 Statistical techniques for incorporating systematic/theory uncertainties Theory/Experiment Interplay.
ROOT and statistics tutorial Exercise: Discover the Higgs, part 2 Attilio Andreazza Università di Milano and INFN Caterina Doglioni Université de Genève.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination based on event counts (follow-up from 11 May 07) ATLAS Statistics Forum.
G. Cowan Weizmann Statistics Workshop, 2015 / GDC Lecture 31 Statistical Methods for Particle Physics Lecture 3: asymptotics I; Asimov data set Statistical.
G. Cowan ATLAS Statistics Forum / Minimum Power for PCL 1 Minimum Power for PCL ATLAS Statistics Forum EVO, 10 June, 2011 Glen Cowan* Physics Department.
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
G. Cowan RHUL Physics LR test to determine number of parameters page 1 Likelihood ratio test to determine best number of parameters ATLAS Statistics Forum.
G. Cowan RHUL Physics Input from Statistics Forum for Exotics page 1 Input from Statistics Forum for Exotics ATLAS Exotics Meeting CERN/phone, 22 January,
G. Cowan S0S 2010 / Statistical Tests and Limits 1 Statistical Tests and Limits Lecture 1: general formalism IN2P3 School of Statistics Autrans, France.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
Easy Limit Statistics Andreas Hoecker CAT Physics, Mar 25, 2011.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
Study of pair-produced doubly charged Higgs bosons with a four muon final state at the CMS detector (CMS NOTE 2006/081, Authors : T.Rommerskirchen and.
G. Cowan NExT Workshop, 2015 / GDC Lecture 11 Statistical Methods for Particle Physics Lecture 1: introduction & statistical tests Fifth NExT PhD Workshop:
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
G. Cowan Statistical methods for particle physics / Cambridge Recent developments in statistical methods for particle physics HEP Phenomenology.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
G. Cowan RHUL Physics Analysis Strategies Workshop -- Statistics Forum Report page 1 Report from the Statistics Forum ATLAS Analysis Strategies Workshop.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
G. Cowan Statistical methods for particle physics / Wuppertal Recent developments in statistical methods for particle physics Particle Physics.
G. Cowan RHUL Physics Status of Higgs combination page 1 Status of Higgs Combination ATLAS Higgs Meeting CERN/phone, 7 November, 2008 Glen Cowan, RHUL.
G. Cowan Systematic uncertainties in statistical data analysis page 1 Systematic uncertainties in statistical data analysis for particle physics DESY Seminar.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Statistical methods for HEP / Freiburg June 2011 / Lecture 2 1 Statistical Methods for Discovery and Limits in HEP Experiments Day 2: Discovery.
1 Studies on Exclusion Limits N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay LAL meeting 23/07/2009.
G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.
G. Cowan CERN Academic Training 2010 / Statistics for the LHC / Lecture 21 Statistics for the LHC Lecture 2: Discovery Academic Training Lectures CERN,
G. Cowan SLAC Statistics Meeting / 4-6 June 2012 / Two Developments 1 Two developments in discovery tests: use of weighted Monte Carlo events and an improved.
G. Cowan CERN Academic Training 2012 / Statistics for HEP / Lecture 21 Statistics for HEP Lecture 2: Discovery and Limits Academic Training Lectures CERN,
Discussion on significance
Statistical Methods used for Higgs Boson Searches
Statistics for the LHC Lecture 3: Setting limits
Grégory Schott Institute for Experimental Nuclear Physics
Comment on Event Quality Variables for Multivariate Analyses
TAE 2018 / Statistics Lecture 3
Statistical Methods for the LHC
Statistical Methods for HEP Lecture 3: Discovery and Limits
Application of CLs Method to ATLAS Higgs Searches
Presentation transcript:

G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department Royal Holloway, University of London

G. Cowan / RHUL page 2 Introduction This talk describes (part of) the view from ATLAS with emphasis on searches using profile likelihood-based techniques; using as an example the combination of Higgs channels described in Expected Performance of the ATLAS Experiment: Detector, Trigger and Physics, arXiv: , CERN-OPEN Other methods are also being explored and the view presented here should not be taken as a final word on methodology. Some issues are: Basic formalism, definition of significance Method used for setting limits Look-elsewhere effect

G. Cowan / RHUL page 3 Search formalism Define a test variable whose distribution is sensitive to whether hypothesis is background-only or signal + background. E.g. count n events in signal region: events found expected signal expected background strength parameter  =  s /  s,nominal

G. Cowan / RHUL page 4 Search formalism with multiple bins (channels) Bin i of a given channel has n i events, expectation value is Expected signal and background are:  is global strength parameter, common to all channels.  = 0 means background only,  = 1 is nominal signal hypothesis. b tot,  s,  b are nuisance parameters

G. Cowan / RHUL page 5 Subsidiary measurements for background One may have a subsidiary measurement to constrain the background based on a control region where one expects no signal. In bin i of control histogram find m i events; expectation value is where the u i can be found from MC and  includes parameters related to the background (mainly rate, sometimes also shape). In some measurements there may be no explicit subsidiary measurement but the sidebands around a signal peak effectively play the same role in constraining the background.

G. Cowan / RHUL page 6 Likelihood function For an individual search channel, n i ~ Poisson(  s i +b i ), m i ~ Poisson(u i ). The likelihood is: Parameter of interest Here  represents all nuisance parameters For multiple independent channels there is a likelihood L i ( ,  i ) for each. The full likelihood function is

G. Cowan / RHUL page 7 Systematics "built in" as long as some point in  -space = "truth". Presence of nuisance parameters leads to broadening of the profile likelihood, reflecting the loss of information, and gives appropriately reduced discovery significance, weaker limits.

G. Cowan / RHUL page 8 Modified test statistic for exclusion limits For e.g. data generated with , -2 ln (  ) can come out large for If , then data more compatible with a higher value of . so do not include this in critical region. For upper limit, test hypothesis that strength parameter is ≥ . Upper limit is smallest value of  where this hypothesis can be rejected at significance level less than 1  CL. Critical region of test is region with less compatibility with the hypothesis than the observed

G. Cowan / RHUL page 9 Test statistic for exclusion limits Therefore for exclusion limits, define the test statistic to be critical region Thus distribution of modified q  corresponds to lower branch only of U-shaped plot above. For low , this distribution falls off more quickly than the asymptotic chi-square form and thus gives conservative limit.

G. Cowan / RHUL page 10 p-value / significance of hypothesized  Test hypothesized  by giving p-value, probability to see data with ≤ compatibility with  compared to data observed: Equivalently use significance, Z, defined as equivalent number of sigmas for a Gaussian fluctuation in one direction:

G. Cowan / RHUL page 11 Distribution of q  So to find the p-value we need f(q  |  ). Method 1: generate toy MC experiments with hypothesis , obtain at distribution of q . OK for e.g. ~10 3 or 10 4 experiments, 95% CL limits. But for discovery usually want 5 , p-value =  , so need to generate ~10 8 toy experiments (for every point in param. space). Method 2: Wilk's theorem says that for large enough sample, f(q  |  ) ~ chi-square(1 dof) This is the approach used in the ATLAS Higgs Combination exercise; not yet validated to 5  level. If/when we are fortunate enough to see a signal, then focus MC resources on that point in parameter space.

G. Cowan / RHUL page 12 Example from validation exercise: ZZ (*) → 4l 5  level (One minus) cumulative distributions. Band gives 68% CL limits. ½2½2 ½2½2 ½2½2 ½2½2 Distributions of q 0 for 2, 10 fb  from MC compared to ½  2

G. Cowan / RHUL page 13 Significance from q  If we take f(q  |  ) ~  2 for 1dof, then the significance is simply: For n ~ Poisson (  s+b) with b known, testing  = 0 gives To quantify sensitivity give e.g. expected Z under s+b hypothesis

G. Cowan / RHUL page 14 Sensitivity Discovery: Generate data under s+b (  = 1) hypothesis; Test hypothesis  = 0 → p-value → Z. Exclusion: Generate data under background-only (  = 0) hypothesis; Test hypothesis . If  = 1 has p-value < 0.05 exclude m H at 95% CL. Estimate median significance (sensitivity) either from MC or by using a single data set with observed numbers set equal to the expectation values ("Asimov" data set).

G. Cowan / RHUL page 15 Example of ATLAS Higgs search Combination of Higgs search channels (ATLAS) Expected Performance of the ATLAS Experiment: Detector, Trigger and Physics, arXiv: , CERN-OPEN Standard Model Higgs channels considered: H →   H → WW ( * )  → e   H → ZZ ( * ) → 4l (l = e,  ) H →     → ll, lh Not all channels included for now; final sensitivity will improve. Used profile likelihood method for systematic uncertainties: nuisance parameers for: background rates, signal & background shapes.

G. Cowan / RHUL page 16 Combined discovery significance Discovery signficance (in colour) vs. L, m H : Approximations used here not always accurate for L < 2 fb  but in most cases conservative.

G. Cowan / RHUL page 17 Combined 95% CL exclusion limits  p-value of m H (in colour) vs. L, m H :

G. Cowan / RHUL page 18 Comment on combination software Current ATLAS Higgs combination shows median significances Obtained using median significances from each channel What we will need is the significance one would have from a single (e.g. real) data sample. Requires full likelihood function, global fit → software. Since summer 2008 ATLAS/CMS decision to focus joint statistics software effort in RooStats (based on RooFit, ROOT). Provides facility to construct global likelihood for combination of channels/experiments Emphasis on retaining modularity for validation by swapping in/out different components.

G. Cowan / RHUL page 19 The "look-elsewhere effect" Look for Higgs at many m H values -- probability of seeing a large fluctuation for some m H increased. Combined significance shown here relates to fixed m H. False discovery prob enhanced by ~ mass region explored /  m For H→  and H→WW, studied by allowing m H to float in fit:  → 

G. Cowan / RHUL page 20 Summary / conclusions Current philosophy (ATLAS/CMS) is to encourage a variety of methods, e.g., for limits: classical (PL ratio), CLs, Bayesian,... If the results agree, it's an important check of robustness. If the results disagree, we learn something (~ Cousins) Discussion on look-elsewhere effect still ongoing Derive trials factor from e.g. MC or other considerations Floating mass search D0, CDF, CMS, ATLAS need to compare like with like. Need discussions on e.g. formalism for discovery, limits, combination, treatment of common systematics,…

G. Cowan / RHUL page 21 Extra slides

G. Cowan / RHUL page 22 Fast Fourier Transform method to find distribution; derives n-event distribution from that of single event with FFT. Hu and Nielson, physics/ Solves "5-sigma problem". Used at LEP -- systematics treated by averaging the likelihoods by sampling new values of nuisance parameters for each simulated experiment (integrated rather than profile likelihood). An alternative (in simple cases equivalent) test variable is Comment on "LEP"-style methods

G. Cowan / RHUL page 23 Setting limits: CL s Alternative method (from Alex Read at LEP); exclude  if where This cures the problematic case where the one excludes parameter point where one has no sensitivity (e.g. large mass scale) because of a downwards fluctuation of the background. But there are perhaps other ways to get around this problem, e.g., only exclude if both observed and expected p-value .

G. Cowan / RHUL page 24

G. Cowan / RHUL page 25 Some issues The profile likelihood method "includes" systematics to the extent that for some point in the model's parameter space, the difference from the "truth" is negligible. Q: What if the model is not good enough? A: Improve the model, i.e., include additional flexibility (nuisance parameters). Increased flexibility → decrease in sensitivity. How to achieve optimal balance in a general way is not obvious. Corresponding exercise in Bayesian approach: Include nuisance parameters in model with prior probabilities -- also not obvious in many important cases, e.g., uncertainties in correlations.