CMS RooStats Higgs Combination Package

Slides:



Advertisements
Similar presentations
Probability and Statistics Basic concepts II (from a physicist point of view) Benoit CLEMENT – Université J. Fourier / LPSC
Advertisements

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Some answers to the questions/comments at the Heavy/Light presentation of March 17, Outline : 1) Addition of the id quadrant cut (slides 2-3); 2)
8. Hypotheses 8.4 Two more things K. Desch – Statistical methods of data analysis SS10 Inclusion of systematic errors LHR methods needs a prediction (from.
RooStatsCms: a tool for analyses modelling, combination and statistical studies D. Piparo, G. Schott, G. Quast Institut für Experimentelle Kernphysik Universität.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
8. Hypotheses 8.2 The Likelihood ratio at work K. Desch – Statistical methods of data analysis SS10.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
LINEAR REGRESSION Introduction Section 0 Lecture 1 Slide 1 Lecture 5 Slide 1 INTRODUCTION TO Modern Physics PHYX 2710 Fall 2004 Intermediate 3870 Fall.
Discovery Experience: CMS Giovanni Petrucciani (UCSD)
Carolina Environmental Program UNC Chapel Hill The Analysis Engine – A New Tool for Model Evaluation, Sensitivity and Uncertainty Analysis, and more… Alison.
RooFit – Open issues W. Verkerke. Datasets Current class structure Data representation –RooAbsData (abstract base class) –RooDataSet (unbinned [weighted]
A Neural Network MonteCarlo approach to nucleon Form Factors parametrization Paris, ° CLAS12 Europen Workshop In collaboration with: A. Bacchetta.
The WinMine Toolkit Max Chickering. Build Statistical Models From Data Dependency Networks Bayesian Networks Local Distributions –Trees Multinomial /
Unfolding jet multiplicity and leading jet p T spectra in jet production in association with W and Z Bosons Christos Lazaridis University of Wisconsin-Madison.
Page 1 Charles Plager LJ+MET, March 23, 2009 Charles Plager UCLA LJ+MET Meeting March 23, 2008 “Throwing PEs” and More.
Results of combination Higgs toy combination, within and across experiments, with RooStats Grégory Schott Institute for Experimental Nuclear Physics of.
G. Cowan Statistical techniques for systematics page 1 Statistical techniques for incorporating systematic/theory uncertainties Theory/Experiment Interplay.
ROOT and statistics tutorial Exercise: Discover the Higgs, part 2 Attilio Andreazza Università di Milano and INFN Caterina Doglioni Université de Genève.
G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department.
Easy Limit Statistics Andreas Hoecker CAT Physics, Mar 25, 2011.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan, RHUL Physics Statistics for early physics page 1 Statistics jump-start for early physics ATLAS Statistics Forum EVO/Phone, 4 May, 2010 Glen Cowan.
The KOSMOSHOWS What is it ? The statistic inside What it can do ? Future development Demonstration A. Tilquin (CPPM)
Analysis experience at GSIAF Marian Ivanov. HEP data analysis ● Typical HEP data analysis (physic analysis, calibration, alignment) and any statistical.
BlueFin Best Linear Unbiased Estimate Fisher Information aNalysis Andrea Valassi (IT-SDC) based on the work done with Roberto Chierici TOPLHCWG meeting.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Conditional Observables Joe Tuggle BaBar ROOT/RooFit Workshop December 2007.
Systematics in Hfitter. Reminder: profiling nuisance parameters Likelihood ratio is the most powerful discriminant between 2 hypotheses What if the hypotheses.
Charm Mixing and D Dalitz analysis at BESIII SUN Shengsen Institute of High Energy Physics, Beijing (for BESIII Collaboration) 37 th International Conference.
1 Comparing Unbinned likelihood methods IceCube/Antares Common Point source analysis from IC22 & Antares 2007/2008 data sets J. Brunner.
Hands-on exercises *. Getting started – ROOT 5.25/02 setup Start a ROOT 5.25/02 session –On your local laptop installation, or –On lxplus (SLC4) or lx64slc5.
S. Ferrag, G. Steele University of Glasgow. RooStats and MClimit comparison Exercise to use RooStats by an MClimit-formatted person: – Use two programs.
Search for Standard Model Higgs in ZH  l + l  bb channel at DØ Shaohua Fu Fermilab For the DØ Collaboration DPF 2006, Oct. 29 – Nov. 3 Honolulu, Hawaii.
Analysis Tools interface - configuration Wouter Verkerke Wouter Verkerke, NIKHEF 1.
Studies of Systematics for Dark Matter Observations John Carr 1.
Max Baak (CERN) 1 Summary of experiences with HistFactory and RooStats Max Baak (CERN) (on behalf of list of people) RooFit / RooStats meeting January.
23 Jan 2012 Background shape estimates using sidebands Paul Dauncey G. Davies, D. Futyan, J. Hays, M. Jarvis, M. Kenzie, C. Seez, J. Virdee, N. Wardle.
1 A M Cooper-Sarkar University of Oxford ICHEP 2014, Valencia.
Supersymmetry searches in ATLAS and CMS in hadronic final states Sascha Caron (Nikhef and Radboud University Nijmegen)
Using RooFit/RooStat in rare decay searches Serra, Storaci, Tuning Rare Decays WG.
Confidence Limits and Intervals 3: Various other topics Roger Barlow SLUO Lectures on Statistics August 2006.
Hands-on Session RooStats and TMVA Exercises
(Day 3).
Status of the Higgs to tau tau
arXiv:physics/ v3 [physics.data-an]
The expected confident intervals for triple gauge coupling parameter
Statistical Methods used for Higgs Boson Searches
A Common Partial Wave Analysis
Ex1: Event Generation (Binomial Distribution)
Search for b → u transitions in B+ → {Kpp0}DK+
Grégory Schott Institute for Experimental Nuclear Physics
Analyzing Redistribution Matrix with Wavelet
Data Analysis in Particle Physics
p0 life time analysis: general method, updates and preliminary result
° status report analysis details: overview; “where we are”; plans: before finalizing result.. I.Larin 02/13/2009.
Where did we stop? The Bayes decision rule guarantees an optimal classification… … But it requires the knowledge of P(ci|x) (or p(x|ci) and P(ci)) We.
Eviews Tutorial for Labor Economics Lei Lei
Application of CLs Method to ATLAS Higgs Searches
Top mass measurements at the Tevatron and the standard model fits
Greg Heath University of Bristol
Dilepton Mass. Progress report.
A brief update on b-tagging of High P jets
Measurement of the Single Top Production Cross Section at CDF
° status report analysis details: overview; “where we are”; plans: before finalizing result.. I.Larin 02/13/2009.
Presentation transcript:

CMS RooStats Higgs Combination Package Giovanni Petrucciani (UCSD) 17.05.2011

the combination package A package that builds an executable that can be run to compute limits/signif. combine input –M method [options] Input can be a text “datacard” or an arbitrary RooStats Workspace saved in a ROOT file. Internally, it has two main components text2workspace: A python program that reads datacards and creates a RooStats model The part that deals with the statistical methods 17.05.2011

Datacards Started from a simple format for counting experiments: a text table with observed events and expected yields in each channel. Progressively enhanced it: Support for more pdfs for systematic uncertainties (e.g. Gammas, asymm errors, ...) Use of shapes instead of just counting: Binned shapes: plain ROOT histograms Arbitrary shapes (RooAbsPdf) RooDataHist and RooDataSet (or just TTrees) 17.05.2011

Simple datacards example # one channel, we observe 0 events bin 1 observation 0 ------------ # expected events for signal and backgrounds bin 1 1 process ggh4G Bckg process 0 1 rate 4.76 1.47 deltaS lnN 1.20 - 20% unc. on signal deltaB lnN - 1.50 50% unc. on bkg. 17.05.2011

Complex counting experiment bin e_tau mu_tau e_mu observation 517 540 101 ------------------------------------------------------ bin e_tau e_tau e_tau mu_tau mu_tau mu_tau e_mu e_mu e_mu process higgs ZTT QCD higgs ZTT QCD higgs ZTT other process 0 1 2 0 1 2 0 1 2 rate 0.34 190 327 0.57 329 259 0.15 88 14 lumi lnN 1.11 - - 1.11 - - 1.11 - 1.11 tauid lnN 1.23 1.23 - 1.23 1.23 - - - - ZtoLL lnN - 1.04 - - 1.04 - - 1.04 - effic lnN 1.04 1.04 - 1.04 1.04 - 1.04 1.04 1.04 QCDel lnN - - 1.20 - - - - - - QCDmu lnN - - - - - 1.10 - - - other lnN - - - - - - - - 1.1 Multiple channels Treating of individ. backgrounds Correlated effects of systematics Systematics with names, to allow combination of datacards 17.05.2011

From Counting to Shapes For number countings, the Likelihood in each channel is constructed as sum over all the contributing physics processes N(exp) = sum( N(exp, proc), … ) pdf = Poisson(N(obs) | N(exp)) full pdf = PROD(pdf(channel1), pdf(channel2), …) The extension to shapes is trivial pdf = RooAddPdf( N(exp, proc) * shape(proc), … ) full pdf = RooSimultaneous( pdf1, pdf2, … ) 17.05.2011

Shape Uncertainties (1) Vertical Morphing: Using linear or quadratic interpolation between 3 templates. Gaussian contraint on interpolation parameter. Works on any pdf: histo, keys, parametric Gives wrong results if the deformation is large (e.g. for a gaussian if the uncertainty on the mean is comparable with the peak width) 17.05.2011

Shape Uncertainties (2) Simple parametric uncertainties: Shape parameter used as nuisance, with a constraint (Gaussian or Bifurcated Gaussian Relies on the user to select a physics-oriented parametrization in which nuisances are uncorrelated. 17.05.2011

Shape uncertainties (3) Parametric interpolation: Templates are created, e.g. from different MCs Same parametrization is used to fit all templates, and save two additional RooArgSets of parameters for each systematic Shapes are obtained interpolating linearly the parameters between the RooArgSets (still under developed, not yet there) 17.05.2011

Combined Datasets If all channels are counting experiments, make a RooDataSet with one column per channel If all channels have all inputs as TH1s (data and templates), convert into RooDataHist all on the same dummy variable. Histos for channels that have less bins get padded with empty bins. If at least one channel has a RooDataSet dataset, try to make a combined RooDataSet Otherwise try to make a combined RooDataHist 17.05.2011

Statistical tools proper What we do on top of RooStats: Configure RooStats components through command-line args and sensible defaults Extend functionalities of some tools, or provide workarounds to bugs for which the fix is not in yet in CMSSW’s build of RooStats Provide other common infrastructure, e.g. Running toys to get expected limits Saving results to root files 17.05.2011

Zoo of statistical methods Asymptotic limits and significances from profiled likelihood Bayesian limits Various flavours of Hybrid Frequentist-Bayesian limits (“CLs”, ...) and significances Feldman-Cousins bands Computing both observed limits and expected limits with 68%/95% bands by running toys. 17.05.2011

Other tools Other pieces: Tools to combine datacards Tools to run the combination across the GRID using CRAB (the CMS grid wrapper). Plotting tools: basic functionality exists, but something better is under work 17.05.2011

What’s higgs specific? Datacard format: generic “excess of signal on top of some background” problem. Statistical tools even more generic than this. Main limitations: Most things work only when setting a limit in a single parameter of interest Parameter should behave like a cross section (positive definite; zero = no signal) Limitations driven by lack of need for something even more general; can be overcome if needed. 17.05.2011

CMS dependencies text2workspace: combination tool: AsymPow for asymmetric log-normals VerticalInterpPdf for vertical morphing combination tool: hypo test inversion with new HybridCalculator optimized TestStatistics classes some helper functions e.g. to factorize pdfs 17.05.2011

External dependencies text2workspace: python libraries (re, optparse) + PyROOT combination tool: c++ std library (io, stl, auto_ptr, exceptions) boost::program_options used to parse command-line args and configure methods boost::filesystem to do a “rm –r” of demporary dir some system-level stuff (e.g. fork, tempnam) 17.05.2011