Statistical Methods for Data Analysis upper limits examples from real measurements Luca Lista INFN Napoli.

Slides:



Advertisements
Similar presentations
Statistical Methods for Data Analysis Random number generators Luca Lista INFN Napoli.
Advertisements

Experimental Particle Physics PHYS6011 Joel Goldstein, RAL 1.Introduction & Accelerators 2.Particle Interactions and Detectors (2) 3.Collider Experiments.
Higgs physics theory aspects experimental approaches Monika Jurcovicova Department of Nuclear Physics, Comenius University Bratislava H f ~ m f.
Current limits (95% C.L.): LEP direct searches m H > GeV Global fit to precision EW data (excludes direct search results) m H < 157 GeV Latest Tevatron.
27 th March CERN Higgs searches: CL s W. J. Murray RAL.
The elusive agent of electroweak symmetry breaking - an experimentalist point of view - Ulrich Heintz Brown University.
Recent Results on the Possibility of Observing a Standard Model Higgs Boson Decaying to WW (*) Majid Hashemi University of Antwerp, Belgium.
DPF Victor Pavlunin on behalf of the CLEO Collaboration DPF-2006 Results from four CLEO Y (5S) analyses:  Exclusive B s and B Reconstruction at.
Top Physics at the Tevatron Mike Arov (Louisiana Tech University) for D0 and CDF Collaborations 1.
Higgs Searches at LEP2 E. Kneringer University of Innsbruck / Austria Collaboration LAKE LOUISE WINTER INSTITUTE Electroweak Physics February 1999.
Search for B     with SemiExclusive reconstruction C.Cartaro, G. De Nardo, F. Fabozzi, L. Lista Università & INFN - Sezione di Napoli.
Introduction to Single-Top Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) The measurement.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
1 Viktor Veszprémi (Purdue University, CDF Collaboration) SUSY 2005, Durham Search for the SM Higgs Boson at the CDF Experiment Search for the SM Higgs.
Single-Top Cross Section Measurements at ATLAS Patrick Ryan (Michigan State University) Introduction to Single-Top The measurement.
8. Hypotheses 8.2 The Likelihood ratio at work K. Desch – Statistical methods of data analysis SS10.
Jake Anderson, on behalf of CMS Fermilab Semi-leptonic VW production at CMS.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Guglielmo De Nardo Napoli University and INFN 7th Meeting on B Physics, Orsay, France, October 4th 2010.
Discovery Experience: CMS Giovanni Petrucciani (UCSD)
Why do Wouter (and ATLAS) put asymmetric errors on data points ? What is involved in the CLs exclusion method and what do the colours/lines mean ? ATLAS.
H → ZZ →  A promising new channel for high Higgs mass Sara Bolognesi – Torino INFN and University Higgs meeting 23 Sept – CMS Week.
Statistical methods in LHC data analysis part II.2 Luca Lista INFN Napoli.
Study of Higgs boson production in bosonic decay channels at the LHC (including off-shell production) Susumu Oda Kyushu University On behalf of the ATLAS.
1 HEP 2008, Olympia, Greece Ariadni Antonaki Dimitris Fassouliotis Christine Kourkoumelis Konstantinos Nikolopoulos University of Athens Studies for the.
Search for the Higgs Boson Rencontres de Physique de Particules Montpellier May 14, 2012 Dirk Zerwas LAL Orsay Standard Model Higgs (low mass) Digression:
1 Proposal to change limits for exclusion N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay Y.Fang, Y.Pan, H.Wang, Sau Lan Wu University of Wisconsin-Madison.
Sensitivity Prospects for Light Charged Higgs at 7 TeV J.L. Lane, P.S. Miyagawa, U.K. Yang (Manchester) M. Klemetti, C.T. Potter (McGill) P. Mal (Arizona)
Statistical Methods for Data Analysis Introduction to the course Luca Lista INFN Napoli.
1 TTU report “Dijet Resonances” Kazim Gumus and Nural Akchurin Texas Tech University Selda Esen and Robert M. Harris Fermilab Jan. 26, 2006.
Moriond QCD 2001Enrico Piotto EP division CERN SM Higgs Search at LEP in Channels other than 4 Jets Enrico Piotto EP Division, CERN SM Higgs Search at.
G. Cowan RHUL Physics page 1 Status of search procedures for ATLAS ATLAS-CMS Joint Statistics Meeting CERN, 15 October, 2009 Glen Cowan Physics Department.
Higgs Reach Through VBF with ATLAS Bruce Mellado University of Wisconsin-Madison Recontres de Moriond 2004 QCD and High Energy Hadronic Interactions.
1 EPS2003, Aachen Nikos Varelas ELECTROWEAK & HIGGS PHYSICS AT DØ Nikos Varelas University of Illinois at Chicago for the DØ Collaboration
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
Search for the Higgs boson in H  ZZ (*) decay modes on ATLAS German D Carrillo Montoya, Lashkar Kashif University of Wisconsin-Madison On behalf of the.
Easy Limit Statistics Andreas Hoecker CAT Physics, Mar 25, 2011.
Moriond EW 2013BSM Higgs Searches at the Tevatron 1 Beyond the SM scalar boson searches in TeVatron Elemér Nagy CPPM on behalf of the CDF and D0 Collaborations.
Projected Exclusion Limits on the SM Higgs Boson Cross Section by Combining Higgs Channels at LHC Tommaso Dorigo (INFN and University of Padova) for the.
Study of pair-produced doubly charged Higgs bosons with a four muon final state at the CMS detector (CMS NOTE 2006/081, Authors : T.Rommerskirchen and.
1 TOP MASS MEASUREMENT WITH ATLAS A.-I. Etienvre, for the ATLAS Collaboration.
G. Cowan RHUL Physics Status of Higgs combination page 1 Status of Higgs Combination ATLAS Higgs Meeting CERN/phone, 7 November, 2008 Glen Cowan, RHUL.
RECENT RESULTS FROM THE TEVATRON AND LHC Suyong Choi Korea University.
Alexei Safonov (Texas A&M University) For the CDF Collaboration.
Susan Burke DØ/University of Arizona DPF 2006 Measurement of the top pair production cross section at DØ using dilepton and lepton + track events Susan.
Kinematics of Top Decays in the Dilepton and the Lepton + Jets channels: Probing the Top Mass University of Athens - Physics Department Section of Nuclear.
Particle Physics II Chris Parkes Top Quark Discovery Decay Higgs Searches Indirect mW and mt Direct LEP & LHC searches 2 nd Handout.
Search for the Standard Model Higgs in  and  lepton final states P. Grannis, ICHEP 2012 for the DØ Collaboration Tevatron, pp √s = 1.96 TeV -
Jessica Levêque Rencontres de Moriond QCD 2006 Page 1 Measurement of Top Quark Properties at the TeVatron Jessica Levêque University of Arizona on behalf.
La Thuile, March, 15 th, 2003 f Makoto Tomoto ( FNAL ) Prospects for Higgs Searches at DØ Makoto Tomoto Fermi National Accelerator Laboratory (For the.
1 Studies on Exclusion Limits N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay LAL meeting 23/07/2009.
Stano Tokar, slide 1 Top into Dileptons Stano Tokar Comenius University, Bratislava With a kind permissison of the CDF top group Dec 2004 RTN Workshop.
Guglielmo De Nardo for the BABAR collaboration Napoli University and INFN ICHEP 2010, Paris, 23 July 2010.
Search for Invisible Higgs Decays at the ILC Ayumi Yamamoto, Akimasa Ishikawa, Hitoshi Yamamoto (Tohoku University) Keisuke Fujii (KEK)
Single Top Quark Production at D0, L. Li (UC Riverside) EPS 2007, July Liang Li University of California, Riverside On Behalf of the DØ Collaboration.
Viktor Veszpremi Purdue University, CDF Collaboration Tev4LHC Workshop, Oct , Fermilab ZH->vvbb results from CDF.
Search for Standard Model Higgs in ZH  l + l  bb channel at DØ Shaohua Fu Fermilab For the DØ Collaboration DPF 2006, Oct. 29 – Nov. 3 Honolulu, Hawaii.
Investigation on CDF Top Physics Group Ye Li Graduate Student UW - Madison.
Eric COGNERAS LPC Clermont-Ferrand Prospects for Top pair resonance searches in ATLAS Workshop on Top Physics october 2007, Grenoble.
1 Donatella Lucchesi July 22, 2010 Standard Model High Mass Higgs Searches at CDF Donatella Lucchesi For the CDF Collaboration University and INFN of Padova.
Bounds on light higgs in future electron positron colliders
Status of the Higgs to tau tau
Search for a High Mass SM Higgs Boson at the Tevatron
Jessica Leonard Oct. 23, 2006 Physics 835
B  at B-factories Guglielmo De Nardo Universita’ and INFN Napoli
Searches for Higgs Boson at CMS
ATLAS Standard Model Higgs Results
Top mass measurements at the Tevatron and the standard model fits
Measurement of the Single Top Production Cross Section at CDF
Presentation transcript:

Statistical Methods for Data Analysis upper limits examples from real measurements Luca Lista INFN Napoli

A rare process limit using event counting and combination of multiple channels Search for B at BaBar

Upper limits to B at BaBar Reconstruct one B ± with a complete hadronic decay (e + e ϒ (4S)B + B ) Look for a tau decay on other side with missing energy (neutrinos) –Five decay channels used:, e,, 0, Likelihood function: product of Poissonian likelihoods for the five channels Background is known with finite uncertainties from side-band applying scaling factors (taken from simulation) BABAR Collaboration, Phys.Rev.Lett.95:041804,2005, Search for the Rare Leptonic Decay B - - Luca ListaStatistical methods in LHC data analysis3

Combined likelihood Combine the five channels with likelihood ( n ch = 5 ): Define likelihood ratio estimator, as for combined LEP Higgs search: In case the scan of 2lnQ vs s shows a significant minimum, a non-null measurement of s can be determined More discriminating variables may be incorporated in the likelihood definition Luca ListaStatistical methods in LHC data analysis4

Upper limit evaluation Use toy Monte Carlo to generate a large number of counting experiments Evaluate the C.L. for a signal hypothesis defined as the fraction of C.L. for the s+b and b hypotheses: Modified frequentist approach Luca ListaStatistical methods in LHC data analysis5

Including (Gaussian) uncertainties Nuisance parameters are the background yields b i known with some uncertainty from side-band extrapolation Convolve likelihood with a Gaussian PDF (assuming negligible the tails at negative yield values!) –Note: b i is the estimated background, not the true one! … but C.L. evaluated anyway with a frequentist approach (Toy Monte Carlo)! Analytical integrability leads to huge CPU saving! (L.L., A 517 (2004) 360–363) Luca ListaStatistical methods in LHC data analysis6

Analytical expression Simplified analytic Q derivation: Where p n (, ) are polynomials defined with a recursive relation: … but in many cases its hard to be so lucky! Luca ListaStatistical methods in LHC data analysis7

Branching ratio: Ldt = 82 fb -1 Without background uncertainty With background uncertainty Low statistics scenario withoutwith uncertainty withoutwith uncertainty No evidence for a local minumum RooStats::HypoTestInverter Luca ListaStatistical methods in LHC data analysis8

B was eventually measured … and is now part of the PDG Luca ListaStatistical methods in LHC data analysis9

A Bayesian approach to Higgs search with small background Higgs search at LEP-I (L3)

Higgs search at LEP Production via e e HZ * bbl l Higgs candidate mass measured via missing mass to lepton pair Most of the background rejected via kinematic cuts and isolation requirements for the lepton pair Search mainly dominated by statistics A few background events survived selection (first observed in L3 at LEP-I) Luca ListaStatistical methods in LHC data analysis11

First Higgs candidate (m H 70 GeV) Luca ListaStatistical methods in LHC data analysis12

Extended likelihood approach Assume both signal and background are present, with different PDF for mass distribution: Gaussian peak for signal, flat for background (from Monte Carlo samples): Bayesian approach can be used to extract the upper limit, with uniform prior, π(s) = 1 : Luca ListaStatistical methods in LHC data analysis13

Application to Higgs search at L events standard limit LEP-I Luca ListaStatistical methods in LHC data analysis14

Comparison with frequentist C.L. Toy MC can be generated for different signal and background scenarios frequentist coverage (classical CL) can be computed counting the fraction of toy experiments above/below the Bayesian limit Always conservative! Luca ListaStatistical methods in LHC data analysis15

Higgs search at LEP-II Combined search using CLs

Combined Higgs search at LEP-II Extended likelihood definition: = 0 for b only, 1 for s + b hypotheses Likelihood ratio: Luca ListaStatistical methods in LHC data analysis17

CLs PDF plot Luca ListaStatistical methods in LHC data analysis18

Mass scan plot Green: 68% Yellow: 95% Bkg only (sim.) Signal + bkg (sim.) Observed (data) Luca ListaStatistical methods in LHC data analysis19

By experiment & channel Luca ListaStatistical methods in LHC data analysis20

Background hypothesis C.L. Luca ListaStatistical methods in LHC data analysis21

Background C.L. by experiment Luca ListaStatistical methods in LHC data analysis22

Signal hypothesis C.L. Luca ListaStatistical methods in LHC data analysis23

Higgs search at LHC Combined search using CLs Luca ListaStatistical methods in LHC data analysis24

Higgs search at LHC method Agreed method between ATLAS and CLS Test statistics: Has good asymptotic behavior Nuisance parameters are profiled Uncertainties are modeled with log-normal PDFs CLs protects against unphysical limits in cases of large downward background fluctuations Observed and median expected values of CLs limits presented as 68% and 95% belts Luca ListaStatistical methods in LHC data analysis25

Higgs boson production at LHC Decays are favored into heavy particles (top, Z, W, b, …) Most abundant production via gluon fusion and vector-boson fusion Luca ListaStatistical methods in LHC data analysis26

Golden channel: H ZZ 4l (l=e,μ) Mass resolution is ~2-3 GeV Luca ListaStatistical methods in LHC data analysis27

Jets: H ZZ 2l2q (l=e,μ) Luca ListaStatistical methods in LHC data analysis28

H γγ Large background, good resolution Luca ListaStatistical methods in LHC data analysis29

H WW 2l2ν Cant reconstruct Higgs mass due to neutrinos Signal can be discriminated vs background using angular distribution (Higgs boson has spin zero) –Two leptons tend to be aligned in Higgs event A multivariate analysis maximizes sig/bkg separation Boosted Decision Tree Luca ListaStatistical methods in LHC data analysis30

Low mass sensitive channels ττ bb Luca ListaStatistical methods in LHC data analysis31

Combining limits to σ/σSM Phys. Lett. B 710 (2012) 26-48, arXiv: Excluded range: 127.5–600 GeV al 95% CL (expected: GeV) Luca ListaStatistical methods in LHC data analysis32

Exclusion plot at 95% CL Luca ListaStatistical methods in LHC data analysis33

CL s vs Bayesian and asymptotic Luca ListaStatistical methods in LHC data analysis34

What if we use 99%? Luca ListaStatistical methods in LHC data analysis35 Excluded range: 127.5–600 GeV at 95% CL, 129–525 GeV at 99% CL

Cross section measurement ±1σ = excursion of +1 of likelihood from best fit value Luca ListaStatistical methods in LHC data analysis36

Comparing different channels Best fit to σ/σ SM separately in various canals A modest excess is present consistently in all low- mass sensitive channels Luca ListaStatistical methods in LHC data analysis37

Hint or fluctuation? Probability of a bkg fluctuation than the observed one The global significance of the observed maximum excess (minimum local p-value) for the full combination in this mass range is about 2.1σ, estimated using pseudoexperiments Luca ListaStatistical methods in LHC data analysis38

The global significance of the observed maximum excess (minimum local p-value) for the full combination in this mass range is about 2.1σ, estimated using pseudoexperiments Hint or fluctuation? Note once again: p-value is the probability to have at least the observed fluctuation if we have only background, Not Probability to have only background given the observed fluctuation! Note once again: p-value is the probability to have at least the observed fluctuation if we have only background, Not Probability to have only background given the observed fluctuation! Probability of a bkg fluctuation than the observed one Luca ListaStatistical methods in LHC data analysis39

ATLAS: γγ, 4l arXiv: arXiv: Luca ListaStatistical methods in LHC data analysis40

ATLAS: combined limit Local sifnificance: 2.8σ (γγ), 2.1σ (ZZ*4l), 1.4 σ (WW*lνlν) Global significance (LEE) 2.2σ ( GeV) Excluded ranges: (95% CL):110–115.5 GeV, GeV, 129–539GeV (expected: 120–550 GeV) arXiv: ATLAS-CONF Luca ListaStatistical methods in LHC data analysis41

ATLAS cross section Luca ListaStatistical methods in LHC data analysis42

Latest from Tevatron Excluded ranges: GeV, , expected: GeV, GeV Local significance (120 GeV): 2.7σ, global significance (LEE); 2.2σ arXiv: Luca ListaStatistical methods in LHC data analysis43

Latest SM fit Luca ListaStatistical methods in LHC data analysis44

Perspectives for 2012 LHC energy increased from 7 a 8 TeV. ATLAS and CMS are taking data now (+10% in cross section) In 2012 LHC should deliver about 4 times the 2011 integrated luminosity (~20fb 1 ) Higgs boson discovery or exclusion is very likely by 2012 Luca ListaStatistical methods in LHC data analysis45

In conclusion Many recipes and approaches available Bayesian and Frequentist approaches lead to similar results in the easiest cases, but may diverge in frontier cases Be ready to master both approaches! … and remember that Bayesian and Frequentist limits have very different meanings If you want your paper to be approved soon: –Be consistent with your assumptions –Understand the meaning of what you are computing –Try to adopt a popular and consolidated approach (even better, software tools, like RooStats), wherever possible –Debate your preferred statistical technique in a statistics forum, not a physics result publication! Luca ListaStatistical methods in LHC data analysis46