Setting Limits in the Presence of Nuisance Parameters Wolfgang A Rolke Angel M López Jan Conrad, CERN.

Slides:



Advertisements
Similar presentations
1 Statistical Methods in HEP, in particular... Nuisance parameters and systematic uncertainties Glen Cowan Royal Holloway, University of London
Advertisements

Jan Conrad (KTH, Sthlm) EPNT September Upper Limits and Discovery in Search for Exotic Physics Jan Conrad Royal Institute of Technology (KTH)
Probability and Statistics Basic concepts II (from a physicist point of view) Benoit CLEMENT – Université J. Fourier / LPSC
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
1 LIMITS Why limits? Methods for upper limits Desirable properties Dealing with systematics Feldman-Cousins Recommendations.
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Statistics In HEP 2 Helge VossHadron Collider Physics Summer School June 8-17, 2011― Statistics in HEP 1 How do we understand/interpret our measurements.
Statistics for Particle Physics: Intervals Roger Barlow Karlsruhe: 12 October 2009.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 10 th Edition.
1 Nuisance parameters and systematic uncertainties Glen Cowan Royal Holloway, University of London IoP Half.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Introduction to Statistics: Chapter 8 Estimation.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan 2011 CERN Summer Student Lectures on Statistics / Lecture 41 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 13 page 1 Statistical Data Analysis: Lecture 13 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
Maximum likelihood (ML)
Business Statistics, A First Course (4e) © 2006 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics, A First Course.
Statistical Analysis of Systematic Errors and Small Signals Reinhard Schwienhorst University of Minnesota 10/26/99.
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 7-1 Chapter 7 Confidence Interval Estimation Statistics for Managers.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
Confidence Interval Estimation
Jan Conrad NuFACT August Some comments on ”What (should) sensitivity estimates mean ?” Jan Conrad Royal Institute of Technology (KTH) Stockholm.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Chap 8-1 Copyright ©2013 Pearson Education, Inc. publishing as Prentice Hall Chapter 8 Confidence Interval Estimation Business Statistics: A First Course.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Page 1 Charles Plager LJ+MET, March 23, 2009 Charles Plager UCLA LJ+MET Meeting March 23, 2008 “Throwing PEs” and More.
A statistical test for point source searches - Aart Heijboer - AWG - Cern june 2002 A statistical test for point source searches Aart Heijboer contents:
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
Basic Business Statistics, 10e © 2006 Prentice-Hall, Inc. Chap 8-1 Confidence Interval Estimation.
Chap 7-1 A Course In Business Statistics, 4th © 2006 Prentice-Hall, Inc. A Course In Business Statistics 4 th Edition Chapter 7 Estimating Population Values.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #25.
Experience from Searches at the Tevatron Harrison B. Prosper Florida State University 18 January, 2011 PHYSTAT 2011 CERN.
Statistics for Managers Using Microsoft Excel, 5e © 2008 Pearson Prentice-Hall, Inc.Chap 8-1 Statistics for Managers Using Microsoft® Excel 5th Edition.
Unfolding in ALICE Jan Fiete Grosse-Oetringhaus, CERN for the ALICE collaboration PHYSTAT 2011 CERN, January 2011.
Business Statistics: A Decision-Making Approach, 6e © 2005 Prentice-Hall, Inc. Chap 7-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan Lectures on Statistical Data Analysis Lecture 4 page 1 Lecture 4 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Paging Area Optimization Based on Interval Estimation in Wireless Personal Communication Networks By Z. Lei, C. U. Saraydar and N. B. Mandayam.
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
In Bayesian theory, a test statistics can be defined by taking the ratio of the Bayes factors for the two hypotheses: The ratio measures the probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 12 page 1 Statistical Data Analysis: Lecture 12 1Probability, Bayes’ theorem 2Random variables and.
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Business Statistics: A First Course 5 th Edition.
Bc → J/   Lifetime measurement Use 6.7 fb -1 and try two independent approaches Selection 1: no dependence of selection on decay time Fit fixes bkgd.
G. Cowan RHUL Physics Statistical Issues for Higgs Search page 1 Statistical Issues for Higgs Search ATLAS Statistics Forum CERN, 16 April, 2007 Glen Cowan.
1 Nuisance parameters and systematic uncertainties Glen Cowan Royal Holloway, University of London IoP Half.
Presentation : “ Maximum Likelihood Estimation” Presented By : Jesu Kiran Spurgen Date :
G. Cowan SLAC Statistics Meeting / 4-6 June 2012 / Two Developments 1 Two developments in discovery tests: use of weighted Monte Carlo events and an improved.
Confidence Limits and Intervals 3: Various other topics Roger Barlow SLUO Lectures on Statistics August 2006.
Status of the Higgs to tau tau
arXiv:physics/ v3 [physics.data-an]
Confidence Interval Estimation
Lecture 4 1 Probability (90 min.)
Incorporating systematic uncertainties into upper limits
Confidence Interval Estimation
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Introduction to Statistics − Day 4
Lecture 4 1 Probability Definition, Bayes’ theorem, probability densities and their properties, catalogue of pdfs, Monte Carlo 2 Statistical tests general.
Statistics for HEP Roger Barlow Manchester University
Presentation transcript:

Setting Limits in the Presence of Nuisance Parameters Wolfgang A Rolke Angel M López Jan Conrad, CERN

The Problem x events in the signal region y events in data sidebands (or from MC), measured with some uncertainty, statistical and systematic z a measurement of the efficiency, measured with some uncertainty, statistical and systematic → How do we set limits on the signal rate?

Previous Solution : Cousins-Highland Basically, integrate out the nuisance parameter. Problem 1: “hidden” Bayesian method – what should be used as a prior? Problem 2: does it work, i.e. does it have coverage? Recent studies (Conrad and Tegenfeldt) suggest some overcoverage.

New Solution – Profile Likelihood Need some notation: x – number of events in signal region y - number of events in data sidebands τ – relative “size” of background region to signal region, so that y/ τ is estimate of background in signal region

m – number of MC events to test efficiency z – number of MC events that survive the cuts So z/m is an estimate of the efficiency Unknown Parameters: μ – signal rate (what we want to know) b – background rate in signal region e – efficiency

Probability Model: X ~ Pois(eμ+b), Y ~ Pois(τb), Z ~ Binom(m,e) Loglikelihood: l(μ,b,e) = (-2)* (xlog(eμ+b)-log(x!)-(eμ+b) + ylog(τb)-log(y!)-τb + log(m!)-log(z!)-log((m-z)!)+zlog(e)+(m-z)log(1-e)) is a function of all parameters Idea: for each μ find b and e which make the observations most likely – profile likelihood

Illustration of Profile Likelihood Case: x=8 y=15 tau=5.0 e=100% (known) mu fixed at 2 → bhat = 3.33

Sometimes this can be done analytically, sometimes (like here) it has to be done numerically. Result: given the data (x,y,z,τ,m) the profile likelihood is a function of μ alone → no more nuisance parameters

One Problem: x<y/τ Then mle of μ < 0 Example: same as before, but x=2, so x-y/τ = % upper limit is 2.45

Even worse: Same as last, but y=35.0 So we expect 7 events just from background, but we only see 2 Note: even if μ=0 this happens only about 5% of the time.

Two ways to handle this: keep y, z, τ, m fixed, find smallest x for which upper limit is greater than 0 → intuitive meaning of “upper limit” “unbounded likelihood method” use constrained likelihood, i.e. require mle ≥ 0 always → uses physical limits on parameters “bounded likelihood method”

Method can deal with other situations: Background and/or Efficiency are known without error Background is Gaussian instead of Poisson: y ~ N(b,σ b ) Efficiency is Gaussian instead of Binomial: z ~ N(e,σ e ) → Allows incorporation of systematic errors

So, does it work? Confidence Intervals work if they have coverage: Fix μ, b, e, σ b, σ e and α Generate y 1,.., y n ~ N(b, σ b ) Generate z 1,.., z n ~ N(e, σ e ) Generate x 1,.., x n ~ Pois(eμ+ b) Find (1-α)100% CI’s (L i,U i ) for i=1,..,n Find percentage p with L i ≤ μ ≤ U i If p ≥ (1-α)100%, we have correct coverage Repeat for many values of μ, b, e, σ b, σ e and α

Example Background – Gaussian with error 0.5 Efficiency – Gaussian with mean 0.85 and error Signal rate varies from 0 to 10 in steps of 0.1 Background rate varies from 0 to 10 in steps of 2 Nominal coverage rate 90% Orange – unbounded likelihood Blue - bounded likelihood

Features of our Method: Always yields positive upper limit Smooth transition from upper limits to two- sided intervals Now available as part of ROOT: TRolke Limits are consistent as errors on nuisance parameters become small:

TRolke Intervals

Isn’t it a marvelous new method ? See F. James, MINUIT Reference Manual, CERN Library Long Writeup D506, p.5: ”The MINOS error for a given parameter is defined as the change in the value of the parameter that causes the F’ to increase by the amount UP, where F’ is the minimum w.r.t to all other free parameters”. Confidence Interval Profile Likelihood (in Χ 2 approximation) ΔΧ 2 = 2.71 (90%), ΔΧ 2 = 1.07 (70 %) NO ! It is a marvelous old method … nobody knew how marvelous though

Summary Profile Likelihood is a general technique for dealing with nuisance parameters It is familiar to physicists as part of MINUIT For the problem of setting limits for rare decays it yields a method with good coverage and some nice properties It is available as part of ROOT The End