DIJET STATUS Kazim Gumus 30 Aug. 2005. Our signal is spread over many bins, and the background varies widely over the bins. If we were to simply sum up.

Slides:



Advertisements
Similar presentations
Probability Distributions CSLU 2850.Lo1 Spring 2008 Cameron McInally Fordham University May contain work from the Creative Commons.
Advertisements

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Is It Interesting? Mike Strauss. Outline Statistics and probability Statistics and probability Experimental uncertainties Experimental uncertainties Particle.
CSC321: 2011 Introduction to Neural Networks and Machine Learning Lecture 10: The Bayesian way to fit models Geoffrey Hinton.
Search for Narrow Resonance Decaying to Muon Pairs in 2.3 fb -1 Chris Hays 1, Ashutosh Kotwal 2, Ye Li 3, Oliver Stelzer-Chilton 1 1 Oxford University.
Slepton Discovery in Cascade Decays Jonathan Eckel, Jessie Otradovec, Michael Ramsey-Musolf, WS, Shufang Su WCLHC Meeting UCSB April
Status Analysis pp -> D s D s0 (2317) Overview Reconstruction Some QA plots Figure of merit First approach/strategy.
1 6 th September 2007 C.P. Ward Sensitivity of ZZ→llνν to Anomalous Couplings Pat Ward University of Cambridge Neutral Triple Gauge Couplings Fit Procedure.
Search for B     with SemiExclusive reconstruction C.Cartaro, G. De Nardo, F. Fabozzi, L. Lista Università & INFN - Sezione di Napoli.
1 24 th September 2007 C.P. Ward Sensitivity of ZZ→llνν to Anomalous Couplings Pat Ward University of Cambridge Neutral Triple Gauge Couplings Fit Procedure.
7/12/2015 Top Pairs Meeting 1 A template fit technique to measure the top quark mass in the l+jets channel Ulrich Heintz, Vivek Parihar.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
INFERENTIAL STATISTICS – Samples are only estimates of the population – Sample statistics will be slightly off from the true values of its population’s.
Statistical aspects of Higgs analyses W. Verkerke (NIKHEF)
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Low-x Wshop, , Prague Jan Hladký, Exotic Anti-Charm Baryon State 1 Evidence for a Narrow Exotic Anti-Charmed Baryon State *) Jan Hladký, Institute.
1 Introduction to Dijet Resonance Search Exercise John Paul Chou, Eva Halkiadakis, Robert Harris, Kalanand Mishra and Jason St. John CMS Data Analysis.
 Candidate events are selected by reconstructing a D, called a tag, in several hadronic modes  Then we reconstruct the semileptonic decay in the system.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #23.
Astrophysics working group - CERN March, 2004 Point source searches, Aart Heijboer 1 Point Source Searches with ANTARES Outline: reconstruction news event.
STA Lecture 181 STA 291 Lecture 18 Exam II Next Tuesday 5-7pm Memorial Hall (Same place) Makeup Exam 7:15pm – 9:15pm Location TBA.
Lab 3b: Distribution of the mean
A taste of statistics Normal error (Gaussian) distribution  most important in statistical analysis of data, describes the distribution of random observations.
A statistical test for point source searches - Aart Heijboer - AWG - Cern june 2002 A statistical test for point source searches Aart Heijboer contents:
1 HEP 2008, Olympia, Greece Ariadni Antonaki Dimitris Fassouliotis Christine Kourkoumelis Konstantinos Nikolopoulos University of Athens Studies for the.
1 Proposal to change limits for exclusion N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay Y.Fang, Y.Pan, H.Wang, Sau Lan Wu University of Wisconsin-Madison.
Sensitivity Prospects for Light Charged Higgs at 7 TeV J.L. Lane, P.S. Miyagawa, U.K. Yang (Manchester) M. Klemetti, C.T. Potter (McGill) P. Mal (Arizona)
Example: Bioassay experiment Problem statement –Observations: At each level of dose, 5 animals are tested, and number of death are observed.
1 2 nd Pre-Lab Quiz 3 rd Pre-Lab Quiz 4 th Pre-Lab Quiz.
1 TTU report “Dijet Resonances” Kazim Gumus and Nural Akchurin Texas Tech University Selda Esen and Robert M. Harris Fermilab Jan. 26, 2006.
Radiation Detection and Measurement, JU, First Semester, (Saed Dababneh). 1 Counting Statistics and Error Prediction Poisson Distribution ( p.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #25.
A. Smith UMD, April 20-21, 2007 Milagro collaboration meeting The Hidden Potential of the “Gamma’’ Data. The Milagro Gamma-Hadron separation is bases on.
G. Cowan, RHUL Physics Discussion on significance page 1 Discussion on significance ATLAS Statistics Forum CERN/Phone, 2 December, 2009 Glen Cowan Physics.
Radiation Detection and Measurement, JU, 1st Semester, (Saed Dababneh). 1 Radioactive decay is a random process. Fluctuations. Characterization.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
Itamar Roth, Ehud Duchovni Group meeting 19/01/15 1.
Study of pair-produced doubly charged Higgs bosons with a four muon final state at the CMS detector (CMS NOTE 2006/081, Authors : T.Rommerskirchen and.
A bin-free Extended Maximum Likelihood Fit + Feldman-Cousins error analysis Peter Litchfield  A bin free Extended Maximum Likelihood method of fitting.
ES 07 These slides can be found at optimized for Windows)
Mitchell Naisbit University of Manchester A study of the decay using the BaBar detector Mitchell Naisbit – Elba.
2005 Unbinned Point Source Analysis Update Jim Braun IceCube Fall 2006 Collaboration Meeting.
Sampling Distribution (a.k.a. “Distribution of Sample Outcomes”) – Based on the laws of probability – “OUTCOMES” = proportions, means, test statistics.
06/2006I.Larin PrimEx Collaboration meeting  0 analysis.
1 CMS Sensitivity to Quark Contact Interactions with Dijets Selda Esen (Brown) Robert M. Harris (Fermilab) DPF Meeting Nov 1, 2006.
1 Studies on Exclusion Limits N.Andari, L. Fayard, M.Kado, F.Polci LAL- Orsay LAL meeting 23/07/2009.
CHAPTER 4 ESTIMATES OF MEAN AND ERRORS. 4.1 METHOD OF LEAST SQUARES I n Chapter 2 we defined the mean  of the parent distribution and noted that the.
I'm concerned that the OS requirement for the signal is inefficient as the charge of the TeV scale leptons can be easily mis-assigned. As a result we do.
Investigation on CDF Top Physics Group Ye Li Graduate Student UW - Madison.
1 Dijet Resonances Kazım Z. Gümüş and Nural Akchurin Texas Tech University Selda Esen and Robert M. Harris Fermilab PRS JETMET Meeting July 6, 2005.
Sensitivity Study on Top Quark Pair Resonance Search
Discussion on significance
Status of the Higgs to tau tau
Statistical Significance & Its Systematic Uncertainties
Confidence Intervals and Limits
Two Interpretations of What it Means to Normalize the Low Energy Monte Carlo Events to the Low Energy Data Atms MC Atms MC Data Data Signal Signal Apply.
Measurement of inclusive branching fraction for y(3686)fX
Discrete Event Simulation - 4
Using Single Photons for WIMP Searches at the ILC
Parametric Methods Berlin Chen, 2005 References:
Search for Narrow Resonance Decaying to Muon Pairs in 2.3 fb-1
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Dilepton Mass. Progress report.
Why do Wouter (and ATLAS) put asymmetric errors on data points ?
CHAPTER – 1.2 UNCERTAINTIES IN MEASUREMENTS.
Presentation transcript:

DIJET STATUS Kazim Gumus 30 Aug. 2005

Our signal is spread over many bins, and the background varies widely over the bins. If we were to simply sum up all the bins in our QCD background and compare with all the bins in our Z' signal, then the lowest mass bin where the QCD background is huge would dominate, and we would get a ridiculously wrong result. That is why, we use the binned maximum likelihood method, which takes into account correctly the relative probabilities in each of the bins. Signal and QCD Crossection

Binned maximum likelihood method N i : Observed events (Assume that it comes from a mass resonance sinal sitting on a large QCD background) Nsignal i : Predicted signal events in the i'th mass bin Nqcd i : Predicted qcd background events in the i'th mass bin Let the signal to be multiplied by an unknown parameter alpha and to be add in the background to obtain the predicted number of events in the i'th mass bin Mu i =alpha * Nsignal i + Nqcd i The prob, of Pi observing n i events when Mu i are predicted is given by Poisson statistics P i = Mu i * exp( - Mu i ) / factorial( n i ) The product of Pi over all bins in the mass spectrum is the likelihood function L for seeing the observation given the prediction L = P 1 * P 2 *.....P n Minimize the -ln (L) to find the best fit value of the signal normalization parameter alpha.

Fit to data and Residual Plot This plot shows the goodness of the fit

Two approaches to calculate 95 C.L and 5 Sigma Cross sections 1. Data=qcd (no signal) or Data=qcdfit (no signal) 95 C.L. exclusion 5 sigma exclusion 2. Data=qcd+signal or Data=qcdfit+signal 5 sigma discovery

Approach I Likelihoods for 95 C.L and 5 Sigma Exclusion

700 GeV Likelihood (no signal) Data=qcdfitData=qcd We are constructing the likelihood for our smoothed data sample resulting from a QCD background and a resonance signal as a function of the resonance signal cross section.

Interpretation of Likelihood Distributions If the peak position of the likelihood plot is zero, then we have a perfect fit to the data. If the peak position of likelihood plot for a given mass is different than zero, it means that the fit is not "perfect" at the related mass. The fit may still be within expected values, since data fluctuates up and down with statistics. We shouldn't expect the fit to be perfect with real data. Because we should expect some bins to fluctuate high, and others to fluctuate low, with a Gaussian distribution of fluctuations with width determined by the number of events in the data.

2TeV Likelihood (no signal) Data=qcdData=qcdfit

5TeV Likelihood (no signal) Data=qcd Data=qcdfit

Why We Should Use Perfect Fit Likelihoods It is conceptually more correct, because we are finding what our sensitivity is "on average" by getting rid of bin-to-bin statistical fluctuations. It is like repeating the default likelihood for a thousand different experiments and taking the average of them all: That is what we want to know, how sensitive we are on average, not how sensitive we are for this particular data sample with its own statistics.

Method for Calculating 95 CL Exclusion To find the 95% C.L cross section for a dijet resonance, find the point on the likelihood distribution, alpha95, such that the integral of the likelihood from alpha95 to infinity is Integrate the likelihood out to some suitably large alpha value, alpha_end, which is large enough for our purposes to be considered infinite: for example one for which the likelihood has dropped to 10^-9 of its peak value. Use that integral to normalize your likelihood to unit area. Loop over increasing alpha, integrate the normalized likelihood from alpha to alpha_end. Find the first integral that is less than 0.05(closest alpha value). Multiply alpha95 with the total cross section of signal.

Data defaultqcd=qcdfit 700GeV TeV TeV C.L Sigma Exclusion Results If the data fluctuates beneath the fit, then alpha95(data) < alpha95 (qcd=qcdfit) If the data fluctuates above the fit, then alpha95(data) > alpha95 (qcd=qcdfit) alpha_95

Comments on 95% C.L. Exclusion The likelihood works fine for 95% CL exclusions of a signal, because we are estimating the probability that a high signal could have fluctuated down into the data we are observing, giving no observed signal. 95% CL exclusion says that “In 19 out of 20 cases the signal would not produce data like we observe, but that in 1 out of 20 cases it would accidentally fluctuate down to give an observation compatible with our data”.

5 Sigma Exclusion Cross Section To find the 5 sigma exclusion cross section for a dijet resonance, find the point on the likelihood distribution, alpha_5sigma, such that the integral of the likelihood from alpha_5sigma to infinity is 2.86 x 10^-7. The area of a Gaussian probability distribution from +5 sigma to infinity is 2.86 x10^-7. Integrate the likelihood out to some suitably large alpha value, alpha_end, which is large enough for our purposes to be considered infinite: for example one for which the likelihood has dropped to 10^-9 of its peak value. Use that integral to normalize your likelihood to unit area. Loop over increasing alpha, integrate the normalized likelihood from alpha to alpha_end. Find the first integral that is less than 2.86 x 10^-7 (closest alpha value). We are integrating the tail of the likelihood and searching for a probability x 10^-7 = This avoids problems with knowing the integral to an accuracy of 1 part in 10 million.

Data defaultqcd=qcdfit 700GeV TeV TeV Sigma Exclusion Results alpha_sigma5

95% C.L. and 5 Sigma Exclusions

Approach II Likelihoods for 5 Sigma Discovery

What is a 5 sigma discovery? In the simplest scenario, we would be just counting events and a 5 sigma discovery is an observation that is 5 sigma above the background level. n b = number of background events n s = number of signal events Here the sqrt(n b ) is 1 sigma on the expected number of events, and we observe a 5 sigma fluctuation,

5 Sigma Discovery Cross Section Calculation Method Form a new distribution of "signal + background" using this 5 sigma signal. The background should be from the perfect QCD fit. Set data = "signal + background", and find the likelihood distribution for this data. The likelihood distribution will show a Gaussian peak at some value of alpha well separated from zero. Measure the mean and the RMS of this likelihood distribution and the significance of this signal, s = mean/RMS. s should be very close to 5. If s is 5, then you have found the 5 sigma discovery cross section, if s is not 5 then re-estimate the 5 sigma cross section as "new signal cross section = old signal cross section * 5/s and repeat steps 2) - 4).

700 GeV Likelihood (data contains signal) alpha_5sigma= 33.4 mean= rms= mean/rms= alpha_5sigma= 33.4 * 5 / = mean= rms= mean/rms=5.0049

2TeV Likelihood (data contains signal) alpha_5sigma= 7.4 mean= rms= mean/rms= alpha_5sigma= 7.4 * 5 / = mean= rms= mean/rms=

5TeV Likelihood (data contains signal) alpha_5sigma= mean= rms= mean/rms= alpha_5sigma= * 5 / = mean= rms= mean/rms=

Dataalpha _ 5 sigma 700GeV TeV TeV Sigma Discovery Results

95% C.L. and 5 Sigma Discovery

Conclusions When there is a lot of data; the 5 sigma exclusions, from the NO SIGNAL likelihood, are at about the same cross section as the 5 sigma discovery cross section from the SIGNAL likelihood. This demonstrates that we have a fairly good understanding of the likelihood distributions. We can make the cross section limits for new particles.