8th December 2004Tim Adye1 Proposal for a general-purpose unfolding framework in ROOT Tim Adye Rutherford Appleton Laboratory BaBar Statistics Working.

Slides:



Advertisements
Similar presentations
Transverse momentum of Z bosons in Zee and Zmm decays Daniel Beecher 12 December 2005.
Advertisements

1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
N.D.GagunashviliUniversity of Akureyri, Iceland Pearson´s χ 2 Test Modifications for Comparison of Unweighted and Weighted Histograms and Two Weighted.
RooUnfold unfolding framework and algorithms
A Bayesian Analysis of Parton Distribution Uncertainties Clare Quarman Atlas UK Physics meeting – UCL 15 th Dec 2003.
JET CHARGE AT LHC Scott Lieberman New Jersey Institute of Technology TUNL REU 2013 Duke High Energy Physics Group Working with: Professor Ayana Arce Dave.
G. Cowan RHUL Physics Profile likelihood for systematic uncertainties page 1 Use of profile likelihood to determine systematic uncertainties ATLAS Top.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Slide 1 Statistics for HEP Roger Barlow Manchester University Lecture 3: Estimation.
Statistical Image Modelling and Particle Physics Comments on talk by D.M. Titterington Glen Cowan RHUL Physics PHYSTAT05 Glen Cowan Royal Holloway, University.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
7. Least squares 7.1 Method of least squares K. Desch – Statistical methods of data analysis SS10 Another important method to estimate parameters Connection.
Chi Square Distribution (c2) and Least Squares Fitting
Basics of regression analysis
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Linear and generalised linear models Purpose of linear models Least-squares solution for linear models Analysis of diagnostics Exponential family and generalised.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
G. Cowan RHUL Physics Bayesian Higgs combination page 1 Bayesian Higgs combination using shapes ATLAS Statistics Meeting CERN, 19 December, 2007 Glen Cowan.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
Objectives of Multiple Regression
Sung-Won Lee 1 Study of Jets Production Association with a Z boson in pp Collision at 7 and 8 TeV with the CMS Detector Kittikul Kovitanggoon Ph. D. Thesis.
Unfolding algorithms and tests using RooUnfold
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
Generalized Minimum Bias Models
R. Kass/W03P416/Lecture 7 1 Lecture 7 Some Advanced Topics using Propagation of Errors and Least Squares Fitting Error on the mean (review from Lecture.
Unfolding jet multiplicity and leading jet p T spectra in jet production in association with W and Z Bosons Christos Lazaridis University of Wisconsin-Madison.
Lab 3b: Distribution of the mean
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
RooUnfold unfolding framework and algorithms Tim Adye Rutherford Appleton Laboratory ATLAS RAL Physics Meeting 20 th May 2008.
RooUnfold unfolding framework and algorithms Tim Adye Rutherford Appleton Laboratory Oxford ATLAS Group Meeting 13 th May 2008.
1 Iterative dynamically stabilized (IDS) method of data unfolding (*) (*arXiv: ) Bogdan MALAESCU CERN PHYSTAT 2011 Workshop on unfolding.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
FSI and Mw(qqqq) 1 FSI and Mw(qqqq) Marie Legendre, Djamel Boumediene, Patrice Perez, Oliver Buchmüller … an alternative approach … PFCUT and PCUT update.
V0 analytical selection Marian Ivanov, Alexander Kalweit.
Unfolding in ALICE Jan Fiete Grosse-Oetringhaus, CERN for the ALICE collaboration PHYSTAT 2011 CERN, January 2011.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
JET CHARGE AT LHC Scott Lieberman New Jersey Institute of Technology TUNL REU 2013 Duke High Energy Physics Group Working with: Professor Ayana Arce Dave.
Beam Extrapolation Fit Peter Litchfield  An update on the method I described at the September meeting  Objective;  To fit all data, nc and cc combined,
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
A bin-free Extended Maximum Likelihood Fit + Feldman-Cousins error analysis Peter Litchfield  A bin free Extended Maximum Likelihood method of fitting.
 Measurement of  x E  (Fig. 4) Assorted correlations between a fixed high-p T trigger hadron (  p Ttrig  =4.7GeV/c) and lower p T associated hadrons.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
Mitchell Naisbit University of Manchester A study of the decay using the BaBar detector Mitchell Naisbit – Elba.
From the population to the sample The sampling distribution FETP India.
Spectrum Reconstruction of Atmospheric Neutrinos with Unfolding Techniques Juande Zornoza UW Madison.
1 Introduction to Statistics − Day 2 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
G. Cowan CERN Academic Training 2012 / Statistics for HEP / Lecture 41 Statistics for HEP Lecture 4: Unfolding Academic Training Lectures CERN, 2–5 April,
October 2011 David Toback, Texas A&M University Research Topics Seminar1 David Toback Texas A&M University For the CDF Collaboration CIPANP, June 2012.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1 Statistical Data Analysis: Lecture 5 1Probability, Bayes’ theorem 2Random variables and.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Status of QEL Analysis ● QEL-like Event Selection and Sample ● ND Flux Extraction ● Fitting for MINOS Collaboration Meeting FNAL, 7 th -10 th December.
Richard Kass/F02P416 Lecture 6 1 Lecture 6 Chi Square Distribution (  2 ) and Least Squares Fitting Chi Square Distribution (  2 ) (See Taylor Ch 8,
Computacion Inteligente Least-Square Methods for System Identification.
Tau31 Tracking Efficiency at BaBar Ian Nugent UNIVERSITY OF VICTORIA Sept 2005 Outline Introduction  Decays Efficiency Charge Asymmetry Pt Dependence.
Extrapolation Techniques  Four different techniques have been used to extrapolate near detector data to the far detector to predict the neutrino energy.
I'm concerned that the OS requirement for the signal is inefficient as the charge of the TeV scale leptons can be easily mis-assigned. As a result we do.
1 Unfolding Study 07/11/12. 2 RooUnfold Using the RooUnfold In this study we make response.
Statistics (2) Fitting Roger Barlow Manchester University IDPASC school Sesimbra 14 th December 2010.
Data Analysis in Particle Physics
Unfolding Problem: A Machine Learning Approach
Unfolding atmospheric neutrino spectrum with IC9 data (second update)
Chi Square Distribution (c2) and Least Squares Fitting
Computing and Statistical Data Analysis / Stat 7
Statistical Methods for Particle Physics Lecture 4: unfolding
Introduction to Unfolding
Juande Zornoza UW Madison
Unfolding with system identification
Presentation transcript:

8th December 2004Tim Adye1 Proposal for a general-purpose unfolding framework in ROOT Tim Adye Rutherford Appleton Laboratory BaBar Statistics Working Group BaBar Collaboration Meeting 8 th December 2004

Tim Adye2 Outline What is Unfolding? and why might you want to do it? Overview of a few techniques Regularised unfolding Iterative method Idea for a ROOT package … but not much code yet! References

8th December 2004Tim Adye3 Unfolding In other fields known as “deconvolution” or “unsmearing” Given a “true” PDF, μ, that is corrupted by detector effects, described by a response function, R, we measure a distribution ν. In terms of histograms This may involve 1.inefficiencies: lost events 2.smearing: events moving between bins (off-diagonal R ij ) With infinite statistics, it would be possible to recover the original PDF by inverting the response matrix

8th December 2004Tim Adye4 Not so simple… Unfortunately, if there are statistical fluctuations between bins this information is destroyed Since R washes out statistical fluctuations, R -1 cannot distinguish between wildly fluctuating and smooth PDFs Obtain large negative correlations between adjacent bins Large fluctuations in reconstructed bin contents Need some procedure to remove wildly fluctuating solutions 1.Give added weight to “smoother” solutions 2.Solve for µ iteratively, starting with a reasonable guess and truncate iteration before it gets out of hand

8th December 2004Tim Adye5 What happens if you don’t smooth

8th December 2004Tim Adye6

8th December 2004Tim Adye7 So why don’t we always do this? If the true PDF and resolution function can be parameterised, then a ML fit is usually more convenient Directly returns parameters of interest Does not require binning If the response function doesn’t include smearing (ie. it’s diagonal), then apply bin-by-bin efficiency correction directly If result is just needed for comparison (eg. with MC), could apply response function to MC simpler than un-applying response to data

8th December 2004Tim Adye8 When to use unfolding Use unfolding to recover theoretical distribution where there is no a-priori parameterisation this is needed for the result and not just comparison with MC there is significant bin-to-bin migration of events

8th December 2004Tim Adye9 Where could we use unfolding? Traditionally used to extract structure functions Widely used outside PP for image reconstruction Dalitz plots Cross-feed between bins due to misreconstruction “True” decay momentum distributions Theory at parton level, we measure hadrons Correct for hadronisation as well as detector effects Maybe could use smoothing for standard ML fits?

8th December 2004Tim Adye10 1. Regularised Unfolding Use ML to fit smeared bin contents to measured data, but include regularisation function where the regularisation parameter, α, controls the degree of smoothness. various criteria are used to select α eg. minimise mean squared error Various choices of regularisation function, S, are used Tikhonov regularisation: minimise curvature for some definition of curvature, eg. RUN by Volker Blobel GURU by Andreas Höcker and Vakhtang Kartvelishvili using Singular Value Decomposition Maximum entropy:

8th December 2004Tim Adye11 2. Iterative method Uses Bayes’ theorem to invert and using an initial set of probabilities, p i (eg. flat) obtain an improved estimate Repeating with new p i from these new bin contents converges quite rapidly Truncating the iteration prevents us seeing the bad effects of statistical fluctuations Fergus Wilson has implemented this method in ROOT/C++

8th December 2004Tim Adye12 A ROOT framework It would be nice if these different methods could be made available as ROOT/C++ classes a la RooFit Could have a common interface to specify unfolding method and parameters response matrix or it could calculate it from MC samples measured histogram and return reconstructed truth histogram Handle 1D and 2D at least

8th December 2004Tim Adye13 A ROOT framework (2) Should also handle “simple” cases correction factors direct inversion of response matrix allowing to easily see whether full unfolding required Could also be useful outside BaBar!

8th December 2004Tim Adye14 Progress so far I have played around with Fergus’ code Can now be used in interactive ROOT Having a few problems right now extending beyond simple example Need to understand more options before designing an interface Also started to look at Andreas’ GURU package Is this a good idea? Or would I better spend my time with yet another tweak to the bookkeeping system  I am still just learning, so pointers, suggestions, and ideas are most welcome…

8th December 2004Tim Adye15 References G. Cowan, A Survey of Unfolding Methods for Particle Physics, Proc. Advanced Statistical Techniques in Particle Physics, Durham (2002) G. Cowan, Statistical Data Analysis, Oxford University Press (1998) R. Barlow, SLUO Lectures on Numerical Methods in HEP (2000), Lecture 9: Unfolding www-group.slac.stanford.edu/sluo/Lectures/Stat_Lectures.html V. Blobel, Unfolding Methods in High Energy Physics, DESY (1984); also CERN A. Höcker and V. Kartvelishvili, SVD Approach to Data Unfolding, NIM A 372 (1996) 469