Ronan McNulty EWWG 05.04.111 A general methodology for updating PDF sets with LHC data Francesco de Lorenzi*, Ronan McNulty (University College Dublin)

Slides:



Advertisements
Similar presentations
ECE 8443 – Pattern Recognition LECTURE 05: MAXIMUM LIKELIHOOD ESTIMATION Objectives: Discrete Features Maximum Likelihood Resources: D.H.S: Chapter 3 (Part.
Advertisements

Biointelligence Laboratory, Seoul National University
The Roles of Uncertainty and Randomness in Online Advertising Ragavendran Gopalakrishnan Eric Bax Raga Gopalakrishnan 2 nd Year Graduate Student (Computer.
FTP Biostatistics II Model parameter estimations: Confronting models with measurements.
Sampling: Final and Initial Sample Size Determination
Lecture 3 Probability and Measurement Error, Part 2.
Visual Recognition Tutorial
The Simple Linear Regression Model: Specification and Estimation
1 META PDFs as a framework for PDF4LHC combinations Jun Gao, Joey Huston, Pavel Nadolsky (presenter) arXiv: , Parton.
G. Cowan RHUL Physics Profile likelihood for systematic uncertainties page 1 Use of profile likelihood to determine systematic uncertainties ATLAS Top.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
Fall 2006 – Fundamentals of Business Statistics 1 Chapter 6 Introduction to Sampling Distributions.
Sept 2003PHYSTAT41 Uncertainty Analysis (I) Methods.
8 Statistical Intervals for a Single Sample CHAPTER OUTLINE
Lecture 11 Multivariate Regression A Case Study. Other topics: Multicollinearity  Assuming that all the regression assumptions hold how good are our.
Visual Recognition Tutorial
Modular 13 Ch 8.1 to 8.2.
Sept 2003PHYSTAT41 Uncertainty Analysis (I) Methods.
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Analysis of Covariance Goals: 1)Reduce error variance. 2)Remove sources of bias from experiment. 3)Obtain adjusted estimates of population means.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 7 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
Lecture II-2: Probability Review
Modern Navigation Thomas Herring
Preliminary results in collaboration with Pavel Nadolsky Les Houches /6/5.
TOPLHCWG. Introduction The ATLAS+CMS combination of single-top production cross-section measurements in the t channel was performed using the BLUE (Best.
Elec471 Embedded Computer Systems Chapter 4, Probability and Statistics By Prof. Tim Johnson, PE Wentworth Institute of Technology Boston, MA Theory and.
Physics 114: Lecture 15 Probability Tests & Linear Fitting Dale E. Gary NJIT Physics Department.
1 Combination of PDF uncertainties for LHC observables Pavel Nadolsky (SMU) in collaboration with Jun Gao (Argonne), Joey Huston (MSU) Based on discussions.
GEO7600 Inverse Theory 09 Sep 2008 Inverse Theory: Goals are to (1) Solve for parameters from observational data; (2) Know something about the range of.
880.P20 Winter 2006 Richard Kass Propagation of Errors Suppose we measure the branching fraction BR(Higgs  +  - ) using the number of produced Higgs.
Chapter 15 Modeling of Data. Statistics of Data Mean (or average): Variance: Median: a value x j such that half of the data are bigger than it, and half.
W/Z PRODUCTION AND PROPERTIES Anton Kapliy (University of Chicago) on behalf of the ATLAS collaboration PHENO-2012.
Prof. Dr. S. K. Bhattacharjee Department of Statistics University of Rajshahi.
Lecture 12 Statistical Inference (Estimation) Point and Interval estimation By Aziza Munir.
A Neural Network MonteCarlo approach to nucleon Form Factors parametrization Paris, ° CLAS12 Europen Workshop In collaboration with: A. Bacchetta.
NNPDF Partons: Issues NNPDF2.1 Users Guide Reweighting RDB, Valerio Bertone, Francesco Cerutti, Luigi Del Debbio, Stefano Forte, Alberto Guffanti, Jose.
Linear Regression Andy Jacobson July 2006 Statistical Anecdotes: Do hospitals make you sick? Student’s story Etymology of “regression”
ECE 8443 – Pattern Recognition ECE 8423 – Adaptive Signal Processing Objectives: Deterministic vs. Random Maximum A Posteriori Maximum Likelihood Minimum.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Predictions for high energy neutrino cross-sections from ZEUS-S Global fit analysis S Chekanov et al, Phys Rev D67, (2002) The ZEUS PDFs are sets.
An Introduction to Kalman Filtering by Arthur Pece
July 11, 2006Bayesian Inference and Maximum Entropy Probing the covariance matrix Kenneth M. Hanson T-16, Nuclear Physics; Theoretical Division Los.
Data Modeling Patrice Koehl Department of Biological Sciences National University of Singapore
Chapter 20 Classification and Estimation Classification – Feature selection Good feature have four characteristics: –Discrimination. Features.
More on NLOQCD fits ZEUS Collab Meeting March 2003 Eigenvector PDF sets- ZEUS-S 2002 PDFS accessible on HEPDATA High x valence distributions from ZEUS-Only.
G. Cowan Lectures on Statistical Data Analysis Lecture 8 page 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem 2Random variables and.
Statistical Inference Statistical inference is concerned with the use of sample data to make inferences about unknown population parameters. For example,
Treatment of correlated systematic errors PDF4LHC August 2009 A M Cooper-Sarkar Systematic differences combining ZEUS and H1 data  In a QCD fit  In a.
11 QCD analysis with determination of α S (M Z ) based on HERA inclusive and jet data: HERAPDF1.6 A M Cooper-Sarkar Low-x meeting June 3 rd 2011 What inclusive.
G. Cowan Computing and Statistical Data Analysis / Stat 9 1 Computing and Statistical Data Analysis Stat 9: Parameter Estimation, Limits London Postgraduate.
Statistical Methods. 2 Concepts and Notations Sample unit – the basic landscape unit at which we wish to establish the presence/absence of the species.
CT14 PDF update J. Huston* PDF4LHC meeting April 13, 2015 *for CTEQ-TEA group: S. Dulat, J. Gao, M. Guzzi, T.-J. Hou, J. Pumplin, C. Schmidt, D. Stump,
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
H1 and ZEUS Combined PDF Fit DIS08 A M Cooper Sarkar on behalf of ZEUS and H1 HERA Structure Function Working Group NLO DGLAP PDF fit to the combined HERA.
D Parton Distribution Functions, Part 2. D CT10-NNLO Parton Distribution Functions.
Université d’Ottawa / University of Ottawa 2001 Bio 8100s Applied Multivariate Biostatistics L11.1 Lecture 11: Canonical correlation analysis (CANCOR)
Chapter 7: The Distribution of Sample Means
Moriond 2001Jets at the TeVatron1 QCD: Approaching True Precision or, Latest Jet Results from the TeVatron Experimental Details SubJets and Event Quantities.
1 Proton Structure Functions and HERA QCD Fit HERA+Experiments F 2 Charged Current+xF 3 HERA QCD Fit for the H1 and ZEUS Collaborations Andrew Mehta (Liverpool.
1 A M Cooper-Sarkar University of Oxford ICHEP 2014, Valencia.
PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 1: INTRODUCTION.
News from HERAPDF A M Cooper-Sarkar PDF4LHC CERN March
HESSIAN vs OFFSET method
Low-x physics at the LHC
ATLAS 2.76 TeV inclusive jet measurement and its PDF impact A M Cooper-Sarkar PDF4LHC Durham Sep 26th 2012 In 2011, 0.20 pb-1 of data were taken at √s.
Learning From Observed Data
Ch 3. Linear Models for Regression (2/2) Pattern Recognition and Machine Learning, C. M. Bishop, Previously summarized by Yung-Kyun Noh Updated.
Presentation transcript:

Ronan McNulty EWWG A general methodology for updating PDF sets with LHC data Francesco de Lorenzi*, Ronan McNulty (University College Dublin) * now at Iowa State (ATLAS)

Ronan McNulty EWWG Concept It would be nice to know the impact that your experimental measurement has on the PDFs. Since each PDF set defines a probability density function (either multinomial distribution or sampling), in principle one can update this using your measurement. NB: This technique assumes that the PDFs can be described using standard statistical technique. NB: This is often not true (issue of tolerances). Consequently this methodology is NOT a replacement for Global Fits, but is indicative of the likely improvements your data will bring.

Ronan McNulty EWWG SS Global Fits HERA Tevatron Fixed Target LHC DGLAP Factorisation Theory LO/NLO/ NNLO Heavy Flavour Parametrisation Consistency of data Tensions Tolerances PDF

Ronan McNulty EWWG For any function, F, depending on λ, the best value is: F(λ 1,λ 2,λ 3 …λ n ) and the uncertainty is What do you get for your money? (An end-user perspective) PDF Hessian A set of parameters: (λ 1, λ 2, λ 3 …λ n ) A covariance matrix: These correspond to distances along a set of orthonormal vectors defined at the minimum of the global chisquared.

Ronan McNulty EWWG PDFs are of form (e.g.): So u v =u v (λ 1, λ 2, …λ n ), d v =d v (λ 1, λ 2 …λ n ), g=g(λ 1, λ 2, …λ n ) And δ uv 2 = Σ(du v /dλ i ) 2 σ i 2 δ dv 2 = Σ(dd v /dλ i ) …. etc. Thus PDFs and uncertainties defined through λ i What do you get for your money? (An end-user perspective) PDF Hessian with η(λ i ) ε (λ i )  (λ i ) But also observables e.g. σ Z (λ 1,λ 2 …λ n ) with or differential cross-sections, or ratios of cross-sections.

Ronan McNulty EWWG How does new data affect PDF? The global PDF fit gives us λ δ 1 0, λ δ 2 0 … λ n 0 +- δ n 0 and predicts σ Z th (λ 1 0,λ 2 0,λ 3 0 …λ n 0 ) and now we measure σ Z. +- δ Z. We can fit for new values of λ i by minimising: measurementpredictionconstraint And taking second derivative of χ2 we get a covariance matrix: where δ i <= δ i 0 and in general off-diagonal terms are non-zero. So improved values for λ i with smaller (though correlated) errors

Ronan McNulty EWWG While observables like σ Z (λ 1,λ 2,λ 3 …λ n ) with δ σ = or differential cross-sections, or ratios of cross-sections, are predicted. (Compare to two slides back) PDFs are of form (e.g.): So u v =u v (λ 1, λ 2, …λ n ), d v =d v (λ 1, λ 2 …λ n ), g=(λ 1, λ 2, …λ n ) And δ uv = δ dv =…. etc. Thus PDFs and uncertainties defined through λ i with η(λ i ) ε (λ i )  (λ i ) du v The old function relationship still holds (though the basis is no longer orthonormal)

Ronan McNulty EWWG Constraining the PDFs F can be , u V, d V, g Before the fitAfter the fit Uncertainties obtained adding in quadrature eigenvector deviation from central value V is the new covariance matrix of the fit Note: Before taking any data, it’s possible to estimate the improvement in the PDFs with a given luminosity. The central value though, will depend on what you measure. δi2δi2

Ronan McNulty EWWG truth VS fitted (each eigenvalues) truth fitted Francesco De Lorenzi7/29/2009

Ronan McNulty EWWG Pull = (truth – fitted)/error_fitted Francesco De Lorenzi7/29/2009

Ronan McNulty EWWG Expected improvement of MSTW08 PDFs with 1fb-1 of LHCb data

Ronan McNulty EWWG What do you get for your money? (An end-user perspective) PDF NNPDF N equal probablity replicas: u v i,d v i,g i,s i etc. (i=1,N) The ensemble defines the probability phase space. For any function, F, depending on u,d,s…g, there will be N possible values for F. The ‘best value’ of F from mean: The uncertainty from RMS. Again, F can be a PDF, or an observable.

Ronan McNulty EWWG Simplistic prescription: The data will be compatible with some replicas, and incompatible with others. Evaluate with Remove incompatible replicas. (Prob<1%) The spread of replicas reduces; the uncertainty is smaller; the mean may shift. How does new data affect PDF?

Ronan McNulty EWWG Expected improvement of NNPDF20 PDFs with 1fb-1 of LHCb data

Ronan McNulty EWWG Improvements for various PDF sets with 1fb-1 of LHCb electroweak data

Ronan McNulty EWWG Sophisticated prescription (NNPDF): Take each PDF replica as a Bayesian Prior. Evaluate Chi2 compared to this replica Probability is Each replica is weighted by this probability Take mean and variance of weighted replicas. How does new data affect PDF?

Ronan McNulty EWWG Summary The NNPDF (sophisticated) prescription has been used to show affect of LHCb, ATLAS and CMS data. (See Maria’s talk) Similar simple techniques can also be used to update the Hessian methods (shown here). CAVEAT: You are assuming that the global fits obey standard error propagation. Past experience has shown this is not always the case.