K. Kinoshita University of Cincinnati Belle Collaboration Statistical distribution of  - zero free parameters impact of free parameters some speculations.

Slides:



Advertisements
Similar presentations
Fundamentals of Data Analysis Lecture 12 Methods of parametric estimation.
Advertisements

6-1 Introduction To Empirical Models 6-1 Introduction To Empirical Models.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #21.
Chap 8: Estimation of parameters & Fitting of Probability Distributions Section 6.1: INTRODUCTION Unknown parameter(s) values must be estimated before.
Fermilab, June 26 th 2001 Thoughts on the fitting procedure for the  c + lifetime with the        channel Gianluigi Boca.
1/15 Sensitivity to  with B  D(KK  )K Decays CP Working Group Meeting - Thursday, 19 th April 2007 Introduction B  DK  Dalitz Analysis Summary.
A. BondarModel-independent φ 3 measurement August 6, 2007Charm 2007, Cornell University1/15 γ/φ 3 model-independent Dalitz analysis (Dalitz+CP tagged Dalitz.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem, random variables, pdfs 2Functions.
Statistical Inference Chapter 12/13. COMP 5340/6340 Statistical Inference2 Statistical Inference Given a sample of observations from a population, the.
Multivariate Data Analysis Chapter 4 – Multiple Regression.
Simulation Modeling and Analysis Session 12 Comparing Alternative System Designs.
Statistical inference
Statistical inference form observational data Parameter estimation: Method of moments Use the data you have to calculate first and second moment To fit.
G. Cowan Lectures on Statistical Data Analysis 1 Statistical Data Analysis: Lecture 8 1Probability, Bayes’ theorem, random variables, pdfs 2Functions of.
7. Least squares 7.1 Method of least squares K. Desch – Statistical methods of data analysis SS10 Another important method to estimate parameters Connection.
Chi Square Distribution (c2) and Least Squares Fitting
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Rao-Cramer-Frechet (RCF) bound of minimum variance (w/o proof) Variance of an estimator of single parameter is limited as: is called “efficient” when the.
Sampling Distributions & Point Estimation. Questions What is a sampling distribution? What is the standard error? What is the principle of maximum likelihood?
Introduction to Multilevel Modeling Using SPSS
Day 7 Model Evaluation. Elements of Model evaluation l Goodness of fit l Prediction Error l Bias l Outliers and patterns in residuals.
880.P20 Winter 2006 Richard Kass 1 Confidence Intervals and Upper Limits Confidence intervals (CI) are related to confidence limits (CL). To calculate.
Stats for Engineers Lecture 9. Summary From Last Time Confidence Intervals for the mean t-tables Q Student t-distribution.
Basic Business Statistics, 11e © 2009 Prentice-Hall, Inc. Chap 8-1 Chapter 8 Confidence Interval Estimation Basic Business Statistics 11 th Edition.
Confidence Interval Estimation
880.P20 Winter 2006 Richard Kass 1 Maximum Likelihood Method (MLM) Does this procedure make sense? The MLM answers this question and provides a method.
G. Cowan 2009 CERN Summer Student Lectures on Statistics1 Introduction to Statistics − Day 4 Lecture 1 Probability Random variables, probability densities,
G. Cowan Lectures on Statistical Data Analysis Lecture 3 page 1 Lecture 3 1 Probability (90 min.) Definition, Bayes’ theorem, probability densities and.
880.P20 Winter 2006 Richard Kass 1 Hypothesis Testing The goal of hypothesis testing is to set up a procedure(s) to allow us to decide if a model is acceptable.
Measures of Variability In addition to knowing where the center of the distribution is, it is often helpful to know the degree to which individual values.
Christof Roland / MITSQM 2004September 2004 Christof Roland / MIT For the NA49 Collaboration Strange Quark Matter 2004 Capetown, South Africa Event-by-Event.
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #23.
Chapter 7 Point Estimation
Chapter 11 Linear Regression Straight Lines, Least-Squares and More Chapter 11A Can you pick out the straight lines and find the least-square?
Lab 3b: Distribution of the mean
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
SUPA Advanced Data Analysis Course, Jan 6th – 7th 2009 Advanced Data Analysis for the Physical Sciences Dr Martin Hendry Dept of Physics and Astronomy.
Likelihood Methods in Ecology November 16 th – 20 th, 2009 Millbrook, NY Instructors: Charles Canham and María Uriarte Teaching Assistant Liza Comita.
1 Lecture 16: Point Estimation Concepts and Methods Devore, Ch
ELEC 303 – Random Signals Lecture 18 – Classical Statistical Inference, Dr. Farinaz Koushanfar ECE Dept., Rice University Nov 4, 2010.
- 1 - Bayesian inference of binomial problem Estimating a probability from binomial data –Objective is to estimate unknown proportion (or probability of.
Lecture 4: Statistics Review II Date: 9/5/02  Hypothesis tests: power  Estimation: likelihood, moment estimation, least square  Statistical properties.
Lecture 3: Statistics Review I Date: 9/3/02  Distributions  Likelihood  Hypothesis tests.
CLASS: B.Sc.II PAPER-I ELEMENTRY INFERENCE. TESTING OF HYPOTHESIS.
1 Introduction to Statistics − Day 4 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Lecture 2 Brief catalogue of probability.
CHAPTER 9 Inference: Estimation The essential nature of inferential statistics, as verses descriptive statistics is one of knowledge. In descriptive statistics,
1 Methods of Experimental Particle Physics Alexei Safonov Lecture #24.
1 Marek Szczekowski, Artur Ukleja Warsaw Group 3 June 2015 Model-independent search for CPV in three-bodies charm baryon decays Outline Selections for.
1 Introduction to Statistics − Day 3 Glen Cowan Lecture 1 Probability Random variables, probability densities, etc. Brief catalogue of probability densities.
1 Warsaw Group May 2015 Search for CPV in three-bodies charm baryon decays Outline Selections Mass distributions and reconstructed numbers of candidates.
Christof Roland / MITQuark Matter 2004January 2004 Christof Roland / MIT For the NA49 Collaboration Quark Matter 2004 Oakland,CA Event-by-Event Fluctuations.
G. Cowan Lectures on Statistical Data Analysis Lecture 9 page 1 Statistical Data Analysis: Lecture 9 1Probability, Bayes’ theorem 2Random variables and.
Topics Semester I Descriptive statistics Time series Semester II Sampling Statistical Inference: Estimation, Hypothesis testing Relationships, casual models.
Andrzej Bożek for Belle Coll. I NSTITUTE OF N UCLEAR P HYSICS, K RAKOW ICHEP Beijing 2004  3 and sin(2  1 +  3 ) at Belle  3 and sin(2  1 +  3 )
Parameter Estimation. Statistics Probability specified inferred Steam engine pump “prediction” “estimation”
G. Cowan Lectures on Statistical Data Analysis Lecture 10 page 1 Statistical Data Analysis: Lecture 10 1Probability, Bayes’ theorem 2Random variables and.
Richard Kass/F02P416 Lecture 6 1 Lecture 6 Chi Square Distribution (  2 ) and Least Squares Fitting Chi Square Distribution (  2 ) (See Taylor Ch 8,
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Lecture 1 Fadzly, N Normality and data transformation + Non-parametric tests.
11-1 Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics 350 Review. Today Today: Review Simple Linear Regression Simple linear regression model: Y i =  for i=1,2,…,n Distribution of errors.
Fundamentals of Data Analysis Lecture 11 Methods of parametric estimation.
Outline Sampling Measurement Descriptive Statistics:
Parameter Estimation and Fitting to Data
BA 275 Quantitative Business Methods
A New Measurement of |Vus| from KTeV
MEGN 537 – Probabilistic Biomechanics Ch.3 – Quantifying Uncertainty
Basic Training for Statistical Process Control
Basic Training for Statistical Process Control
Classical regression review
Presentation transcript:

K. Kinoshita University of Cincinnati Belle Collaboration Statistical distribution of  - zero free parameters impact of free parameters some speculations evaluating quality of fit in Unbinned Maximum Likelihood Fitting

2 Unbinned Maximum Likelihood (UMxL) fitting: preferred for determining parameter(s)  via parameter- dependent shape f(x;  ) of distribution in measured x maximizes use of x information, esp. w. limited statistics used in many current analyses - CP, lifetime, Dalitz plot, … Goodness-of-fit: to answer - are data statistically consistent with fitted shape (not easily visualized within binless context, esp. in multiple dim.) ? - is f(x;  ) a valid parametrization? Straightforward for least-squares To date, no good test in UMxL - why not? Motivation

3 Unbinned Maximum Likelihood (UMxL) fit Brief outline: have experiment w N measurements of x - Maximize under variations in  (f(x;  )=normalized PDF) Equivalent to maximizing: Max. at Wish to examine fit quality - questions: How are  distributed in ensemble, if root is ? 0 free parameters effect of free parameters at what level can other distributions be ruled out?

4 Distribution with zero free parameters Mean: limit for large N= expected mean for finite N Variance: of over PDF = (“Statistical Methods in Experimental Physics”, Eadie, Drijard, James, Roos, & Sadoulet) Summary: Ensemble expts w N measurements of x 0 free parameters

5 UMxL with free parameter(s) (1)in each experiment, is maximized - (2)J. Heinrich note (CDF/MEMO/BOTTOM/CDFR/5639) : toy MC ’ s for 2 different PDF ’ s by UMxL found confirmed in analytic calculation -> conjecture: “ CL/goodness ” is always 100% First, examine (2) …

6 Does fitted always give expected ? Rewrite parametrized PDF -> measured distribution To maximize, => ” expectation value ” over PDF <-

7 Just 2 measured numbers characterize data vis-a-vis f: (averages, not highly correlated w shape of data distribution -> no GoF) Look at PDF ’ s examined by Heinrich: (a) Note: (b) Note: => 0 DoF in max => 0 DoF i.e. these are special cases where fixes The bottom line: (2 params - 3 measured # ’ s, 2 constraints) (+maximization of constrains 1) (1 param - 2 measured # ’ s, 1 constraint)

8 However … there is often a partial correlation …    Seen in max vs  max - Always get max = E[ (  max )] Illustrate: max  max 100% correlation is special case

9    (-1<x<1) … example => 2 largest terms are highly correlated  max max  max is at least partially correlated w measured  max, INDEPENDENTLY from actual distribution in data->no GoF

10 Can max be used within parametrization to set confidence interval?   max - how much is mean shifted by fit?  max  =O(   )->conjecture:  =O(0.5) per fitted parameter N=10 If we assume 100% correlation, determines

11 N=10  =1.0  =0.51±0.01 N=100  =1.0  =0.5±0.1 N=1000  =0.5  =0.5±0.1 Test on the same suspects Mean = ±0.001 Mean = ±0.001 Mean = ±0.001 Mean = ±0.001 Mean = ± Mean = ±0.0001

12 Other speculations tests of fit quality using information generated in UMxL … without resorting to binning Test on subsets of fitted sample, e.g. sin2  1 result from simultaneous fit over many decay modes - compare max in different sets w. expectation - “  2 ” event-by-event distribution {  (  max )} - moments, or K-S test Maybe - with stronger demo of distribution shift, width shift, extension to multiple parameters nice for multi-parameter fits - reduce to 1-d … can max be used within parametrization to set confidence interval?

13 Summary Goodness-of-fit for UMxL sorry, not possible with max alone Other measures of fit quality Desirable, especially for multiparameter fitting steps toward definition of max distribution for general PDF speculation - exploit info in {  (  max )}