An update on the Statistical Toolkit Barbara Mascialino, Maria Grazia Pia, Andreas Pfeiffer, Alberto Ribon, Paolo Viarengo July 19 th, 2005.

Slides:



Advertisements
Similar presentations
Pattern Recognition and Machine Learning
Advertisements

Alberto Ribon CERN Geant4Workshop Vancouver, September 2003 Tutorial of the Statistical Toolkit
Statistical Toolkit Power of Goodness-of-Fit tests
Maria Grazia Pia, INFN Genova Statistical Testing Project Maria Grazia Pia, INFN Genova on behalf of the Statistical Testing Team
One-way nonparametric ANOVA with trigonometric scores by Kravchuk, O.Y. School of Land and Food Sciences, University of Queensland.
Maria Grazia Pia, INFN Genova Geant4 Physics Validation (mostly electromagnetic, but also hadronic…) K. Amako, S. Guatelli, V. Ivanchenko, M. Maire, B.
Philip Twumasi-Ankrah, PhD
Class Session #2 Numerically Summarizing Data
Barbara MascialinoIEEE-NSSOctober 21 th, 2004 Application of statistical methods for the comparison of data distributions Susanna Guatelli, Barbara Mascialino,
Distribution of Sample Means, the Central Limit Theorem If we take a new sample, the sample mean varies. Thus the sample mean has a distribution, called.
DISTRIBUTION FITTING.
Barbara Mascialino, INFN Genova An update on the Goodness of Fit Statistical Toolkit B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
K sample problems and non-parametric tests. Two-Sample T-test (unpaired)
Bivariate Statistics GTECH 201 Lecture 17. Overview of Today’s Topic Two-Sample Difference of Means Test Matched Pairs (Dependent Sample) Tests Chi-Square.
Maria Grazia Pia, INFN Genova A Toolkit for Statistical Data Analysis M.G. Pia S. Donadio, F. Fabozzi, L. Lista, S. Guatelli, B. Mascialino, A. Pfeiffer,
Maria Grazia Pia, INFN Genova Test & Analysis Project Maria Grazia Pia, INFN Genova on behalf of the T&A team
Geant4-INFN (Genova-LNS) Team Validation of Geant4 electromagnetic and hadronic models against proton data Validation of Geant4 electromagnetic and hadronic.
Maria Grazia Pia Systematic validation of Geant4 electromagnetic and hadronic models against proton data Systematic validation of Geant4 electromagnetic.
Principles of the Global Positioning System Lecture 10 Prof. Thomas Herring Room A;
M obile C omputing G roup A quick-and-dirty tutorial on the chi2 test for goodness-of-fit testing.
Comparison of data distributions: the power of Goodness-of-Fit Tests
Density Curves and the Normal Distribution
Copyright © Cengage Learning. All rights reserved. 8 Tests of Hypotheses Based on a Single Sample.
T-distribution & comparison of means Z as test statistic Use a Z-statistic only if you know the population standard deviation (σ). Z-statistic converts.
CHAPTER 7: CONTINUOUS PROBABILITY DISTRIBUTIONS. CONTINUOUS PROBABILITY DISTRIBUTIONS (7.1)  If every number between 0 and 1 has the same chance to be.
1 Using martingale residuals to assess goodness of fit for sampled risk set data Ørnulf Borgan Department of Mathematics University of Oslo Based on joint.
Common Nonparametric Statistical Techniques in Behavioral Sciences Chi Zhang, Ph.D. University of Miami June, 2005.
Geant4 Acceptance Suite for Key Observables CHEP06, T.I.F.R. Mumbai, February 2006 J. Apostolakis, I. MacLaren, J. Apostolakis, I. MacLaren, P. Mendez.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
Warsaw Summer School 2014, OSU Study Abroad Program Variability Standardized Distribution.
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Provide tools for the statistical comparison of distributions  equivalent reference distributions  experimental measurements  data from reference sources.
Alberto Ribon, CERN Statistical Testing Project Alberto Ribon, CERN on behalf of the Statistical Testing Team CLHEP Workshop CERN, 28 January 2003.
ECE 8443 – Pattern Recognition LECTURE 07: MAXIMUM LIKELIHOOD AND BAYESIAN ESTIMATION Objectives: Class-Conditional Density The Multivariate Case General.
1/23 Ch10 Nonparametric Tests. 2/23 Outline Introduction The sign test Rank-sum tests Tests of randomness The Kolmogorov-Smirnov and Anderson- Darling.
Maria Grazia Pia, INFN Genova Statistical Toolkit Recent updates M.G. Pia B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
STATISTICAL ANALYSIS OF FATIGUE SIMULATION DATA J R Technical Services, LLC Julian Raphael 140 Fairway Drive Abingdon, Virginia.
Toward a unified approach to fitting loss models Jacques Rioux and Stuart Klugman, for presentation at the IAC, Feb. 9, 2004.
Probabilistic and Statistical Techniques 1 Lecture 3 Eng. Ismail Zakaria El Daour 2010.
Randomness Test Fall 2012 By Yaohang Li, Ph.D.. Review Last Class –Random Number Generation –Uniform Distribution This Class –Test of Randomness –Chi.
Chapter 3 Looking at Data: Distributions Chapter Three
Maria Grazia Pia, INFN Genova Update on the Goodness of Fit Toolkit M.G. Pia B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
Brownian Bridge and nonparametric rank tests Olena Kravchuk School of Physical Sciences Department of Mathematics UQ.
GG 313 Lecture 9 Nonparametric Tests 9/22/05. If we cannot assume that our data are at least approximately normally distributed - because there are a.
B AD 6243: Applied Univariate Statistics Data Distributions and Sampling Professor Laku Chidambaram Price College of Business University of Oklahoma.
Summarizing Risk Analysis Results To quantify the risk of an output variable, 3 properties must be estimated: A measure of central tendency (e.g. µ ) A.
 A Characteristic is a measurable description of an individual such as height, weight or a count meeting a certain requirement.  A Parameter is a numerical.
Maria Grazia Pia, INFN Genova Statistics Toolkit Project Maria Grazia Pia, INFN Genova AIDA Workshop.
The Statistical Testing Project Stefania Donadio and Barbara Mascialino January 15 TH, 2003.
Barbara MascialinoMonte Carlo 2005Chattanooga, April 19 th 2005 Monte Carlo Chattanooga, April 2005 B. Mascialino, A. Pfeiffer, M. G. Pia, A. Ribon,
Alberto Ribon CERN Geant4Workshop Vancouver, September 2003 Tutorial of the Statistical Toolkit
Last lecture summary Types of statistics Measures of central tendency Measures of variability Bias, Bessel's correction MAD Normal distribution Empirical.
Chapter 6: Descriptive Statistics. Learning Objectives Describe statistical measures used in descriptive statistics Compute measures of central tendency.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Update on the Goodness of Fit Toolkit
Chapter 7: Sampling Distributions
A Statistical Toolkit for Data Analysis
Descriptive Statistics
F. Foppiano, S. Guatelli, B. Mascialino, M. G. Pia, M. Piergentili
Data analysis in HEP: a statistical toolkit
Descriptive Statistics: Describing Data
B.Mascialino, A.Pfeiffer, M.G.Pia, A.Ribon, P.Viarengo
An update on the Goodness of Fit Statistical Toolkit
Measure of Central Tendency
Validating a Random Number Generator
Testing the Fit of a Quantal Model of Neurotransmission
Statistical Testing Project
Comparison of data distributions: the power of Goodness-of-Fit Tests
Data analysis in HEP: a statistical toolkit
Presentation transcript:

An update on the Statistical Toolkit Barbara Mascialino, Maria Grazia Pia, Andreas Pfeiffer, Alberto Ribon, Paolo Viarengo July 19 th, 2005

G.A.P Cirrone, S. Donadio, S. Guatelli, A. Mantero, B. Mascialino, S. Parlati, M.G. Pia, A. Pfeiffer, A. Ribon, P. Viarengo “A Goodness-of-Fit Statistical Toolkit” IEEE- Transactions on Nuclear Science (2004), 51 (5): Release StatisticsTesting-V downloadable from the web:

EMPIRICAL DISTRIBUTION FUNCTION ORIGINAL DISTRIBUTIONS Kolmogorov-Smirnov test Goodman approximation of KS test Kuiper test D mn Tests based on maximum distance unbinned distributions SUPREMUMSTATISTICS

Fisz-Cramer-von Mises test Anderson-Darling test Tests containing a weighting function binned/unbinned distributions EMPIRICAL DISTRIBUTION FUNCTION ORIGINAL DISTRIBUTIONS QUADRATICSTATISTICS+ WEIGHTING FUNCTION Sum/integral of all the distances

1.Status of the existing tests 2. New GoF Tests added 3. Description of the power study phase I 4. Description of the power study phase II 5. A concrete example: IMRT

1.Status of the existing tests: Fisz-Cramer-von Mises Conover (book) + Darling (1957): - The two-sample Cramer-von Mises test (Fisz test) has the same asymptotic distribution of the one-sample test (Cramer-von Mises test). - The equation of the asymptotic distribution is available in the paper by Anderson and Darling (1952). binned/unbinned distributions

1.Status of the existing tests: two-sample Anderson-Darling Scholz and Stephens (1987): - -The two-sample Anderson-Darling test can be written in different ways: - exact formulation (for unbinned distributions only) - approximated formulation (for binned/unbinned distributions) - -The approximated distance is already available in the toolkit. - -The asymptotic distributions of both exact and approximated formulations are available in the paper. - - The two-sample Anderson-Darling test has the same asymptotic distribution of the one-sample test. binned/unbinned distributions

1.Status of the existing tests: Tiku test Tiku (1965): - - Cramer-von Mises test in a chi-squared approximation. - - Cramer-von Mises test statistics is converted into a central chi-square, bypassing the problem of integrating the weighting function. binned/unbinned distributions

2. New GoF Tests: weighted Kolmogorov-Smirnov Canner (1975) & Buning (2001): - - Canner modified KS test introducing one weighting function identical to the one used in AD test. - - Buning modified KS test introducing one weighting function similar to the one used in AD test. - The equation of the asymptotic distribution is not available in Canner’s paper, only a few critical values for some samples sizes (n=m). unbinned distributions

2. New GoF Tests: weighted Cramer von Mises Buning (2001): - - Buning modified CVM test introducing one weighting function similar to the one used in AD test. - The equation of the asymptotic distribution is not available in the paper, only critical values for many samples sizes. unbinned distributions 2

2. New GoF Tests: Watson Watson (1975): - -Derives from Cramer-von Mises test statistics. - - Like Kuiper test it can be applied in case of cyclic observations. - The equation of the asymptotic distribution is not available in the paper, only critical values for many samples sizes. 22

Other news New user layer dealing with ROOT histograms (Andreas is working on that). Paper to IEEE-TNS Next release of the GoF Statistical Toolkit scheduled within summer.

Future developments Fix some design-related problems. New design (add uncertainties). Extend the toolkit to the comparison of: –Experimental data versus theoretical functions, –k-sample problem, –Many dimensional one-, two-, k-sample problem.

Which is the recipe to select the most suitable Goodness-of-Fit test among the ones available in the GoF Statistical Toolkit?

3. Description of the power study – phase I SAMPLE 1 RESULTS:LOCATION-SCALEALTERNATIVE SAMPLE 2 TEST PARENT 1 PARENT 2 MONTECARLOREPLICATIONSk=1000 EDF STATISTICS (UNBINNED DATA): KS, KSW, KSA, KUIPER, CVM, ADA “EMPIRICAL” POWER EVALUATION RESULTS:GENERALALTERNATIVE COMPARISON WITH PUBLISHEDRESULTSREALDATAEXAMPLES

Parent distributions Uniform Gaussian Double exponential Cauchy Exponential Contaminated Normal Distribution 2Contaminated Normal Distribution 1

Skewness and tailweight ParentST f 1 (x) Uniform f 2 (x) Gaussian f 3 (x) Double exponential f 4 (x) Cauchy f 5 (x) Exponential f 6 (x) Contamined normal f 7 (x) Contamined normal SkewnessTailweight

Comparative evaluation of tests Short (T<1.5) Medium (1.5 < T < 2) Long(T>2) S~1S~1S~1S~1KS KS – CVM CVM - AD S>1.5 KS - AD AD CVM - AD Skewness Tailweight 2222 2222 Supremum statistics tests Tests containing a weight function < <

4. Description of the power study – phase II SAMPLE 1 RESULTS:LOCATION-SCALEALTERNATIVE SAMPLE 2 TEST PARENT 1 PARENT 2 MONTECARLOREPLICATIONSk=1000 BINNED/UNBINNED DATA CHI2, KS, KSW, KSA, KUIPER, CVM, CVMA, ADA, AD “EMPIRICAL” + “MC” POWER EVALUATION RESULTS:GENERALALTERNATIVELINEARPOWERCORRELATIONBETWEENTESTSISODYNES

Which is the most suitable goodness-of-fit test? EXAMPLE EXAMPLE : unbinned data Lateral profiles 5. A concrete example: IMRT MichelaPiergentili

GoF test selection SkewnessTailweight S = 1: symmetric distribution S < 1: left skewed distribution S > 1: right skewed distribution T is always greater than 1, the longer the tail the greater the value of T. 1.Classify the type of the distributions in terms of skewness S and tailweight T

Comparative evaluation of tests power Short (T<1.3) Medium (1.3 < T < 2) Long(T>2) S~1S~1S~1S~1KS KS – CVM CVM - AD S>1.5 KS - AD AD CVM - AD Skewness Tailweight 2. Choose the most appropriate test for the classified type of distribution

GoF test: test selection & results Moderate skewed – medium tail KOLMOGOROV-SMIRNOV TEST D=0.27 – p>0.05 X-variable: Ŝ=1.53 T=1.36 Y-variable: Ŝ=1.27 T=1.34 ^ ^ RESULTS RESULTS: unbinned data