Maria Grazia Pia, INFN Genova Test & Analysis Project Maria Grazia Pia, INFN Genova on behalf of the T&A team

Slides:



Advertisements
Similar presentations
Alberto Ribon CERN Geant4Workshop Vancouver, September 2003 Tutorial of the Statistical Toolkit
Advertisements

Statistical Toolkit Power of Goodness-of-Fit tests
Maria Grazia Pia, INFN Genova Test & Analysis Project Maria Grazia Pia, INFN Genova on behalf of the T&A team
Maria Grazia Pia, INFN Genova Statistical Testing Project Maria Grazia Pia, INFN Genova on behalf of the Statistical Testing Team
Precision validation of Geant4 electromagnetic physics Katsuya Amako, Susanna Guatelli, Vladimir Ivanchenko, Michel Maire, Barbara Mascialino, Koichi Murakami,
Maria Grazia Pia, INFN Genova Geant4 Physics Validation (mostly electromagnetic, but also hadronic…) K. Amako, S. Guatelli, V. Ivanchenko, M. Maire, B.
Maria Grazia Pia Geant4 LowE Workshop 30-31/5/2002 ow Energy e.m. Workshop CERN, May 2002.
Maria Grazia Pia, INFN Genova PhysicsLists in Geant4 Advanced Examples Geant4.
Simulation of X-ray Fluorescence and Application to Planetary Astrophysics A. Mantero, M. Bavdaz, A. Owens, A. Peacock, M. G. Pia IEEE NSS -- Portland,
Maria Grazia Pia, INFN Genova 1 Part V The lesson learned Summary and conclusions.
Geant4-Genova Group Validation of Susanna Guatelli, Alfonso Mantero, Barbara Mascialino, Maria Grazia Pia, Valentina Zampichelli INFN Genova, Italy IEEE.
Barbara MascialinoIEEE-NSSOctober 21 th, 2004 Application of statistical methods for the comparison of data distributions Susanna Guatelli, Barbara Mascialino,
Barbara Mascialino, INFN Genova An update on the Goodness of Fit Statistical Toolkit B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
Maria Grazia Pia, INFN Genova A Toolkit for Statistical Data Analysis M.G. Pia S. Donadio, F. Fabozzi, L. Lista, S. Guatelli, B. Mascialino, A. Pfeiffer,
Maria Grazia Pia, INFN Genova CERN, 26 July 2004 Background of the Project.
Validation of the Bremsstrahlung models Susanna Guatelli, Barbara Mascialino, Luciano Pandola, Maria Grazia Pia, Pedro Rodrigues, Andreia Trindade IEEE.
Geant4-INFN (Genova-LNS) Team Validation of Geant4 electromagnetic and hadronic models against proton data Validation of Geant4 electromagnetic and hadronic.
Maria Grazia Pia Systematic validation of Geant4 electromagnetic and hadronic models against proton data Systematic validation of Geant4 electromagnetic.
G. Cosmo & M.G. Pia. The Training Kit Project Gabriele Cosmo & Maria Grazia Pia Geant4 Workshop Paris, October 2000
Comparison of data distributions: the power of Goodness-of-Fit Tests
Maria Grazia Pia, INFN Genova Software Process: Physics Maria Grazia Pia INFN Genova on behalf of the Geant4 Collaboration Budker Inst. of Physics IHEP.
BPS - 3rd Ed. Chapter 211 Inference for Regression.
University of Ottawa - Bio 4118 – Applied Biostatistics © Antoine Morin and Scott Findlay 21/09/2015 7:46 PM 1 Two-sample comparisons Underlying principles.
Modeling and Simulation CS 313
M.G. Pia et al. Brachytherapy at IST Results from an atypical Comparison Project Stefano Agostinelli 1,2, Franca Foppiano 1, Stefania Garelli 1, Matteo.
An Empirical Likelihood Ratio Based Goodness-of-Fit Test for Two-parameter Weibull Distributions Presented by: Ms. Ratchadaporn Meksena Student ID:
1 Statistical Distribution Fitting Dr. Jason Merrick.
Tests for Random Numbers Dr. Akram Ibrahim Aly Lecture (9)
CS433: Modeling and Simulation Dr. Anis Koubâa Al-Imam Mohammad bin Saud University 15 October 2010 Lecture 05: Statistical Analysis Tools.
Maria Grazia Pia, INFN Genova Test & Analysis Project aka “statistical testing” Maria Grazia Pia, INFN Genova on behalf of the T&A team
Provide tools for the statistical comparison of distributions  equivalent reference distributions  experimental measurements  data from reference sources.
Week 10 Nov 3-7 Two Mini-Lectures QMM 510 Fall 2014.
Alberto Ribon, CERN Statistical Testing Project Alberto Ribon, CERN on behalf of the Statistical Testing Team CLHEP Workshop CERN, 28 January 2003.
Maria Grazia Pia, INFN Genova Statistical Toolkit Recent updates M.G. Pia B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
Susanna Guatelli & Barbara Mascialino G.A.P. Cirrone (INFN LNS), G. Cuttone (INFN LNS), S. Donadio (INFN,Genova), S. Guatelli (INFN Genova), M. Maire (LAPP),
Geant4 Space User Workshop 2004 Maria Grazia Pia, INFN Genova Proposal of a Space Radiation Environment Generator interfaced to Geant4 S. Guatelli 1, P.
Education 793 Class Notes Presentation 10 Chi-Square Tests and One-Way ANOVA.
An update on the Statistical Toolkit Barbara Mascialino, Maria Grazia Pia, Andreas Pfeiffer, Alberto Ribon, Paolo Viarengo July 19 th, 2005.
Maria Grazia Pia, INFN Genova Update on the Goodness of Fit Toolkit M.G. Pia B. Mascialino, A. Pfeiffer, M.G. Pia, A. Ribon, P. Viarengo
Statistical Methods for Data Analysis Introduction to the course Luca Lista INFN Napoli.
Question paper 1997.
Geant4 Training 2006 Short Course Katsuya Amako (KEK) Gabriele Cosmo (CERN) Susanna Guatelli (INFN Genova) Aatos Heikkinen (Helsinki Institute of Physics)
Maria Grazia Pia, INFN Genova Statistics Toolkit Project Maria Grazia Pia, INFN Genova AIDA Workshop.
The Statistical Testing Project Stefania Donadio and Barbara Mascialino January 15 TH, 2003.
Barbara MascialinoMonte Carlo 2005Chattanooga, April 19 th 2005 Monte Carlo Chattanooga, April 2005 B. Mascialino, A. Pfeiffer, M. G. Pia, A. Ribon,
Geant4 Training 2004 Short Course Katsuya Amako (KEK) Gabriele Cosmo (CERN) Giuseppe Daquino (CERN) Susanna Guatelli (INFN Genova) Aatos Heikkinen (Helsinki.
Maria Grazia Pia, INFN Genova and CERN1 Geant4 highlights of relevance for medical physics applications Maria Grazia Pia INFN Genova and CERN.
Maria Grazia Pia, INFN Genova - G4 WG Coord. Meeting, 13/11/2001 ow Energy Electromagnetic Physics ow Energy Electromagnetic Physics New physics features.
MEGN 537 – Probabilistic Biomechanics Ch.5 – Determining Distributions and Parameters from Observed Data Anthony J Petrella, PhD.
BPS - 5th Ed. Chapter 231 Inference for Regression.
TK2023 Object-Oriented Software Engineering
Proposal of Geant4 Physics Book
Update on the Goodness of Fit Toolkit
Hypothesis testing. Chi-square test
A Statistical Toolkit for Data Analysis
Data analysis in HEP: a statistical toolkit
Hypothesis testing. Chi-square test
B.Mascialino, A.Pfeiffer, M.G.Pia, A.Ribon, P.Viarengo
Chi Square (2) Dr. Richard Jackson
Short Course Siena, 5-6 October 2006
An update on the Goodness of Fit Statistical Toolkit
Introductory Course ORNL, May 2008
Short Course IEEE NSS/MIC 2003 Katsuya Amako (KEK) Makoto Asai (SLAC)
Validating a Random Number Generator
Precision validation of Geant4 electromagnetic physics
Statistical Testing Project
G. A. P. Cirrone1, G. Cuttone1, F. Di Rosa1, S. Guatelli1, A
Comparison of data distributions: the power of Goodness-of-Fit Tests
Data analysis in HEP: a statistical toolkit
Presentation transcript:

Maria Grazia Pia, INFN Genova Test & Analysis Project Maria Grazia Pia, INFN Genova on behalf of the T&A team Geant4 Workshop, TRIUMF, September 2003 Statistical TestingPhysics Testing  2 N-L =13.1 – =20 - p=0.87  2 N-S =23.2 – =15 - p=0.08

Maria Grazia Pia, INFN Genova Test & Analysis Project Test & Analysis is a project to develop a statistical analysis system for usage in Geant4 testing Main application areas Provide tools to compare Geant4 simulation results with reference data – equivalent reference distributions (for instance, regression testing) – experimental measurements – data libraries from reference distribution sources – functions deriving from theoretical calculations or from fits physics validation regression testing system testing

Maria Grazia Pia, INFN Genova Users Other potential users: users of the Geant4 Toolkit, to verify the results of their applications with respect to reference data or their own experimental results PhysicsDevelopers  provide and document requirements  provide feedback on prototypes  perform beta testing  provide use cases for acceptance testingSTT  provide and document requirements  perform formal acceptance testing + users of the standalone Statistical Toolkit

Maria Grazia Pia, INFN Genova History “Statistical testing” agreed as a collaboration-wide goal Initial ideas for this project presented at a TSB meeting, ~end 2001 Informal discussions, spring – summer 2002 Test & Analysis Project launched at Geant4 Workshop 2002 Statistical Toolkit Project Physics Testing Project

Maria Grazia Pia, INFN Genova GoF component Provide statistics algorithms to compare various kinds of distributions i.e. binned, unbinned, continuous, multi-dimensional, affected by experimental errors, … Set of goodness of fit tests chi squared, Kolmogorov-Smirnov, Cramer-von Mises, Anderson-Darling, etc. Geant4 Physics Test Provide distributions of physical quantities of interest, to be compared to reference ones Architecture use Set of Geant4 physics tests cross sections, stopping powers, angular distributions, etc. Statistical Toolkit Project Physics Testing Project

Maria Grazia Pia, INFN Genova Developers Pablo Cirrone (INFN) Stefania Donadio (INFN) Susanna Guatelli (INFN) Alfonso Mantero (INFN) Barbara Mascialino (INFN) Luciano Pandola (INFN) Sandra Parlati (INFN) Andreas Pfeiffer (CERN) MG Pia (INFN) Alberto Ribon (CERN) Simona Saliceti (INFN) Paolo Viarengo (IST) S. Donadio F. Fabozzi S. Guatelli L. Lista B. Mascialino A. Pfeiffer MG Pia A. Ribon P. Viarengo discussions with Fred James, Louis Lyons, Giovanni Punzi Collaboration: G. Cosmo, V. Ivanchenko, M. Maire, S. Sadilov, L. Urban Production resources: Gran Sasso Laboratory HEPstatistics Team

Maria Grazia Pia, INFN Genova One year later... Scope Architecture Software process Statistical algorithms Current status HEPstatistics Scope Physics tests Results Resources needed StatisticsPhysics This workshop: discuss results, open problems discuss integration in regular Geant4 testing agree on future plans synergy with Geant4 Physics Book

Maria Grazia Pia, INFN Genova Statistical Testing Project GoF component of HEPstatistics

Maria Grazia Pia, INFN Genova What is? Provide tools for the statistical comparison of distributions  equivalent reference distributions  experimental measurements  data from reference sources  functions deriving from theoretical calculations or fits Physics analysis Simulation validation Detector monitoring Regression testing Reconstruction vs. expectation Main application areas: A project to develop a statistical comparison system A project to develop a statistical comparison system

Maria Grazia Pia, INFN Genova Vision: the basics software process Rigorous software process vision Have a vision for the project –Motivated by Geant4 –First core of a statistics toolkit for HEP stakeholders Who are the stakeholders? users Who are the users? developers Who are the developers? architecture Build on a solid architecture Clearly define scopeobjectives scope, objectives Flexible, extensible, maintainable Flexible, extensible, maintainable system quality Software quality Clearly define roles

Maria Grazia Pia, INFN Genova User Requirements Document Specific requirements: capability requirements Comparing distributions (  2, KSG, KS, AD, CVM, Kuiper, Lilliefors, list of histograms, select the proper algorithm) Converting distributions (binned  unbinned) Confidence levels Handling distributions (mono/many dim. distributions, reduce dimensionality, filters, toy MonteCarlo) Treatment of errors ( handle statistical and systematic experimental errors) Plotting (original, normalised, cumulative distributions)

Maria Grazia Pia, INFN Genova Architectural guidelines solid architectural approach The project adopts a solid architectural approach functionalityquality –to offer the functionality and the quality needed by the users maintainable –to be maintainable over a large time scale extensible –to be extensible, to accommodate future evolutions of the requirements Component-based approach –to facilitate re-use and integration in diverse frameworksAIDA –adopt a (HEP) standard –no dependence on any specific analysis toolPython –for interactivity LCG Architecture Blueprint RTAG The approach adopted is compatible with the recommendations of the LCG Architecture Blueprint RTAG

Maria Grazia Pia, INFN Genova Software process guidelines USDPtailored USDP, specifically tailored to the project RUP –practical guidance and tools from the RUP –both rigorous and lightweight –mapping onto ISO ISO Guidance from ISO –standard! Incremental and iterative life cycle model Various software process artifacts available on the web –Vision –User Requirements –Architecture and Design model –Traceability matrix –etc.

Maria Grazia Pia, INFN Genova Historical introduction to EDF tests In 1933 Kolmogorov published a short but landmark paper on the Italian Giornale dell’Istituto degli Attuari. He formally defined the empirical distribution function (EDF) and then enquired how close this would be to the true distribution F(x) when this is continuous. It must be noticed that Kolmogorov himself regarded his paper as the solution of an interesting probability problem, following the general interest of the time, rather than a paper on statistical methodology. After Kolmogorov article, over a period of about 10 years, the foundations were laid by a number of distinguished mathematicians of methods of testing fit to a distribution based on the EDF (Smirnov, Cramer, Von Mises, Anderson, Darling, …). The ideas in this paper have formed a platform for vast literature, both of interesting and important probability problems, and also concerning methods of using the Kolmogorov statistics for testing fit to a distribution. The literature continues with great strength today showing no sign to diminish.

Maria Grazia Pia, INFN Genova Goodness-of-fit tests Pearson’s  2 test Kolmogorov test Kolmogorov – Smirnov test Goodman approximation of KS test Lilliefors test Fisz-Cramer-von Mises test Cramer-von Mises test Anderson-Darling test Kuiper test … It is a difficult domain… Implementing algorithms is easy But comparing real-life distributions is not easy Incremental and iterative software process Collaboration with statistics experts Patience, humility, time… System open to extension and evolution Suggestions welcome!

Maria Grazia Pia, INFN Genova

Simple user layer Shields the user from the complexity of the underlying algorithms and design AIDA objectscomparison algorithm Only deal with AIDA objects and choice of comparison algorithm

Maria Grazia Pia, INFN Genova

Traceability Requirements Design Implementation Test & test results Documentation (coming...)

Maria Grazia Pia, INFN Genova binned Applies to binned distributions It can be useful also in case of unbinned distributions, but the data must be grouped into classes Cannot be applied if the counting of the theoretical frequencies in each class is < 5 When this is not the case, one could try to unify contiguous classes until the minimum theoretical frequency is reached binned Applies to binned distributions It can be useful also in case of unbinned distributions, but the data must be grouped into classes Cannot be applied if the counting of the theoretical frequencies in each class is < 5 When this is not the case, one could try to unify contiguous classes until the minimum theoretical frequency is reached Pearson’s  2

Maria Grazia Pia, INFN Genova The easiest among non-parametric tests continuous Verify the adaptation of a sample coming from a random continuous variable Based on the computation of the maximum distance between an empirical repartition function and the theoretical repartition one Test statistics: D = sup | F O (x) - F T (x)| The easiest among non-parametric tests continuous Verify the adaptation of a sample coming from a random continuous variable Based on the computation of the maximum distance between an empirical repartition function and the theoretical repartition one Test statistics: D = sup | F O (x) - F T (x)| Kolmogorov test ORIGINAL DISTRIBUTIONS EMPIRICAL DISTRIBUTION FUNCTION

Maria Grazia Pia, INFN Genova Problem of the two samples –mathematically similar to Kolmogorov’s maximum difference Instead of comparing an empirical distribution with a theoretical one, try to find the maximum difference between the distributions of the two samples F n and G m : D mn = sup |F n (x) - G m (x)| continuous Can be applied only to continuous random variables Conover (1971) and Gibbons and Chakraborti (1992) tried to extend it to cases of discrete random variables Problem of the two samples –mathematically similar to Kolmogorov’s maximum difference Instead of comparing an empirical distribution with a theoretical one, try to find the maximum difference between the distributions of the two samples F n and G m : D mn = sup |F n (x) - G m (x)| continuous Can be applied only to continuous random variables Conover (1971) and Gibbons and Chakraborti (1992) tried to extend it to cases of discrete random variables Kolmogorov-Smirnov test

Maria Grazia Pia, INFN Genova Goodman approximation of K-S test Goodman Goodman (1954) demonstrated that the Kolmogorov-Smirnov exact test statistics D mn = sup |F n (x) - G m (x)| can be easily converted into a  2 :  2 = 4D 2 mn [m*n / (m+n)] This approximated test statistics follows the  2 distribution with 2 degrees of freedom continuous Can be applied only to continuous random variables Goodman Goodman (1954) demonstrated that the Kolmogorov-Smirnov exact test statistics D mn = sup |F n (x) - G m (x)| can be easily converted into a  2 :  2 = 4D 2 mn [m*n / (m+n)] This approximated test statistics follows the  2 distribution with 2 degrees of freedom continuous Can be applied only to continuous random variables

Maria Grazia Pia, INFN Genova Similar to Kolmogorov test Based on the null hypothesis that the random continuous variable is normally distributed N(m,  2 ), with m and  2 unknown Performed comparing the empirical repartition function F(z 1,z 2,...,z n ) with the one of the standardized normal distribution  (z): D* = sup | F O (z) -  (z)| Similar to Kolmogorov test Based on the null hypothesis that the random continuous variable is normally distributed N(m,  2 ), with m and  2 unknown Performed comparing the empirical repartition function F(z 1,z 2,...,z n ) with the one of the standardized normal distribution  (z): D* = sup | F O (z) -  (z)| Lilliefors test

Maria Grazia Pia, INFN Genova Fisz-Cramer-von Mises test Problem of the two samples The test statistics contains a weight function Based on the test statistics: t = n 1* n 2 / (n 1 +n 2 ) 2  i [F 1 (x i ) – F 2 (x i )] 2 binned Can be performed on binned variables Satisfactory for symmetric and right-skewed distribution Problem of the two samples The test statistics contains a weight function Based on the test statistics: t = n 1* n 2 / (n 1 +n 2 ) 2  i [F 1 (x i ) – F 2 (x i )] 2 binned Can be performed on binned variables Satisfactory for symmetric and right-skewed distribution Based on the test statistics:  2 = integral (F O (x) - F T (x)) 2 dF(x) The test statistics contains a weight function unbinned Can be performed on unbinned variables Satisfactory for symmetric and right-skewed distributions Based on the test statistics:  2 = integral (F O (x) - F T (x)) 2 dF(x) The test statistics contains a weight function unbinned Can be performed on unbinned variables Satisfactory for symmetric and right-skewed distributions Cramer-von Mises test

Maria Grazia Pia, INFN Genova Anderson-Darling test Performed on the test statistics: A 2 = integral { [F O (x) – F T (x)] 2 / [F T (x) (1-F T (X))] } dF T (x) binnedunbinned Can be performed both on binned and unbinned variables The test statistics contains a weight function suitable to any data-set skewness Seems to be suitable to any data-set (Aksenov and Savageau ) with any skewness (symmetric distributions, left or right skewed) fat tail of distributions Seems to be sensitive to fat tail of distributions Performed on the test statistics: A 2 = integral { [F O (x) – F T (x)] 2 / [F T (x) (1-F T (X))] } dF T (x) binnedunbinned Can be performed both on binned and unbinned variables The test statistics contains a weight function suitable to any data-set skewness Seems to be suitable to any data-set (Aksenov and Savageau ) with any skewness (symmetric distributions, left or right skewed) fat tail of distributions Seems to be sensitive to fat tail of distributions

Maria Grazia Pia, INFN Genova Based on a quantity that remains invariant for any shift or re-parameterisation not work well on tails Does not work well on tails D* = max (F O (x)-F T (x)) + max (F T (x)-F O (x)) It is useful for observation on a circle, because the value of D* does not depend on the choice of the origin. Of course, D* can also be used for data on a line Based on a quantity that remains invariant for any shift or re-parameterisation not work well on tails Does not work well on tails D* = max (F O (x)-F T (x)) + max (F T (x)-F O (x)) It is useful for observation on a circle, because the value of D* does not depend on the choice of the origin. Of course, D* can also be used for data on a line Kuiper test

Maria Grazia Pia, INFN Genova Power of the tests In terms of power:  2 loses information in a test for continuous distribution by grouping the data into cells  Kac, Kiefer and Wolfowitz (1955) showed that D requires n 4/5 observations compared to n observations for  2 to attain the same power Cramer-von Mises and Anderson-Darling statistics are expected to be superior to D, since they make a comparison of the two distributions all along the range of x, rather than looking for a marked difference at one point 2222 2222 Kolmogorov- Smirnov Tests containing a weight function < < The power of a test is the probability of rejecting the null hypothesis correctly

Maria Grazia Pia, INFN Genova  2 design UR 1.1 The user shall be able to compare binned distributions by means of  2 test.

Maria Grazia Pia, INFN Genova Unit test:  2 (1) EXAMPLE FROM PICCOLO BOOK (STATISTICS - page 711)  2 test-statistics = 15.8 Expected  2 = 15.8 Exact p-value= Expected p-value= Months The study concerns monthly birth and death distributions (binned data)

Maria Grazia Pia, INFN Genova Unit test:  2 (2) EXAMPLE FROM CRAMER BOOK (MATHEMATICAL METHODS OF STATISTICS - page 447) The study concerns the sex distribution of children born in Sweden in 1935  2 test-statistics = Expected  2 = Exact p-value=0 Expected p-value=0

Maria Grazia Pia, INFN Genova ADB design UR 1.2 The user shall be able to compare binned distributions by means of Anderson- Darling test. WORK IN PROGRESS

Maria Grazia Pia, INFN Genova KSG Design UR 1.3 The user shall be able to compare unbinned distributions by means of Kolmogorov-Smirnov Goodman test.

Maria Grazia Pia, INFN Genova Unit test: K-S Goodman (1) EXAMPLE FROM PICCOLO BOOK (STATISTICS - page 711)  2 test-statistics = 3.9 Expected  2 = 3.9 Exact p-value= Expected p-value= Months The study concerns monthly birth and death distributions (unbinned data) Cumulative Function

Maria Grazia Pia, INFN Genova Unit test: K-S Goodman (2)  2 test-statistics = 1.5 Expected  2 = 1.5 EXAMPLE FROM LANDENNA BOOK (NONPARAMETRIC TESTS BASED ON FREQUENCIES - page 287) We consider body lengths of two independent groups of anopheles Exact p-value= Expected p-value= Body lengths

Maria Grazia Pia, INFN Genova KS Design UR 1.4 The user shall be able to compare unbinned distributions by means of Kolmogorov-Smirnov test.

Maria Grazia Pia, INFN Genova Unit test: Kolmogorov-Smirnov(1) EXAMPLE FROM D test-statistics = Expected D = Exact p-value= Expected p-value=0.035 The study concerns how long a bee stays near a particular tree (Redwell/Whitney) Cumulative

Maria Grazia Pia, INFN Genova Unit test: Kolmogorov-Smirnov (2) EXAMPLE FROM LANDENNA BOOK (NONPARAMETRIC STATISTICAL METHODS - page ) We consider one clinical parameter of two independent groups of patients D test-statistics = 0.65 Expected D = 0.65 Exact p-value= Expected p-value= Cumulative

Maria Grazia Pia, INFN Genova ADU Design UR 1.5 The user shall be able to compare unbinned distributions by means of Anderson- Darling test.

Maria Grazia Pia, INFN Genova CVMB Design UR 1.6 The user shall be able to compare binned distributions by means of Cramer- Von Mises test.

Maria Grazia Pia, INFN Genova CVMU Design UR 1.7 The user shall be able to compare unbinned distributions by means of Cramer- Von Mises test.

Maria Grazia Pia, INFN Genova...and more No time to illustrate all the algorithms and statistics details... Other components in progress, not only GoF –PDF –Toy Monte Carlo –(L. Lista et al., INFN Napoli, BaBar) A general purpose, open source tool for statistical data analysis –interest in HEP community: LCG, BaBar, CDF etc. –2 talks at PHYSTAT 2003, SLAC, 8-11 September 2003

Maria Grazia Pia, INFN Genova Do we need such sophisticated algorithms? Alfonso Mantero, Thesis, Univ. Genova, 2002  2 not appropriate (< 5 entries in some bins, physical information would be lost if rebinned) ESA test beam at Bessy, Bepi Colombo mission Anderson-Darling A c (95%) =0.752

Maria Grazia Pia, INFN Genova Status and plans  -release March st (preliminary) release June 2003 –basic algorithms –unit and system tests Recent developments –added new algorithms, improved design Work in progress –filtering –treatment of errors (uncertainties) In preparation –improved documentation –user examples (now use system tests as preliminary examples)

Maria Grazia Pia, INFN Genova Conclusions A lot of progress since last year’s workshop... –a lot of work, but also a lot of fun! A group of bright, enthusiastic, hard-working young collaborators... Ground for Geant4 Physics Book

Maria Grazia Pia, INFN Genova Physics Testing Project see talk on Validation, Thursday plenary

Maria Grazia Pia, INFN Genova More at IEEE-NSS, Portland, October 2003 B. Mascialino et al., A Toolkit for statistical data analysis L. Pandola et al., Precision validation of Geant4 electromagnetic physics