Alberto Ribon CERN Geant4Workshop Vancouver, September 2003 Tutorial of the Statistical Toolkit

What is the Test & Analysis project?

Test & Analysis is a project to develop a statistical analysis system for use in Geant4 testing.

Main application areas:
– physics validation
– regression testing
– system testing

It provides tools to compare Geant4 simulation results with reference data:
– equivalent reference distributions (for instance, regression testing)
– experimental measurements
– data libraries from reference distribution sources
– functions deriving from theoretical calculations or from fits

Geant4 Test team

The tests performed were established with the help of: Vladimir Ivanchenko (CERN), Michel Maire (IN2P3/LAPP)

Coordinators: Maria Grazia Pia (INFN, Genova), Andreas Pfeiffer (CERN)

Realisation of tests: Pablo Cirrone (INFN Catania), Susanna Guatelli (Genova, CERN), Alfonso Mantero (Genova, ESA), Luciano Pandola (INFN, Gran Sasso)

Automatisation/production of tests: Sandra Parlati (INFN, Gran Sasso)

Statistical analysis: Stefania Donadio (INFN, Genova), Barbara Mascialino (INFN, Genova), Maria Grazia Pia (INFN, Genova), Andreas Pfeiffer (CERN), Alberto Ribon (CERN), Paolo Viarengo (IST, Genova)

Study of possible tests to perform

Electromagnetic physics (G4LowE, G4Standard, G4Penelope):
– Particle CSDA range
– Particle stopping power
– Transmission coefficient
– Backscattering coefficient
– Gamma attenuation coefficient
– Gamma processes cross section
– Particle range
– Bremsstrahlung energy spectrum
– Multiple scattering test
– Energy deposition in absorber material
– etc.

Hadronic physics:
– Proton Bragg peak

test50: flexibility, plug-in

X-ray Attenuation Coefficient - Ge

[Plot: G4 Standard and G4 LowE compared with NIST-XCOM]

χ²(N-L) = 26.3, ν = 23, p = [value lost]
χ²(N-S) = 27.9, ν = 23, p = 0.22

CSDA Range – Al – G4LowE

[Plot: Geant4 compared with NIST-ESTAR; regression testing between Geant4 results]

Backscattering low energies - Au

Download/setup the Statistical Toolkit

Download: StatisticsTesting-V tgz tarball

You also need to install the following libraries:
1) GSL (GNU Scientific Library)
2) AIDA 3.0
3) Anaphe

Edit the script buildAll.py to set the proper GSL path:

    GSL_DIR=/afs/cern.ch/sw/lhcxx/specific/redhat73/gcc3.2/PublicDomainPackages/2.0.0/

Run the script:

    $ ./buildAll.py

Statistical tests available

The statistical tests currently available are:
1) Chi2 test (for binned distributions);
2) Kolmogorov-Smirnov test (for unbinned distributions).

Planned for the near future:
3) Cramer-von Mises test (for both binned and unbinned distributions);
4) Anderson-Darling test (for both binned and unbinned distributions);
…

The AIDA classes you need to know

AIDA::IHistogram1D : a 1-dimensional binned distribution
AIDA::ICloud1D : a 1-dimensional unbinned distribution
AIDA::IDataPointSet : a vector of IDataPoint
IDataPoint : a vector of IMeasurement
IMeasurement : (value, errorPlus, errorMinus)

The StatisticsTesting classes you need to know

    namespace StatisticsTesting {

      // Algorithm is, for instance, Chi2ComparisonAlgorithm or
      // KolmogorovSmirnovComparisonAlgorithm.
      template <class Algorithm>
      class StatisticsComparator {
      public:
        ComparisonResult compare( const AIDA::IDataPointSet& dps1,
                                  const AIDA::IDataPointSet& dps2 );
        ComparisonResult compare( const AIDA::IHistogram1D& histo1,
                                  const AIDA::IHistogram1D& histo2 );
        ComparisonResult compare( const AIDA::ICloud1D& cloud1,
                                  const AIDA::ICloud1D& cloud2 );
        ...
      };

      class ComparisonResult {
      public:
        double distance();
        double quality();
        double ndf();
        ...
      };

    }
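The comparator is parameterised on the algorithm, so the same compare() call can run different tests. The following is a minimal, self-contained sketch of that design idea only: it works on plain std::vector<double> rather than AIDA objects, and every name in it (Result, Comparator, MaxDifferenceAlgorithm, SumOfSquaresAlgorithm, calculate) is hypothetical and not part of the toolkit.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the toolkit's ComparisonResult.
struct Result {
    double distance;
    double ndf;
};

// Hypothetical algorithm: largest absolute difference between paired values.
struct MaxDifferenceAlgorithm {
    static Result calculate(const std::vector<double>& a,
                            const std::vector<double>& b) {
        assert(a.size() == b.size());
        double d = 0.0;
        for (std::size_t i = 0; i < a.size(); ++i) {
            d = std::max(d, std::fabs(a[i] - b[i]));
        }
        return {d, static_cast<double>(a.size())};
    }
};

// Hypothetical algorithm: sum of squared differences between paired values.
struct SumOfSquaresAlgorithm {
    static Result calculate(const std::vector<double>& a,
                            const std::vector<double>& b) {
        assert(a.size() == b.size());
        double s = 0.0;
        for (std::size_t i = 0; i < a.size(); ++i) {
            s += (a[i] - b[i]) * (a[i] - b[i]);
        }
        return {s, static_cast<double>(a.size())};
    }
};

// The comparator only forwards to the algorithm chosen at compile time.
template <class Algorithm>
class Comparator {
public:
    Result compare(const std::vector<double>& a,
                   const std::vector<double>& b) const {
        return Algorithm::calculate(a, b);
    }
};
```

Switching the test then means changing only the template argument, for example Comparator<MaxDifferenceAlgorithm> versus Comparator<SumOfSquaresAlgorithm>; the calling code stays the same.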

Chi2 test between histograms

    #include "AIDA/AIDA.h"
    #include "StatisticsTesting/StatisticsComparator.h"
    #include "Chi2ComparisonAlgorithm.h"
    #include "ComparisonResult.h"
    #include <iostream>
    #include <memory>

    using namespace StatisticsTesting;

    // Create the AIDA factories and an in-memory tree.
    std::auto_ptr<AIDA::IAnalysisFactory> af( AIDA_createAnalysisFactory() );
    std::auto_ptr<AIDA::ITreeFactory> tf( af->createTreeFactory() );
    std::auto_ptr<AIDA::ITree> tree( tf->create() );
    std::auto_ptr<AIDA::IHistogramFactory> hf( af->createHistogramFactory( *tree ) );

    // Two histograms with identical binning.
    AIDA::IHistogram1D& hA = *( hf->createHistogram1D( "A", 100, 0.0, 50.0 ) );
    AIDA::IHistogram1D& hB = *( hf->createHistogram1D( "B", 100, 0.0, 50.0 ) );
    hA.fill( 15.7 ); ...
    hB.fill( 23.4 ); ...

    // Compare them with the Chi2 algorithm.
    StatisticsComparator<Chi2ComparisonAlgorithm> comparator;
    ComparisonResult result = comparator.compare( hA, hB );
    std::cout << " distance=" << result.distance()
              << " ndf=" << result.ndf()
              << " p-value=" << result.quality() << std::endl;
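To make the quantities behind distance() and ndf() concrete, here is a minimal standalone sketch of a chi-square comparison of two histograms with identical binning. It is not the toolkit's implementation: the function name, the Chi2Result struct and the constraints parameter are illustrative assumptions, and the statistic used is the common textbook form for two sets of unweighted bin counts with possibly different totals, sum over bins of (sqrt(B/A) a_i - sqrt(A/B) b_i)^2 / (a_i + b_i), where A and B are the total contents.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Illustrative result type (not the toolkit's ComparisonResult).
struct Chi2Result {
    double chi2;
    int ndf;
};

// Chi-square statistic between two sets of bin counts with the same binning.
// Bins that are empty in both histograms are skipped. How many constraints
// to subtract from the number of used bins is left to the caller.
Chi2Result chi2BinnedComparison(const std::vector<double>& a,
                                const std::vector<double>& b,
                                int constraints = 0) {
    assert(a.size() == b.size());
    double sumA = 0.0;
    double sumB = 0.0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        sumA += a[i];
        sumB += b[i];
    }
    if (sumA <= 0.0 || sumB <= 0.0) {
        throw std::invalid_argument("both histograms need entries");
    }
    // Scale factors that account for different total contents.
    const double scaleA = std::sqrt(sumB / sumA);
    const double scaleB = std::sqrt(sumA / sumB);
    double chi2 = 0.0;
    int usedBins = 0;
    for (std::size_t i = 0; i < a.size(); ++i) {
        const double total = a[i] + b[i];
        if (total <= 0.0) continue;  // empty in both histograms
        const double diff = scaleA * a[i] - scaleB * b[i];
        chi2 += diff * diff / total;
        ++usedBins;
    }
    return {chi2, usedBins - constraints};
}
```

The p-value is deliberately left out of this sketch. Since GSL is already a dependency of the toolkit, one option would be the upper-tail chi-square probability gsl_cdf_chisq_Q(chi2, ndf).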

Kolmogorov-Smirnov test between clouds

    #include "AIDA/AIDA.h"
    #include "StatisticsTesting/StatisticsComparator.h"
    #include "KolmogorovSmirnovComparisonAlgorithm.h"
    #include "ComparisonResult.h"
    #include <iostream>
    #include <memory>

    using namespace StatisticsTesting;

    // Create the AIDA factories and an in-memory tree.
    std::auto_ptr<AIDA::IAnalysisFactory> af( AIDA_createAnalysisFactory() );
    std::auto_ptr<AIDA::ITreeFactory> tf( af->createTreeFactory() );
    std::auto_ptr<AIDA::ITree> tree( tf->create() );
    std::auto_ptr<AIDA::IHistogramFactory> hf( af->createHistogramFactory( *tree ) );

    // Two unbinned distributions.
    AIDA::ICloud1D& cloudA = *( hf->createCloud1D( "A" ) );
    AIDA::ICloud1D& cloudB = *( hf->createCloud1D( "B" ) );
    cloudA.fill( 15.7 ); ...
    cloudB.fill( 23.4 ); ...

    // Compare them with the Kolmogorov-Smirnov algorithm.
    StatisticsComparator<KolmogorovSmirnovComparisonAlgorithm> comparator;
    ComparisonResult result = comparator.compare( cloudA, cloudB );
    std::cout << " K-S distance=" << result.distance()
              << " p-value=" << result.quality() << std::endl;
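For reference, the two-sample Kolmogorov-Smirnov distance is the largest absolute difference between the empirical cumulative distributions of the two samples. The sketch below computes it for plain vectors of unbinned values; it is a standalone illustration, not the toolkit's code, and the function name ksDistance is an assumption. The p-value is not computed here.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Two-sample Kolmogorov-Smirnov distance: max |F_a(x) - F_b(x)| over all x,
// where F_a and F_b are the empirical cumulative distribution functions.
// The samples are taken by value because they are sorted internally.
double ksDistance(std::vector<double> a, std::vector<double> b) {
    if (a.empty() || b.empty()) {
        throw std::invalid_argument("both samples need at least one entry");
    }
    std::sort(a.begin(), a.end());
    std::sort(b.begin(), b.end());

    std::size_t i = 0;
    std::size_t j = 0;
    double distance = 0.0;
    while (i < a.size() && j < b.size()) {
        // Step to the next distinct value and move both samples past it,
        // so that tied values are handled together.
        const double x = std::min(a[i], b[j]);
        while (i < a.size() && a[i] <= x) ++i;
        while (j < b.size() && b[j] <= x) ++j;
        const double fa = static_cast<double>(i) / a.size();
        const double fb = static_cast<double>(j) / b.size();
        distance = std::max(distance, std::fabs(fa - fb));
    }
    // Once one sample is exhausted its CDF is 1 and the gap can only shrink.
    return distance;
}
```

Because the statistic uses every individual value, no binning choice enters the result, which is why the test suits unbinned data such as AIDA clouds.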

Example of an XML data file

    <dataPointSet dimension="2" name="attenuation coefficient" path="/" title="attenuation coefficient in Ge"> …

How to write an XML data file

    aFact = AIDA_createAnalysisFactory();
    treeFact = aFact->createTreeFactory();
    …
    // Create a new, writable XML store.
    theTree = treeFact->create( "test50.xml", "xml", false, true, "uncompress" );
    dataPointFactory = aFact->createDataPointSetFactory( *theTree );
    // A 2-dimensional data point set: (energy, transmitted fraction).
    particleTransmissionDataPoint = dataPointFactory->create( "Transmission test", 2 );
    …
    // Append one point and fill its two coordinates.
    particleTransmissionDataPoint->addPoint();
    AIDA::IDataPoint* point = particleTransmissionDataPoint->point( PointNumber );
    AIDA::IMeasurement* coordinateX = point->coordinate( 0 );
    coordinateX->setValue( primaryParticleEnergy );
    AIDA::IMeasurement* coordinateY = point->coordinate( 1 );
    coordinateY->setValue( TransFraction );
    coordinateY->setErrorPlus( TransError );
    coordinateY->setErrorMinus( TransError );
    …
    // Write the tree to file and close it.
    theTree->commit();
    theTree->close();

How to compare XML dataPointSets

    #include "AIDA/AIDA.h"
    #include "StatisticsTesting/StatisticsComparator.h"
    #include "Chi2ComparisonAlgorithm.h"
    #include "ComparisonResult.h"
    #include <iostream>
    #include <memory>

    using namespace StatisticsTesting;

    std::auto_ptr<AIDA::IAnalysisFactory> af( AIDA_createAnalysisFactory() );
    std::auto_ptr<AIDA::ITreeFactory> tf( af->createTreeFactory() );
    std::auto_ptr<AIDA::ITree> tree( tf->create() );
    std::auto_ptr<AIDA::IDataPointSetFactory> dpsf( af->createDataPointSetFactory( *tree ) );

    // Open the two XML files read-only.
    std::auto_ptr<AIDA::ITree> treeXML1( tf->create( "gamma_lowE_Ge.xml", "xml", true, false ) );
    std::auto_ptr<AIDA::ITree> treeXML2( tf->create( "NIST_attenuationGamma_Ge.xml", "xml", true, false ) );

    // Retrieve the data point sets by name.
    AIDA::IDataPointSet& dps1 = *( dynamic_cast<AIDA::IDataPointSet*>(
        treeXML1->find( "Gamma attenuation coefficient test" ) ) );
    AIDA::IDataPointSet& dps2 = *( dynamic_cast<AIDA::IDataPointSet*>(
        treeXML2->find( "Gamma attenuation coefficient test" ) ) );
    std::cout << " title=" << dps1.title() << " size=" << dps1.size()
              << " dimension=" << dps1.dimension() << std::endl;

    StatisticsComparator<Chi2ComparisonAlgorithm> comparator;
    ComparisonResult result = comparator.compare( dps1, dps2 );
    std::cout << " distance=" << result.distance()
              << " ndf=" << result.ndf()
              << " p-value=" << result.quality() << std::endl;
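Data point sets carry a value and an error for each point, so a chi-square comparison of two of them can weight each difference by the combined uncertainty. The sketch below shows one common form, the sum over points of (y1 - y2)^2 / (σ1^2 + σ2^2), for two sets sampled at the same x positions. Whether the toolkit uses exactly this form is an assumption; the Point struct, the PointSetChi2 struct and the function name are hypothetical, and errors are taken as symmetric for simplicity.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical, simplified stand-in for one y-coordinate of a data point:
// a value with a symmetric error.
struct Point {
    double value;
    double error;
};

// Illustrative result type (not the toolkit's ComparisonResult).
struct PointSetChi2 {
    double chi2;
    int ndf;
};

// Chi-square between two point sets taken at the same x positions.
// Pairs whose combined variance is zero cannot be weighted and are skipped.
PointSetChi2 chi2PointSets(const std::vector<Point>& simulated,
                           const std::vector<Point>& reference) {
    assert(simulated.size() == reference.size());
    double chi2 = 0.0;
    int usedPoints = 0;
    for (std::size_t i = 0; i < simulated.size(); ++i) {
        const double variance = simulated[i].error * simulated[i].error +
                                reference[i].error * reference[i].error;
        if (variance <= 0.0) continue;
        const double diff = simulated[i].value - reference[i].value;
        chi2 += diff * diff / variance;
        ++usedPoints;
    }
    return {chi2, usedPoints};
}
```

In this form, a point with a large uncertainty contributes little to the total, so the statistic reflects disagreement relative to the quoted errors rather than absolute differences.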

Conclusions

– The Statistical Toolkit already provides some important functionality;
– it is already used in test50 for Geant4 physics tests and regression testing;
– it is simple to install and to use;
– we are working on the documentation;
– other statistical tests are under development;
– various long-term extensions are foreseen …