September CHEP 2004 LCG Generator P. Bartalini (University of Florida) F. Ambroglini, S. Belov, J. Cuevas Maestro, L. Dudko, M. Kirsanov, S. Makarychev, F. Moortgat, H. Naves, A. Ribon, A. Sherstnev, I. Seluzhenkov, V. Uzhinsky, A. Vologdin
P.Bartalini – CHEP September LCG Simulation Project Organization Simulation Project G.Cosmo Framework V. Pokorski Geant4 J.Apostolakis FLUKA Integration A.Ferrari Physics Validation F.Gianotti/A.Ribon Shower Param Geant4 Project FLUKA Project Experiment Validation WP Generator Services P.Bartalini MC4LHC WP
P.Bartalini – CHEP September LCG Generator Florida (Coordination) CERN (Event Interfaces) LCG-Russia (Library & Data Base) LCG-Spain (Production) Collaboration with independent projects: LCG-UK (Validation, New MCs) Contact persons/Collaborators in MC Projects and LHC Experiments GOAL: to guarantee the generator support for LHC Started May 2003 Long Term Project Workshop on MC’s for the LHC (MC4LHC) CERN, July 7 - August WPA: GENERATOR SERVICES LIBRARY (GENSER) WPB: TUNING AND VALIDATION WPC: SHARED EVENT FILES: PRODUCTION & DATA BASE WPD: EVENT FORMATS AND EVENT INTERFACES GENSER ReviewGENSER Review CERN, March GENSER Review
P.Bartalini – CHEP September Between Two Different Worlds Small Theoretical groups Small Theoretical groups Huge fortran packages still in development Huge fortran packages still in development Need help with the new OO packages Need help with the new OO packages Need to share user support duties Need to share user support duties Large Experimental Collaborations Complex OO Simulation Frameworks Need easy access to many generators, responsive user support, bug fixes etc.
P.Bartalini – CHEP September WPA. The LCG Generator Library (GENSER)GENSER Mandate: Collaborate with MC authors to prepare LCG Compliant Code HERWIG (P.Richardson), PYTHIA (T.Sjöstrand), HIJING (X.N. Wang), ISAJET (F.E. Paige), LHAPDF (M.Whalley), ALPGEN (M.Mangano), COMPHEP (A.Sherstnev), EVTGENLHC (P. Robbe), GLAUBER (V.Uzhinsky), FROZEN (PHOTOS, PDFLIB), ORIGINAL CODE (Validation, Data Base, Production) Future: DPMJET, GRACE, MADGRAPH, NEXUS, PHOJET, SFM, TAUOLA, HERWIG++, PYTHIA 7, SHERPA, etc. Clients: Addressed to LHC experimentalists and theorists both at CERN and in external laboratories. (Other users welcome!) Tested by all the LHC experiments Quarterly Release Scheme Librarian ~ 1 FTE (MSU, ITEP, Troitsk, Protvino, CERN) ~ 1 FTE One person based at CERN another one rotating with ~ quarterly shifts GOAL: to replace the obsolete CERN Library ATLAS PRODUCTION RELIES ON GENSER !!!
P.Bartalini – CHEP September WPA. GENSER: Policy and StructureGENSER Releases decoupled from large library releases Quarterly official releases, quick bug fixes and special versions Most of the MC package versions released by the authors are installed Old versions have to be maintained as long as they are used Manage to provide all the requested LCG supported platforms CVS repository, AFS, Compact Tarball distribution, LCG Tools SCRAM release and building tool for librarian and end-users Binary distribution also provided Test/Validation software (provided by authors, end-users and librarians) Installed in the «Example» and «Test» modules Code development for WPA, WPB, WPC, WPD Documentation: Savannah Portal: AFS: /afs/cern.ch/sw/lcg/app/releases/GENSER
P.Bartalini – CHEP September WPA. MC Packages: Inside or Outside GENSER ? GENSER Three possibilities: Three possibilities: 1) To develop the MC package in GENSER: MCDB, EVTGENLHC EVTGENLHC derives from EVTGEN: EVTGENLHC derives from EVTGEN: Provided (Adapted) by LHCb Contact person P.Robbe Provided (Adapted) by LHCb Contact person P.Robbe LHCb has full access to the package in the GENSER repository LHCb has full access to the package in the GENSER repository Other contributors have a limited access (development of new decay models, decay files etc.) Other contributors have a limited access (development of new decay models, decay files etc.) 2) To fully export the MC generator code in GENSER defining the corresponding sub-package: MOST OF THE INSTALLED MCs 3) To install the MC generator as external software packages in the LCG environment and to store in GENSER just tests suites and other related code COMPHEP, EVTGEN Just a technical issue! For each MC package an ad-hoc solution is found taking into account the user requirements
P.Bartalini – CHEP September WPA. GENSER Repository Structure GENSER GENSER_0_2_0 | |--- config/ |--- doc/ |--- include/ |--- logs/ |--- release.notes |--- rh73_gcc323/lib/ |--- release.notes |--- rh73_gcc323/lib/ bin/ bin/ tests/ tests/ | ---include/ | ---include/ |--- src/herwig/data/ doc/ doc/ dummy/ dummy/ examples/ examples/ includes/ includes/ src/ src/ MC Package version provided as CVS Tag
P.Bartalini – CHEP September GENSER_0_1_0: Mid Apr (rh73_gcc32) GENSER_0_2_0: Mid Sep (+ rh73_gcc323) GENSER_1_0_0: Mid Dec (+ Scientific Linux) PYTHIA: 6.205, 6.217, 6.220, 6.221, 6.222, 6.223, 6.224, 6.304, … HERWIG: 6.500, 6.503, 6.504, 6.504b1, 6.505, … JIMMY: 4.1, … ISAJET: 7.67, 7.69, … EvtGen: alpha EvtGenLHC: 1.0, … AlpGen: 1.3.2, … CompHEP: 4.4.0, … LHAPDF: 1.1, 2.0, … PDFLIB: 804 PHOTOS: 202, 207 Glauber Xs (Heavy Ions): 1.0, … HIJING (Heavy Ions): 1.36, 1.37, 1.383, 1.383b, … Packages in the waiting list, with particular emphasis on SHERPA GENSER_0_2_0: 11 Generators 2 PDF Packages 30 Versions WPA.
P.Bartalini – CHEP September WPB. Monte Carlo Validation and Tuning GOAL: to cross-check MCs and compare with data Basic Sanity Checks Reference distributions (multiplicities, P T Spectra etc.) Standalone Studies Work on GENSER subpackages (Librarian, Beta Testers in Experiments) ALPGEN Validation (Perugia) Essential to verify ALPGEN in GENSER HIJING Validation (JINR, Dubna) Encouraged to contribute to JetWeb Validation Framework JetWeb: in production LCG-UK (U.C.London) [Comp. Phys. Comm. vol 153/ (2003)]Comp. Phys. Comm. vol 153/ (2003) Database of Data, MC & Comparisons Web interface, Job submission Will switch soon to GENSER ?
P.Bartalini – CHEP September Production framework June 2004 Proposal June 2004 Design Available. Prototype Stage Active institutions: CERN, Santander, Oviedo Production centre Configuration & Book-keeping ~ 1 FTE from LCG Russia (MSU, ITEP, JINR) MCDB: version Deployed! WPC: Production of Shared Event Files GOAL: to produce certified generator level events Will Be Officially Adopted By CMS Tested At Fermilab Tested By CMS Use them for benchmarks, comparisons & combinations in LHC W.G. Improve the quality of shared samples with respect to LEP W.G. era !!!
P.Bartalini – CHEP September WPC. Monte Carlo Data Base (MCDB) Motivations Motivations To Provide Configuration, Book-keeping, Documentation, Storage for the Shared Event Files To Provide Configuration, Book-keeping, Documentation, Storage for the Shared Event Files To keep track of the full generation chain, Exploiting the Competences of Monte Carlo Experts and Monte Carlo Authors To keep track of the full generation chain, Exploiting the Competences of Monte Carlo Experts and Monte Carlo Authors CMS MCDB CMS MCDB Only parton level files; AFS storage; No Searchable; No SQL Only parton level files; AFS storage; No Searchable; No SQL LCG MCDB [hep-ph/ ] LCG MCDB [hep-ph/ ]hep-ph/ Same authors + Additional human resources and technical support Same authors + Additional human resources and technical support Core software supported by LCG Software Project Infrastructure Core software supported by LCG Software Project Infrastructure MySQL; POOL; CASTOR (RFIO); CGI; Perl; Apache MySQL; POOL; CASTOR (RFIO); CGI; Perl; Apache Web Interface, Dedicated Web Server Web Interface, Dedicated Web Server version supports only Mozilla Browser version supports only Mozilla Browser
P.Bartalini – CHEP September WPC. MCDB: SQL Structure and Access Policy End-User Read articles, comments, news Read articles, comments, news Download event files Download event filesAuthor Post articles Post articles Upload event files Upload event files Post news and answer the comments Post news and answer the commentsModerator Manage users profiles Manage users profiles Moderate articles and comments Moderate articles and commentsAdministrator Control the system Control the system Technical database management Technical database management Application Package Interface Automated articles posting/retrieving and samples upload/download Automated articles posting/retrieving and samples upload/download SQL Table Structure
P.Bartalini – CHEP September WPC. MCDB Web Interface Screen-shot Only Mozilla Browser Supported (for the time being)
P.Bartalini – CHEP September WPD. Event Formats and Event Interfaces The modularisation The modularisation Basic idea in ThePEG, PYTHIA 7, HERWIG++, SHERPA Basic idea in ThePEG, PYTHIA 7, HERWIG++, SHERPA From April 2004 LCG Generator participates to the development of ThePEG From April 2004 LCG Generator participates to the development of ThePEG We started with improving the doxygen documentation (A.Ribon) Activity coordinated with Phenogrid (N.Glover et al.) Relevant for the Herwig++ development The MC truth Interface The MC truth Interface HEPML proposal (XML - Les Houches Agreement I compliant) HEPML proposal (XML - Les Houches Agreement I compliant) Meta-data format facilitating automated documentation HEPMC HEPMC Under the responsibility of CLHEP (still some problems with translators) GOAL: standardize interfaces, support the new OO MCs
P.Bartalini – CHEP September WPD. HEPML Proposal All data in event file are divided in two parts All data in event file are divided in two parts Header - describe the general information Header - describe the general information Author, the file itself (creation date etc.), supported specifications, collider description etc. Author, the file itself (creation date etc.), supported specifications, collider description etc. Generator specific info, cuts, physical parameters etc. Generator specific info, cuts, physical parameters etc. Info for parser (format of event records etc.) Info for parser (format of event records etc.) Event Records - variable data of events written in some compact format to one string (particle momenta, color chains etc.) Event Records - variable data of events written in some compact format to one string (particle momenta, color chains etc.) Header is stored in a txt file with XML Syntax Header is stored in a txt file with XML Syntax Event Records are (zip) compressed and attached to the Header file Event Records are (zip) compressed and attached to the Header file (COMPHEP Collaboration) [S.Sherstnev]
P.Bartalini – CHEP September WPD. XML in HEP A.Kryukov Skobeltsyn Institute of Nuclear Physics On behalf of CompHEP Collaboration Presented to LCG-Generator Why XML ? Why XML ? Describe the document structure Describe the document structure Extensible language Extensible language WEB ready WEB ready Well defined open standard Well defined open standard Available tools to treat of XML Available tools to treat of XML
P.Bartalini – CHEP September LCG Generator Future Plans MILESTONES MILESTONES Q4/2004-WPA- GENSER: release version (ref. MC RTAG report) Q4/2004-WPA- GENSER: release version (ref. MC RTAG report)MC RTAG reportMC RTAG report Q1/2005-WPA- SHERPA (F.Krauss et al.) fully integrated in GENSER Q1/2005-WPA- SHERPA (F.Krauss et al.) fully integrated in GENSER Q2/2005-WPC- Generator level production framework: beta version Q2/2005-WPC- Generator level production framework: beta version Q3/2005-WPD- First test of ThePEG and EvtGenLHC integration in Herwig++ (and Possible insertion of Herwig++ in GENSER) Q3/2005-WPD- First test of ThePEG and EvtGenLHC integration in Herwig++ (and Possible insertion of Herwig++ in GENSER) Q3/2005-WPC- Production centre integrated in the grid-middleware Q3/2005-WPC- Production centre integrated in the grid-middleware Q4/2005-WPC- Generator level production framework: release version Q4/2005-WPC- Generator level production framework: release version Q1/2006-WPC- MCDB Integration with Generator level production framework, experiment specific APIs and management of large files Q1/2006-WPC- MCDB Integration with Generator level production framework, experiment specific APIs and management of large files WPB milestones to be verified with other independent projects WPB milestones to be verified with other independent projects Enphasis on the collaboration with Object Oriented MC projects
P.Bartalini – CHEP September WEB page Links to relevant documentation, CVS repository, release.notes etc. CDS Agenda HomeCDS Agenda Home > Projects > LHC Computing Grid > Physics GeneratorsProjectsLHC Computing GridPhysics Generators Minutes of meetings, slides of presentations Meetings: Last Thursday of the month at 5 PM in CERN-32-1-A24 & VRVS (Last and Next meetings anticipated to SEP 23 and OCT 21 respectively) Next meeting will be devoted to the Management of GENSER Releases Applications area mailing list: Organisational Issues
P.Bartalini – CHEP September Backup General
P.Bartalini – CHEP September Agenda of LCG Generator Review (March 2004) 13:45Introduction ( Bartalini, P.) 13:50LHAPDF v2 ( Whalley, M.) 14:20GENSER, the generator library ( Alexander, S.) 14:50GENSER validation ( Ambroglini, F.) 15:05User experience with GENSER ( Hinchliffe, I.) 15:25The LCG generator library (Discussion) ( all ) 16:25---Coffee--- 16:40A framework for generator level production&validation ( Bartalini, P.) 16:55The MCDB project ( Dudko, L.) 17:10MCDB Implementation ( Vologdin, A.) 17:25The HEPCODE project ( Stirling, W.J.) 17:35Event formats & Monte Carlo data base. (Discussion) ( all )
P.Bartalini – CHEP September WPC. Simple Generator Level Production Framework Progress on the simple generator level production framework were achieved during the quarter. Progress on the simple generator level production framework were achieved during the quarter. A Prototype is being developed by H.Naves and J.Cuevas (Santander). A Prototype is being developed by H.Naves and J.Cuevas (Santander). It reads generator cards, produces events, performs basic event selection, and writes out using the HEPMC structure in POOL and in ROOT formats. It reads generator cards, produces events, performs basic event selection, and writes out using the HEPMC structure in POOL and in ROOT formats.
P.Bartalini – CHEP September Backup MCDB
P.Bartalini – CHEP September MCDB Introduction Correct Monte Carlo simulation of complicated processes requires rather sophisticated expertise Different physics groups often are in need of the same MC samples Public availability of the event files speeds up their validation Central and public location where well-documented event files can be found would be very useful [S.Belov]
P.Bartalini – CHEP September CMS MCDB Advantages Based on web technologies Based on web technologies Stores detailed documentation for each set of event samples Stores detailed documentation for each set of event samples Storing data on AFS let make its easy of access Storing data on AFS let make its easy of access Provide rapid communication between users and experts Provide rapid communication between users and experts Problems Stores parton level events only Stores parton level events only The expected number of physics processes in the database is several hundred (data are stored on AFS so there is a space limitation) The expected number of physics processes in the database is several hundred (data are stored on AFS so there is a space limitation) Cannot serve sophisticated queries owing to it isn’t SQL based Cannot serve sophisticated queries owing to it isn’t SQL based [S.Belov]
P.Bartalini – CHEP September Main Features of LCG MCDB The most important reason to develop LCG MCDB is to expel the restrictions of CMS MCDB An SQL-based database An SQL-based database Wide search abilities Wide search abilities Possibility to keep the events at particle level as well as at partonic level Possibility to keep the events at particle level as well as at partonic level Direct programming interface from LCG collaboration software Direct programming interface from LCG collaboration software Inheritance of all the advantages of the predecessor - CMS MCDB Inheritance of all the advantages of the predecessor - CMS MCDB [S.Belov]
P.Bartalini – CHEP September MCDB: General Conceptions and Terms Event file (sample) – the file containing particle or partonic events Event file (sample) – the file containing particle or partonic events Article – a document describing the set of samples Article – a document describing the set of samples Event data – the events themselves, all the data are stored into event files Event data – the events themselves, all the data are stored into event files Event metadata – information which describes all events in a sample in general (beam description, physics parameters, applied cuts, etc.) Event metadata – information which describes all events in a sample in general (beam description, physics parameters, applied cuts, etc.) [S.Belov]
P.Bartalini – CHEP September MCDB Interfaces Search form Search form Article management (creating, changing, removing) Article management (creating, changing, removing) Downloading sample files Downloading sample files User management (registration, editing profiles, removing) User management (registration, editing profiles, removing) Editing metadata on physical models and parameters Editing metadata on physical models and parameters Moderating articles and user comments on them Moderating articles and user comments on them Application programming interface to allow program-based article creation Application programming interface to allow program-based article creation Comment the articles Comment the articles [S.Belov]
P.Bartalini – CHEP September Access Ways to LCG MCDB End-User Read articles, comments, news Read articles, comments, news Download event files Download event filesAuthor Post articles and upload event files Post articles and upload event files Post news and answer the comments Post news and answer the commentsModerator Manage users profiles Manage users profiles Moderate articles and comments Moderate articles and commentsAdministrator Control the system Control the system Technical database management Technical database management Application software Automatic articles posting and samples upload Automatic articles posting and samples upload [S.Belov]
P.Bartalini – CHEP September The Most Common Operations with MCDB Users in general 1) Browses articles via the tree view of physics categories 2) Searches by author, title, keywords, physical processes, parameters values, etc. 3) Downloads desired sample after agreement on MCDB License Articles’ authors 1) Uploading event files 2) Completing article with the information that wasn’t not included in the description of an event file [S.Belov]
P.Bartalini – CHEP September MCDB: Software Selection SQL DBMS: MySQL SQL DBMS: MySQL Script system: Perl Script system: Perl Web-server: Apache2 Web-server: Apache2 Web-interface: CGI+Perl Web-interface: CGI+Perl CASTOR: RFIO package CASTOR: RFIO package All the products are supported as standard software in LCG All the products are supported as standard software in LCG [A.Sherstnev]
P.Bartalini – CHEP September SQL tables in MCDB Authors Authors Articles Articles Categories Categories Generators and their parameters Generators and their parameters Models and parameters names Models and parameters names Model parameters’ values Model parameters’ values Processes, initial and final states Processes, initial and final states Applied cuts Applied cuts Event files, graph files Event files, graph files and a few other tables [S.Belov]
P.Bartalini – CHEP September MCDB: Current Status SQL tables structure: done SQL tables structure: done Internal Perl modules (access to the MCDB SQL tables, parse configuration file, many others): partly done Internal Perl modules (access to the MCDB SQL tables, parse configuration file, many others): partly done User area: User area: Web site: first prototype done (mcdb.cern.ch) Web site: first prototype done (mcdb.cern.ch) Web catalog script: done Web catalog script: done Author area Author area Web interface for creation of articles: done Web interface for creation of articles: done Uploading of files to HDD: done, to CASTOR: in progress Uploading of files to HDD: done, to CASTOR: in progress Web interface for editing of articles: in progress Web interface for editing of articles: in progress Administrative area: Administrative area: User management interface: done User management interface: done Article management interface: in progress Article management interface: in progress [A.Sherstnev]
P.Bartalini – CHEP September MCDB: Short Term Plans Documentation Documentation Internal programming interface with CASTOR Internal programming interface with CASTOR Web-interface for editing of articles Web-interface for editing of articles First real tests First real tests Web access control: internal login/password, GRID certificates Web access control: internal login/password, GRID certificates Internal access control: to provide author’s privacy Internal access control: to provide author’s privacy Web-server tuning Web-server tuning Providing of compatibility with different browsers (IE, Netscape, Opera) Providing of compatibility with different browsers (IE, Netscape, Opera) HEPML: developing of XML tag set HEPML: developing of XML tag set Main milestone: first working prototype in December Main milestone: first working prototype in December [A.Sherstnev]
P.Bartalini – CHEP September MCDB: Mid Term Plans Search engine in MCDB SQL database Search engine in MCDB SQL database Moderator interface (to manage articles and related objects, user comments, Web catolog) Moderator interface (to manage articles and related objects, user comments, Web catolog) Internal parser for HEPML Internal parser for HEPML Security system Security system Logging system Logging system System of collection of statistics System of collection of statistics MCDB Integration APIs MCDB Integration APIs [A.Sherstnev]
P.Bartalini – CHEP September Backup A.Kryukov Skobeltsyn Institute of Nuclear Physics On behalf of CompHEP Collaboration Presented to LCG-Generator XML in HEP
P.Bartalini – CHEP September Outlook Why XML ? Why XML ? Describe the document structure Describe the document structure Extensible language Extensible language WEB ready WEB ready Well defined open standard Well defined open standard Available tools to treat of XML Available tools to treat of XML [A.Kryukov]
P.Bartalini – CHEP September Outlook (continue) Experimental events and/or events simulation (illustration) Experimental events and/or events simulation (illustration) Detector models Detector models HEP models HEP models Partonic event structure Partonic event structure Hadronized events Hadronized events Digitized events Digitized events Conclusion Conclusion [A.Kryukov]
P.Bartalini – CHEP September Why XML ? XML describe the document structure. XML describe the document structure. Most information in HEP can be and must be presented as a tree structure or in more general case acyclic graph Most information in HEP can be and must be presented as a tree structure or in more general case acyclic graph XML is extensible language XML is extensible language XML has no fixed set of targets and attributes. User can introduce own targets and attributes. XML has no fixed set of targets and attributes. User can introduce own targets and attributes. [A.Kryukov]
P.Bartalini – CHEP September Why XML (continue) ? WEB ready WEB ready XML document can be distribute through Internet XML document can be distribute through Internet XML document can be displayed in different way depends on the user request (XSLT, XSD) XML document can be displayed in different way depends on the user request (XSLT, XSD) Most modern Web-browsers (Mozilla, IE) can display XML documents by default. Most modern Web-browsers (Mozilla, IE) can display XML documents by default. Well defined open standard Well defined open standard Recomended by W3C Recomended by W3C Supported by IBM, Sun, Microsoft, Linux/UN*X community Supported by IBM, Sun, Microsoft, Linux/UN*X community [A.Kryukov]
P.Bartalini – CHEP September Why XML (continue) ? Available tools to treat of XML Available tools to treat of XML SAX - Java SAX - Java Expat – C/C++ Expat – C/C++ Perl Perl Python Python Other … Other … SW developer can re-use these tools for applications SW developer can re-use these tools for applications [A.Kryukov]
P.Bartalini – CHEP September Experimental events and/or simulated events HEP models ME Gen (CompHEP) Hadronization (Pythia) Simulation (Geant) Detector or it models 4 Reconstruction (ORCA) 7 6 Phys. analysis Event DB 5’ [A.Kryukov]
P.Bartalini – CHEP September HEP models QED QED Alpha QED (1/137) Electron (See more details in A.Demichev, A.Kryukov, A.Rodionov, hep- ph/ See also [A.Kryukov]
P.Bartalini – CHEP September ME events: partonic level The LHC project The LHC project u, D -> n, e, E, b, B u, D -> n, e, E, b, B …( Below partonic events. NOT in XML format! )… [A.Kryukov]
P.Bartalini – CHEP September Simulated events: particle level (illustration) <!– etc. (Below Pythia-like events. NOT in XML format!) [A.Kryukov]
P.Bartalini – CHEP September Detector models (illustration) Compact Muon Solenoid <!– etc. [A.Kryukov]
P.Bartalini – CHEP September Next points … 5. Simulated/experimental events 6. Reconstructed events 7. Visualization, Web browsing 8. What else? [A.Kryukov]
P.Bartalini – CHEP September Conclusions (pro) XML is very useful for presentation of structured data XML is very useful for presentation of structured data Models, Events Models, Events Easy for realization on Java, C/C++, Perl, … Easy for realization on Java, C/C++, Perl, … Give a lot of possibilities for publication results through the WEB. Give a lot of possibilities for publication results through the WEB. Unify interfaces of application software Unify interfaces of application software XML DB id native format of data base for XML presented data. XML DB id native format of data base for XML presented data. [A.Kryukov]
P.Bartalini – CHEP September Conclusions (contra) Poor structured information Poor structured information Huge amount of information with strong defined and fixed structure Huge amount of information with strong defined and fixed structure ME generated events, Pythia events and others ME generated events, Pythia events and others [A.Kryukov]
P.Bartalini – CHEP September Main conclusion: XML has great advantage in HEP application Thank you for attention!