Jean-Roch Vlimant, CERN Physics Performance and Dataset Project Physics Data & MC Validation Group McM : The Evolution of PREP. The CMS tool for Monte-Carlo.

Slides:



Advertisements
Similar presentations
Network II.5 simulator ..
Advertisements

Configuration management
Database System Concepts and Architecture
MS CRM Integration WhosOn Service Integration Presentation MS CRM User Group.
T-FLEX DOCs PLM, Document and Workflow Management.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Core Application Software Activities Ian Fisk US-CMS Physics Meeting April 20, 2001.
Usage of the Python Programming Language in the CMS Experiment Rick Wilkinson (Caltech), Benedikt Hegner (CERN) On behalf of CMS Offline & Computing 1.
Software Configuration Management (SCM)
Automated Tests in NICOS Nightly Control System Alexander Undrus Brookhaven National Laboratory, Upton, NY Software testing is a difficult, time-consuming.
Configuration Management
A tool to enable CMS Distributed Analysis
CASE Tools And Their Effect On Software Quality Peter Geddis – pxg07u.
Data Quality Monitoring of the CMS Tracker
A Scalable Application Architecture for composing News Portals on the Internet Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta Famagusta.
Ian Fisk and Maria Girone Improvements in the CMS Computing System from Run2 CHEP 2015 Ian Fisk and Maria Girone For CMS Collaboration.
CERN - IT Department CH-1211 Genève 23 Switzerland t Monitoring the ATLAS Distributed Data Management System Ricardo Rocha (CERN) on behalf.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
DOE Review Jean-Roch Vlimant
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Introduzione al Software di CMS N. Amapane. Nicola AmapaneTorino, Aprile Outline CMS Software projects The framework: overview Finding more.
Components of Database Management System
Chapter © 2006 The McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/ Irwin Chapter 7 IT INFRASTRUCTURES Business-Driven Technologies 7.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
Bookkeeping Tutorial. Bookkeeping & Monitoring Tutorial2 Bookkeeping content  Contains records of all “jobs” and all “files” that are created by production.
5 May 98 1 Jürgen Knobloch Computing Planning for ATLAS ATLAS Software Week 5 May 1998 Jürgen Knobloch Slides also on:
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
4/5/2007Data handling and transfer in the LHCb experiment1 Data handling and transfer in the LHCb experiment RT NPSS Real Time 2007 FNAL - 4 th May 2007.
MINER A Software The Goals Software being developed have to be portable maintainable over the expected lifetime of the experiment extensible accessible.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
A Technical Validation Module for the offline Auger-Lecce, 17 September 2009  Design  The SValidStore Module  Example  Scripting  Status.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
STAR Event data storage and management in STAR V. Perevoztchikov Brookhaven National Laboratory,USA.
EPICS Release 3.15 Bob Dalesio May 19, Features for 3.15 Support for large arrays - done for rsrv in 3.14 Channel access priorities - planned to.
Connecting with Computer Science2 Objectives Learn how software engineering is used to create applications Learn some of the different software engineering.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Monte-Carlo Event Database: current status Sergey Belov, JINR, Dubna.
CERN IT Department t LHCb Software Distribution Roberto Santinelli CERN IT/GS.
Integration of the ATLAS Tag Database with Data Management and Analysis Components Caitriana Nicholson University of Glasgow 3 rd September 2007 CHEP,
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
UTA MC Production Farm & Grid Computing Activities Jae Yu UT Arlington DØRACE Workshop Feb. 12, 2002 UTA DØMC Farm MCFARM Job control and packaging software.
Bookkeeping Tutorial. 2 Bookkeeping content  Contains records of all “jobs” and all “files” that are produced by production jobs  Job:  In fact technically.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
ANALYSIS PHASE OF BUSINESS SYSTEM DEVELOPMENT METHODOLOGY.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
The “Comparator” Atlfast vs. Full Reco Automated Comparison Chris Collins-Tooth 19 th February 2006.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
Data aggregation and tools to monitor the status of Monte Carlo samples production in CMS Author: Ramūnas Peleckas University supervisor: Algimantas Juozapavičius.
Kati Lassila-Perini EGEE User Support Workshop Outline: – CMS collaboration – User Support clients – User Support task definition – passive support:
Finding Data in ATLAS. May 22, 2009Jack Cranshaw (ANL)2 Starting Point Questions What is the latest reprocessing of cosmics? Are there are any AOD produced.
JRA1 Meeting – 09/02/ Software Configuration Management and Integration EGEE is proposed as a project funded by the European Union under contract.
Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities XVIII International Conference on Computing in High Energy and Nuclear.
VI/ CERN Dec 4 CMS Software Architecture vs Hybrid Store Vincenzo Innocente CMS Week CERN, Dec
The Database Project a starting work by Arnauld Albert, Cristiano Bozza.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES The Common Solutions Strategy of the Experiment Support group.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
Barthélémy von Haller CERN PH/AID For the ALICE Collaboration The ALICE data quality monitoring system.
Overview of the Belle II computing
PLM, Document and Workflow Management
Software for Spectrometer T0 jumps correction
Chapter 11: Software Configuration Management
Physical Data Model – step-by-step instructions and template
CIS16 Application Development – Programming with Visual Basic
 YongPyong-High Jan We appreciate that you give an opportunity to have this talk. Our Belle II computing group would like to report on.
Chapter 11: Software Configuration Management
Serpil TOK, Zeki BAYRAM. Eastern MediterraneanUniversity Famagusta
Simulation Framework Subproject cern
Presentation transcript:

Jean-Roch Vlimant, CERN Physics Performance and Dataset Project Physics Data & MC Validation Group McM : The Evolution of PREP. The CMS tool for Monte-Carlo Request Management. Chains of Requests User can visualize and follow the successive requests required for the completion of the production of a given Monte-Carlo sample. Steps in the Production of a Monte-Carlo Sample The CMS software framework is organized such that the production of a dataset for analysis has to be done in successive steps, with I/O from disk. Arrangement of the steps is dictated by : software specifics, storage and software requirements, flexibility for further reprocessing, etc. In this document, a request is the logical container of the information required to perform one of such step. McM allows for the formation of any combination of steps. GEN Event generation SIM Detector simulation DIGI Emulation of electronics & High Level Trigger RECO Event reconstruction ANALYSIS Cycles of Production Requests prepared in McM are sent for production to the computing system, monitored during processing, and the output is fed back to McM for completion of the production or preparation of the next request. McM Computin g Requests Manager Request Monitorin g Generato r Contact Evolution in Between Requests Requests are prepared by the contacts from various Physics groups. McM can produce validation histograms to check the physics content prior to full production. At each relevant stage the contact, then the generator convener has to sign-off on the content. The request is then processed through the desired chain of requests. prepare validate approve process chain process chain process Production Contacts Prepare requests based on the needs of Physics groups Generator Convener Check the content of the requests Production Operators Handle the operation at production sites McM Handles automated operations on requests Service Infrastructure and Technology McM has a very simple and common infrastructure, with schema oriented documents stored in a database, object manipulations are done on the server side in a python application, visualization and operation in JavaScript on the client side. CouchDB TM [ Python [ CherryPy [ JavaScript, Angularjs [ Validation of The Physics Content The configuration of all varieties of event generator is complex. The full production of a samples can take a long time. In order to not waste resource ans delay delivery, a validation job on a selected number of events, is driven by McM and upload a set of plots to the Data Quality Monitoring (DQM) server. Actions to be Taken on Samples Production operators set the required action on the root requests before creating the chained requests. Specific priority block and partial processing can be specified. Campaigns of Requests Requests for all intermediate steps are gathered in campaign by type, energy, release software, etc. For book-keeping and clarity. Modification to the baseline request configuration are allowed for special requests, and within a certain flow of requests between campaigns. Building the Request Configuration A request document holds a sequence of arguments to a configuration utility, bound to the software release used in production. It can be modified from the values defined in the campaign in specific cases. Chains of Campaigns Campaigns that are connected with “flows” form chains of campaigns. A “chained campaign” defines what are the necessary operations toward the finalization of the production of a sample for analysis. Flow between Campaigns Specific modifications to the baseline parameters of a campaign are allowed, once agreed and relevant for a relatively large number of sample production requests. The specification goes into the “flow” object, which links many campaigns to a single campaign. Documentation and Glossary A set of documentation twiki [ is written with instructions and glossary for users, contacts and operators. The documentation is a collaborative effort. The analysis of the LHC data at the CMS experiment requires the production of a large number of simulated events. In 2012, CMS has produced over 4 Billion simulated events in about 100 thousands of datasets. Over the past years a tool (PREP) has been developed for managing such a production of thousands of samples. A lot of experience working with this tool has been gained, and conclusions on its limitations have been drawn. For better interfacing with the CMS production infrastructure and data book-keeping system, new database technology (couchDB) has been adopted. More recent server infrastructure technology (cherrypy + javascript) has been set as the new platform for an evolution of PREP. The operational limitations encountered over the years of usage have been solved in the new system. The aggregation of the production information of samples has been much improved for a better traceability and prioritization of work.This contribution will cover the description of the functionalities of this major evolution of the software for managing samples of simulated events for CMS. Overview and Outlooks McM uses evolutive contemporary technology. It incorporates the experience gained from past years of operation of production tools at the interface of Analysis and Computing. Novel concepts (flow, chained requests, chained campaigns, actions) were introduced to simplify and automatize the production of a sample through the logical and technical steps of processing. McM enforces testing and validation of the single requests prior to production to improve manpower and computing efficiencies. Special and vanilla requests can be handled within targeted campaigns of production. McM serves as bookkeeping of sample production. It permits the aggregation of information on production from within or from other CMS services. Extensive documentation and tutorials are provided to the users and contacts. McM is the service for central production of samples for analysis in CMS. Jean-Roch Vlimant for the CMS Collaboration