LHCC Comprehensive Review of LCG - 25 Nov 2003 Experiment Integration and Validation David R. Quarrie CERN/LBNL

Slides:



Advertisements
Similar presentations
Physicist Interfaces Project an overview Physicist Interfaces Project an overview Jakub T. Moscicki CERN June 2003.
Advertisements

Athena/POOL integration
D. Düllmann - IT/DB LCG - POOL Project1 POOL Release Plan for 2003 Dirk Düllmann LCG Application Area Meeting, 5 th March 2003.
Simulation Project Major achievements (past 6 months 2007)
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Simulation Project Organization update & review of recommendations Gabriele Cosmo, CERN/PH-SFT Application Area Internal.
SEAL V1 Status 12 February 2003 P. Mato / CERN Shared Environment for Applications at LHC.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
Introduction to RUP Spring Sharif Univ. of Tech.2 Outlines What is RUP? RUP Phases –Inception –Elaboration –Construction –Transition.
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
Nightly Releases and Testing Alexander Undrus Atlas SW week, May
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
A. Aimar - EP/SFT LCG - Software Process & Infrastructure1 Software Process panel SPI GRIDPP 7 th Collaboration Meeting 30 June – 2 July 2003 A.Aimar -
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Nick Brook Current status Future Collaboration Plans Future UK plans.
David Adams ATLAS ATLAS Distributed Analysis Plans David Adams BNL December 2, 2003 ATLAS software workshop CERN.
N ATIONAL E NERGY R ESEARCH S CIENTIFIC C OMPUTING C ENTER Charles Leggett The Athena Control Framework in Production, New Developments and Lessons Learned.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
LCG Applications Area – Overview, Planning, Resources Torre Wenaus, BNL/CERN LCG Applications Area Manager LHCC Comprehensive Review.
SEAL: Core Libraries and Services Project CERN/IT After-C5 Meeting 6 June 2003 P. Mato / CERN.
MINER A Software The Goals Software being developed have to be portable maintainable over the expected lifetime of the experiment extensible accessible.
ROOT Application Area Internal Review September 2006.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
Outline: LHCb and LCG-AA Ph.Charpentier B 00 l e.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
20/09/2006LCG AA 2006 Review1 Committee feedback to SPI.
Feedback from the POOL Project User Feedback from the POOL Project Dirk Düllmann, LCG-POOL LCG Application Area Internal Review October 2003.
SEAL Core Libraries and Services CLHEP Workshop 28 January 2003 P. Mato / CERN Shared Environment for Applications at LHC.
SEAL Project Core Libraries and Services 18 December 2002 P. Mato / CERN Shared Environment for Applications at LHC.
GDB Meeting - 10 June 2003 ATLAS Offline Software David R. Quarrie Lawrence Berkeley National Laboratory
- Early Adopters (09mar00) May 2000 Prototype Framework Early Adopters Craig E. Tull HCG/NERSC/LBNL ATLAS Arch CERN March 9, 2000.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
15 December 2015M. Lamanna “The ARDA project”1 The ARDA Project (meeting with the LCG referees) Massimo Lamanna CERN.
David Quarrie: ATLAS LCG Apps Area Feedback LCG Applications Area Internal Review – 30 March - 1 April ATLAS LCG Applications Area Feedback David.
Feedback from LHC Experiments on using CLHEP Lorenzo Moneta CLHEP workshop 28 January 2003.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
G.Govi CERN/IT-DB 1 September 26, 2003 POOL Integration, Testing and Release Procedure Integration  Packages structure  External dependencies  Configuration.
1 SICBDST and Brunel Migration status and plans. 2 Migration Step 1: SICBMC/SICBDST split  Last LHCb week: Split done but not tested  Software week.
23/2/2000Status of GAUDI 1 P. Mato / CERN Computing meeting, LHCb Week 23 February 2000.
A. Aimar - EP/SFT LCG - Software Process & Infrastructure1 SPI Software Process & Infrastructure for LCG Project Overview LCG Application Area Internal.
K. Harrison CERN, 3rd March 2004 GANGA CONTRIBUTIONS TO ADA RELEASE IN MAY - Outline of Ganga project - Python support for AJDL - LCG analysis service.
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland t DBCF GT Upcoming Features and Roadmap Ricardo Rocha ( on behalf of the.
LCG – AA review 1 Simulation LCG/AA review Sept 2006.
Valeri Fine Athena/POOL integration.
12 March, 2002 LCG Applications Area - Introduction slide 1 LCG Applications Session LCG Launch Workshop March 12, 2002 John Harvey, CERN LHCb Computing.
D. Duellmann - IT/DB LCG - POOL Project1 The LCG Dictionary and POOL Dirk Duellmann.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
1 Comments to SPI. 2 General remarks Impressed by progress since last review Widespread adoption by experiments and projects Savannah, ExtSoft Build system.
Summary of persistence discussions with LHCb and LCG/IT POOL team David Malon Argonne National Laboratory Joint ATLAS, LHCb, LCG/IT meeting.
LCG Applications Area Internal Review Response (preliminary and brief version) (main points are on last slide) Torre Wenaus, BNL/CERN LCG Applications.
ATLAS Distributed Analysis Dietrich Liko IT/GD. Overview  Some problems trying to analyze Rome data on the grid Basics Metadata Data  Activities AMI.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
Follow-up to SFT Review (2009/2010) Priorities and Organization for 2011 and 2012.
Project Work Plan SEAL: Core Libraries and Services 7 January 2003 P. Mato / CERN Shared Environment for Applications at LHC.
D. Duellmann, IT-DB POOL Status1 POOL Persistency Framework - Status after a first year of development Dirk Düllmann, IT-DB.
David Adams ATLAS ADA: ATLAS Distributed Analysis David Adams BNL December 15, 2003 PPDG Collaboration Meeting LBL.
Development Project Management Jim Kowalkowski. Outline Planning and managing software development – Definitions – Organizing schedule and work (overall.
Comments on SPI. General remarks Essentially all goals set out in the RTAG report have been achieved. However, the roles defined (Section 9) have not.
POOL Based CMS Framework Bill Tanenbaum US-CMS/Fermilab 04/June/2003.
POOL Historical Notes POOL has been the most advanced and the most used AA project. Currently, excellent teamwork with experiments on new features and.
SPI Report for the LHCC Comprehensive Review Stefan Roiser for the SPI project.
(on behalf of the POOL team)
SPI external software build tool and distribution mechanism
Dirk Düllmann CERN Openlab storage workshop 17th March 2003
Simulation Framework Subproject cern
Simulation and Physics
SEAL Project Core Libraries and Services
Presentation transcript:

LHCC Comprehensive Review of LCG - 25 Nov 2003 Experiment Integration and Validation David R. Quarrie CERN/LBNL

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Summarize feedback from all experiments on all products Main emphasis will be on POOL integration because that’s the first one to be deployed by experiments Incorporates significant SEAL components as well But I’ll try to cover other products/projects Brief overview of status of validation Some of the lessons learned and responses to those Many thanks to people who provided input for this talk Introduction

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov POOL/SEAL Components (LHCb view) Technology dependent.h Dictionary Generation CINT dictionary I/O Data I/O LCG dictionary code.xml Code Generator LCG dictionary Gateway Reflection … …

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov CMS and ATLAS share model parsing C++.h files using gcc_xml and generating dictionary filling code XML configuration file allows overrides (e.g. transient data members) LHCb use XML files as primary description and generate both C++.h file and dictionary filling code from there In both approaches the SEAL dictionary plays a central role Integration tested two major areas The filling of the SEAL dictionary (and the dictionary itself) Coupling to ROOT I/O through gateway POOL Integration Approaches

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Fruitful collaboration with POOL team since inception 2.6 FTE direct participation Efficient communication Savannah Portal Direct mail and phone exchange among developers In person meetings when required Continuous and prompt feedback CMS typically provides feedback on any new pre-release in few hours POOL typically responds to bug reports in 24/48 hours  Only a few took more than a week to be fixed in a new pre-release CMS & POOL

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov First POOL production release on 30 June 2003 In reality just an honest prototype with many bugs, missing features and major performance problems Demonstrated that internal unit and integration testing had poor coverage and inadequate complexity COBRA release base on POOL available in early August Still not production ready (unexplained errors and crashes) CMS put many weeks of effort into debugging, in close collaboration with POOL team CMS Integration Experience

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov First public release of CMS products (COBRA etc.) based on POOL 1.3.x available on 30 Sep 2003 Used in production, deployed to physicists e.g. 2 Million events produced with OSCAR (G4 simulation) in a weekend New tutorials (just started) based on software released against POOL Essentially same functionality as Objectivity-base code, but: No concurrent update of databases  No direct connection to central database while running Remote access limited to RFIO or dCache No schema evolution Still a few bugs, missing features and performance problems Affect more complex use-cases Impact the deployment to a large developer/user community CMS Current Status

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov SEAL looks like a collection of quite heterogeneous and independent products SPI distribution hides the complexity of the project well Difficult for individuals willing to test or integrate a single component Dependencies on ROOT & CLHEP makes integration difficult Apart from what was ported from Iguana, CMS does not make any direct use of SEAL at the moment No plan to use the dictionary outside pool No plan to use high level framework infrastructure (e.g. whiteboard) Minuit investigations begun Concerns about impact of different experiment priorities on the project CMS Comments on SEAL

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov POOL schema frozen for next 18 months Follow a minimalist approach to avoid further confrontations with bugs, missing features, performance problems New projects (such as Conditions DB) will build upon LCG/AAA software and make larger use of it Bottom line is that POOL is no longer on critical path towards CMS Data Challenge in 2004 CMS Plans

David R. Quarrie: Software Status and Plans Infrastructure and Summary - 7 Mar Gaudi/Athena & LCG Components ConverterAlgorithmEvent Data Service Persistency Service Data Files Algorithm Transient Event Store Detec. Data Service Persistency Service Data Files Transient Detector Store Message Service JobOptions Service Particle Prop. Service Other Services Histogram Service Persistency Service Data Files Transient Histogram Store Application Manager Converter LCG Coupling Event Selector

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Adiabatic adaptation of Gaudi to SEAL/POOL Slow integration according to available manpower (1 year for full migration) Take advantage of face-lifting of “bad” interfaces and implementations Minimal changes to interfaces visible to physicists Integration of SEAL in steps Dictionary integration and plugin manager Use of SEAL services later Integration of POOL earlier than SEAL Working prototype by the end of the year Necessity to read “old” ROOT data Keep the LHCb event model unchanged LHCb Strategy

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Mechanics for persistency of event data implemented New Gaudi plugin created New Gaudi conversion service  One converter class for all object types New EventSelector service to access implicit POOL collections SEAL dictionary generator recently completed MC Truth relationships recently completed LHCb Status

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Phased integration of POOL/SEAL POOL first Then client access to SEAL dictionary Then other SEAL components Design of POOL integration started before end of 2002 Actual integration started in spring 2003 ATLAS POOL/SEAL Strategy

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Integration took much longer than originally expected The fact that ATLAS developer working on this had not been a part of the core POOL team (and was remote) contributed Lots of nuisance technical obstacles Conflicts in how cmt/scram/ATLAS/SPI handle build environments, compiler/linker settings, external packages & versions... Conflicts between Gaudi/Athena dynamic loading infrastructure and SEAL plugin management Conflicts in lifetime management with Athena/POOL transient caches Figuring out “gotchas” like classes with private constructors Obscure error messages etc. ATLAS POOL/SEAL Integration Feedback

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov ATLAS SEAL/POOL Status Required functionality is essentially in place Main outstanding problem is that of CLHEP matrix class ATLAS has not yet tested EDG RLS-based file catalog POOL explicit and implicit collections are available via Athena, but fuller integration will require extensions from POOL (and refactoring on ATLAS side) We’re moving from an expert to general developer environment The event data model is being filled out Goal is to have it essentially complete by end of the year Although parts of transient model are still being redesigned Python scripting support is being incorporated now PyROOT, PyLCGDict, GaudiPython, etc.

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov PI Feedback Re-implementing of Gaudi/Athena histogram service based on AIDA/ROOT implementation (needed by both LHCb & ATLAS) is complete Available in next Gaudi release ATLAS does not presently intend to take advantage of enlarged AIDA API Wait until more experience gained from physics analysis CMS has ported to PI all code that was previously based on ANAPHE A few missing features were identified but have been rectified by PI team

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Simulation Feedback; Geant4 LHCb, CMS and ATLAS have long history of active involvement with Geant4 Extensive validation studies Physics and memory/cpu performance have reached point where it’s deemed to be ready for production All are now actively deploying (CMS) or in the final preparations for deployment (LHCb, ATLAS) for their Data Challenges in 2004 Also ATLAS use for combined testbeam Apr-Oct 2004

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Simulation Feedback: Generators CMS has already produced generator-level events for their DC- 04 Data Challenge CMS is in process of integration tests of GENSER First simulation is a candidate as first user LHCb believe that the generators should be treated as external (cf Boost, Xerces) and should not be copied into a CVS repository as GENSER which couples them together ATLAS is in the process of validating the GENSER generator distribution Migration to use some generators from distribution underway now

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov SPI Feedback Primary focus is the LCG projects themselves But very useful interactions support for the experiments Savannah portal is widely used Primarily for bug tracking Response for upgrades has been tempered by problems with OpenSource environment, but these have apparently been addressed now ATLAS, CMS and LHCb are migrating to the SPI External installations Useful scripts and procedures are available from QA/testing Although “some assembly required” because of different package structures Generally very good interactions with SPI

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Build Tool Feedback The build/configuration tool issue has been painful scram/cmt/appwork I suspect that no-one is really happy with the present situation This might be an inevitable conclusion given the disparate systems already in use by the experiments LHCb and ATLAS are very supportive of the decision to add support for CMT to the products

LCG and ALICE By the time LCG started ALICE had already a full system in place, including a distributed computing grid solution –Of course still far from the final system! ALICE is not depending on any of the LCG AA projects –POOL functionality is provided by a combination of AliEn file catalogue and native ROOT –SEAL and PI functionalities is provided directly by ROOT ALICE is collaborating intimately with the ROOT and FLUKA team ALICE is very worried by existing unnecessary duplications and supports strongly the pledge to reconsider the user-provider relation with ROOT and “converge” expressed by the internal review

LCG and ALICE ALICE develops generic technologies of interest to LCG –The Virtual MonteCarlo has been declared of interest by the Simulation project Unfortunately no manpower has been found to be assigned to it and ALICE is continuing its development alone –The geometrical modeller as a montecarlo-independent complement to the virtual montecarlo –The PROOF system, developed together with the ROOT team for interactive parallel and distributed analysis Demonstrated together with AliEn at Supercomputing 2003 To be used in production for the ALICE Data Challenge 1H04 –The AliEn is a complete but open and extensible Grid solution based on Web Services model and standard protocols

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Validations Most major validation activity is still to come However, 6 of the 14 milestones have already been met DescriptionDateStatus CMS POOL integration: POOL persistency of CMS event2003/7/31Done v=0 CMS POOL acceptance for PCP2003/7/31Done v=0 CMS SEAL integration supporting POOL usage2003/7/31Done v=0 ATLAS POOL integration: POOL persistency in Release 72003/9/10Done v=1 ATLAS SEAL integration supporting POOL usage2003/9/10Done v=1 CMS POOL validation with PCP data2003/10/31Done v=-10 ATLAS int: ROOT implementation of AIDA histograms in Athena 2003/11/30 LHCb POOL integration: Gaudi persistency replaced by POOL 2003/12/19 LHCb integration: SEAL plugin manager integrated in Gaudi 2003/12/19 ATLAS integration: SEAL integration into Athena2003/12/31 ATLAS POOL validation with DC1 data2004/1/19 ATLAS validation of POOL metadata/event collections2004/1/31 ATLAS POOL validation with complete Event Data Model2004/3/31 ATLAS int: Generic simulation framework validated by ATLAS 2004/4/30

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Lessons Learned and Responses (1) Effort needed for integration generally under estimated Original development model was for frequent releases and rapid feedback Original integration model was that developers from the experiment working on a product would integrate it as well However, this model has been found to be flawed  Those developers still have ongoing deliverables  Not every experiment has people working on all aspects of products Response has been to assign integrator/liaison where appropriate  E.g. POOL/ATLAS - is beginning to work well Different priorities and timescales have driven schedules for integration as well as manpower limitations

David R. Quarrie: Experiment Integration and Validation LHCC Comprehensive Review of LCG - 25 Nov Lessons Learned and Responses (2) Configuration management is hard Even within a project Also because of cascade of version dependencies across products  e.g. consistent version of CLHEP/ROOT across all products There is still an advantage to being resident in the same site for development and integration Initially the fact that LHCb and ATLAS already were collaborating on a variety of tools was not given any weight by LCG This is being (somewhat belatedly) addressed with e.g. CMT support and offer of help with SEAL integration