Work Package 9 – EO Applications

Slides:



Advertisements
Similar presentations
DataTAG WP4 Meeting CNAF Jan 14, 2003 Interfacing AliEn and EDG 1/13 Stefano Bagnasco, INFN Torino Interfacing AliEn to EDG Stefano Bagnasco, INFN Torino.
Advertisements

ESA Data Integration Application Open Grid Services for Earth Observation Luigi Fusco, Pedro Gonçalves.
Stephen Burke - WP8 Status - 9/5/2002 Partner Logo WP8 Status Stephen Burke, PPARC/RAL.
Andrew McNab - Manchester HEP - 17 September 2002 Putting Existing Farms on the Testbed Manchester DZero/Atlas and BaBar farms are available via the Testbed.
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
4 th DataGRID Project Conference, Paris, 5 March 2002 Testbed Software Test Plan I. Mandjavidze on behalf of L. Bobelin – CS SI; F.Etienne, E. Fede – CPPM;
Réunion DataGrid France, Lyon, fév CMS test of EDG Testbed Production MC CMS Objectifs Résultats Conclusions et perspectives C. Charlot / LLR-École.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
DataGrid is a project funded by the European Union CHEP 2003 – March 2003 – Title – n° 1 Grid Data Management in Action Experience in Running and.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications Demonstration Pedro Goncalves :
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
EDG Application The European DataGrid Project Team
5 November 2001F Harris GridPP Edinburgh 1 WP8 status for validating Testbed1 and middleware F Harris(LHCb/Oxford)
WP9 – Earth Observation Applications – n° 1 Experiences with Testbed1, plans and objectives for Testbed 2 Testbed retreat th August 2002
KNMI Applications on Testbed 1 …and other activities.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Testing Basics of Testing Presented by: Vijay.C.G – Glister Tech.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
1 DIRAC – LHCb MC production system A.Tsaregorodtsev, CPPM, Marseille For the LHCb Data Management team CHEP, La Jolla 25 March 2003.
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
WP9 – Earth Observation Applications – n° 1 WP9 report to Plenary ESA, KNMI, IPSL Presented by M. Petitdidier, IPSL DataGrid Plenary Session 5 th Project.
EU 2nd Year Review – Jan – WP9 WP9 Earth Observation Applications
Author - Title- Date - n° 1 Partner Logo EU DataGrid, Work Package 5 The Storage Element.
First attempt for validating/testing Testbed 1 Globus and middleware services WP6 Meeting, December 2001 Flavia Donno, Marco Serra for IT and WPs.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
DataGRID WPMM, Geneve, 17th June 2002 Testbed Software Test Group work status for 1.2 release Andrea Formica on behalf of Test Group.
DataGRID PTB, Geneve, 10 April 2002 Testbed Software Test Plan Status Laurent Bobelin on behalf of Test Group.
DataGRID Testbed Enlargement EDG Retreat Chavannes, august 2002 Fabio HERNANDEZ
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
6/23/2005 R. GARDNER OSG Baseline Services 1 OSG Baseline Services In my talk I’d like to discuss two questions:  What capabilities are we aiming for.
EDG Applications The European DataGrid Project Team
Andrew McNab - Manchester HEP - 17 September 2002 UK Testbed Deployment Aim of this talk is to the answer the questions: –“How much of the Testbed has.
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF WLCG Workshop, 19 th May 2012.
INFSO-RI Enabling Grids for E-sciencE Charon Extension Layer. Modular environment for Grid jobs and applications management Jan.
EC Review – 01/03/2002 – WP9 – Earth Observation Applications – n° 1 WP9 Earth Observation Applications 1st Annual Review Report to the EU ESA, KNMI, IPSL,
ESA DataGrid Review – 10 June 2002 – n° 1 Summary item 3  ESA and WP9 part 1 (45m)  DataGrid Frascati infrastructure (AT, 7.5m)  ESA GOME application.
Data Management The European DataGrid Project Team
Testing the HEPCAL use cases J.J. Blaising, F. Harris, Andrea Sciabà GAG Meeting April,
Site Certification Process (Round Table) Fabio Hernandez IN2P3 Computing Center - Lyon October
15-Feb-02Steve Traylen, RAL WP6 Test Bed Report1 RAL/UK WP6 Test Bed Report Steve Traylen, WP6 PPGRID/RAL, UK
INFSO-RI Enabling Grids for E-sciencE gLite Certification and Deployment Process Markus Schulz, SA1, CERN EGEE 1 st EU Review 9-11/02/2005.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
Testing Overview Software Reliability Techniques Testing Concepts CEN 4010 Class 24 – 11/17.
CERN Certification & Testing LCG Certification & Testing Team (C&T Team) Marco Serra - CERN / INFN Zdenek Sekera - CERN.
DataGrid is a project funded by the European Commission under contract IST rd EU Review – 19-20/02/2004 WP9 Earth Observation applications.
Earth Observation inputs to ATF Annalisa Terracina EU-DataGrid Project Work Package 9 – EO Applications April 2003 CERN.
DataGrid France 12 Feb – WP9 – n° 1 WP9 Earth Observation Applications.
Chapter 25 – Configuration Management 1Chapter 25 Configuration management.
WP 9.4 Use Case (ESA-ESRIN, IPSL, KNMI)
The EDG Testbed Deployment Details
Software Engineering (CSI 321)
EGEE Middleware Activities Overview
U.S. ATLAS Grid Production Experience
EO Applications Parallel Session
Practical: The Information Systems
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Introduction to Grid Technology
Testbed Software Test Plan Status
EDG Final Review Demonstration
Testing Activities on the CERT-TB Status report
CMS report from FNAL demo week Marco Verlato (INFN-Padova)
OGSA Data Architecture Scenarios
Stephen Burke, PPARC/RAL Jeff Templon, NIKHEF
CHAPTER 2 Testing Throughout the Software Life Cycle
Leigh Grundhoefer Indiana University
Wide Area Workload Management Work Package DATAGRID project
How To Integrate an Application on Grid
The EU DataGrid Fabric Management Services
Presentation transcript:

Work Package 9 – EO Applications Experiences with Testbed1, plans and objectives for Testbed 2 Testbed retreat 27-28th August 2002 Julian.Linford@esa.int EU-DataGrid Project Work Package 9 – EO Applications

Experience with Testbed v1.2 User Interface installed RH 6.2 & 7.2 Basic job submission tests (single short jobs using input sandbox and CERN UI) Data replication tests ongoing problems with GDMP_CONFIG_FILE Started installing 1.2 SE Intensive use of 1.2 Testbed planned starting September carry out processing of 1 month of GOME data (about 350 15Mb files 5.25Gb), data needed by IPSL for Validation tests upgrade CE to 1.2 + interface with AFS/LSF complete installation of 1.2 SE + interfaces with the ESA MSS system Work will concentrate on preparing several high-profile demonstrations implementation of GOME Data processing and validation Use Case EO WebMap Portal visualisation of global Ozone measurements on-demand product processing using EDG services

Experience with Testbed v1.1 1.1.4 Middleware Installed at ESRIN User Interface Network Monitoring Tools Computing Element (with PBS and LSF) Has been used to partially carry out the basic GOME use case: Transfer Level1 (raw) data to the Grid Storage Element Register Level1 data with the Replica Manager Submit jobs to process the Level1 data, produce Level2 data products Jobs running on the CEs locate Leve1 data by using the BrokerInfoAPI Repeat step 1-3 for level 2 products Transfer Level2 data products to the Storage Element Register Level2 data products with the Replica Manager Submit jobs to the Grid to validate Level2 data products Retrieve validation results and visualize at the User Interface

Commands we used Job execution Data management dg-job-list-match dg-job-submit dg-job-status dg-job-get-output Data management gdmp_register_localfile gdmp_publish_catalog gdmp_get_catalog gdmp_replicate_get

Results Application Environment installation not straightforward need to contact sites directly to verify / fix problems Perceived general instability of the testbed high incidence of unrecoverable errors, intermittent errors Job submission comands, execution cycle basically OK but need better support for handling multiple simultaneous jobs not easy for apps to work with CLI Data management commands not easy to use complex replication sequence unreliable, intermittent working difficult to diagnose cause of malfunctions

Meeting EO Requirements EO Requirements have been surveyed and matched against the testbed functionality (D9.6 Scaling Study activity) Analysis of 45 basic requirements (several were grouped together) 4 Satisfied 19 Partially satisfied 12 Expected in future releases 4 Planned in future releases 6 Need to be verified Overall basic functionality of job submission and data replication considered satisfied in TB1 Although the commands work, the testbed has not yet reached the required production quality 10 12 19 4 6

EO requirements priorities Reliability Improve job failure rate under high load conditions Increase robustness and fault tolerance Improve configuration / installation Remove / avoid single points of failure Documentation Needs constant revision to keep up with software changes Installation manual, user manual should be mandatory in delivered RPMs Verified and approved by quality control / testing measures

EO requirements priorities Usability New Data management s/w not yet very well understood Overlapping command sets GDMP Replica Manager Replica Catalog Interface to MSS Need for clear procedures / instructions Cryptic error messages Need better error recovery (core dumps are a show-stopper) Need built-in error handling and fault tolerance

EO requirements priorities Application interfaces low-level commands require application functional layer and need to be designed appropriately middleware command interfaces subject to change apps will need some minimum backward compatibility Site uniformity a job should produce the same result regardless of where it executes it should not require hard coded values for a specific site avoid end-users having to contact sites directly

EO requirements priorities Automatic job decomposition based on input dataset need use case / examples Brokerinfo use cases method to locate the replicated files locally on the CE SE storage (for data & scratch space) management query amount of space available advance reservation Create / destroy VOs + groups within VOs Integration of EO archives and catalogue systems

EO Use Case File Numbers 1 Year of Gome data Data Number of files to be stored and replicated Size Level 1 4,724 15 Mb Level 2 NNO (ESA) 9,448,000 10 kb Opera (KNMI) 12 kb Validation Lidar (IPSL) 12 2.5 Mb Total: 18,900,736 267 Gbyte Gome has a data set of 5 years Gome is relatively small (in both size and number of files)

EO requirements for Testbed2 SE storage management policies need for standard – even automatic - procedures for freeing space on the SE Capability to store user-defined application metadata in RC use metadata keys as alternative to to LFN to describe input data RB support for data pipelining Ability to specify input data which will be produced as a result of a previous processing step Retrieval of QoS measures for data access, storage and processing (lost data at RAL!) Per VO / user quotas on job submission, RC & SE usage

Quality Objectives Should be carefully selected to make a rapid impact on production Testbed stability and reliability Minimal disruption to the production testbed during upgrades, patches & site / service outages or re- configurations Reduce priority of new functionality until existing infrastructure is stable and proven priority to bug fixes and basic system enhancements Develop & apply documentation suite & standards Acceptance test procedures

Quality Objectives QA representatives should actively promote use of fault tolerance, defensive programming techniques, diagnostic facilities, etc. sw development cycle should include testing and validation plans for each unit make reliability a major design objective Quality control checklist for single RPMs to include testing and verification details log comprehensive installation and user manual automatic installation and configuration scripts test and verification scripts

A few suggestions Documentation suite and standards dedicated document writers Reference test suite Acceptance tests – automatic procedures Towards automatic monitoring & anomaly detection background information gathering by test probe jobs Testbed status / news / info update e.g. like the login banner which reports current status of sites & services down, critical bugs, etc. Clear instructions on when to test and what to test