WP9 – Earth Observation Applications – n° 1 Experiences with Testbed1, plans and objectives for Testbed 2. Testbed retreat, 27-28 August 2002


WP9 – Earth Observation Applications – n° 1
Experiences with Testbed1, plans and objectives for Testbed 2
Testbed retreat, 27-28 August 2002
EU-DataGrid Project, Work Package 9 – EO Applications

WP9 – Earth Observation Applications – n° 2
Experience with Testbed v1.2
- User Interface installed on RH 6.2 & 7.2
- Basic job submission tests (single short jobs using input sandbox and CERN UI)
- Data replication tests ongoing
  - problems with GDMP_CONFIG_FILE
- Started installing 1.2 SE
- Intensive use of 1.2 Testbed planned starting September
  - carry out processing of 1 month of GOME data (about Mb files 5.25Gb), data needed by IPSL for validation tests
  - upgrade CE to interface with AFS/LSF
  - complete installation of 1.2 SE + interfaces with the ESA MSS system
- Work will concentrate on preparing several high-profile demonstrations
  - implementation of GOME data processing and validation use case
  - EO WebMap Portal
  - visualisation of global ozone measurements
  - on-demand product processing using EDG services

WP9 – Earth Observation Applications – n° 3
Experience with Testbed v1.1
- Middleware installed at ESRIN
  - User Interface
  - Network Monitoring Tools
  - Computing Element (with PBS and LSF)
- Has been used to partially carry out the basic GOME use case:
  1. Transfer Level1 (raw) data to the Grid Storage Element
  2. Register Level1 data with the Replica Manager
  3. Submit jobs to process the Level1 data, produce Level2 data products
     - Jobs running on the CEs locate Level1 data by using the BrokerInfoAPI
  4. Repeat steps 1-3 for Level2 products
     1. Transfer Level2 data products to the Storage Element
     2. Register Level2 data products with the Replica Manager
     3. Submit jobs to the Grid to validate Level2 data products
  5. Retrieve validation results and visualize at the User Interface
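
The GOME use case above is the same three-step stage run twice (once per product level) followed by a retrieval step. The sketch below only models that ordering so it can be printed or checked. It does not call any grid command, and the names `Step`, `stage` and `gome_use_case` are hypothetical, introduced purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    level: str   # product level the step works on: "Level1" or "Level2"
    action: str  # plain-language description taken from the slide


def stage(level: str, job_purpose: str) -> list[Step]:
    """Steps 1-3 of the use case for one product level."""
    return [
        Step(level, f"transfer {level} data to the Storage Element"),
        Step(level, f"register {level} data with the Replica Manager"),
        Step(level, f"submit jobs to {job_purpose}"),
    ]


def gome_use_case() -> list[Step]:
    """Full ordered plan: Level1 stage, Level2 stage, then retrieval."""
    plan = stage("Level1", "process Level1 data into Level2 products")
    plan += stage("Level2", "validate Level2 products")
    plan.append(
        Step("Level2", "retrieve validation results and visualize at the User Interface")
    )
    return plan


if __name__ == "__main__":
    for number, step in enumerate(gome_use_case(), start=1):
        print(f"{number}. [{step.level}] {step.action}")
```

Writing the stage once and calling it for both levels mirrors the slide's "repeat steps 1-3" and makes the seven-step sequence explicit.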

WP9 – Earth Observation Applications – n° 4
Commands we used
- Job execution
  - dg-job-list-match
  - dg-job-submit
  - dg-job-status
  - dg-job-get-output
- Data management
  - gdmp_register_localfile
  - gdmp_publish_catalog
  - gdmp_get_catalog
  - gdmp_replicate_get
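
Before starting a session on a User Interface machine, an application could check that the commands listed above are actually installed. The following is a minimal sketch under that assumption. It uses only the command names from the slide and passes no options to them, since their arguments are not given here. The helper `missing_commands` is hypothetical.

```python
import shutil

# Command names exactly as listed on the slide; no options are assumed.
EDG_COMMANDS = {
    "job execution": [
        "dg-job-list-match",
        "dg-job-submit",
        "dg-job-status",
        "dg-job-get-output",
    ],
    "data management": [
        "gdmp_register_localfile",
        "gdmp_publish_catalog",
        "gdmp_get_catalog",
        "gdmp_replicate_get",
    ],
}


def missing_commands(commands=EDG_COMMANDS, which=shutil.which):
    """Return, per group, the command names that are not found on PATH."""
    return {
        group: [name for name in names if which(name) is None]
        for group, names in commands.items()
    }


if __name__ == "__main__":
    for group, names in missing_commands().items():
        status = "all present" if not names else "missing: " + ", ".join(names)
        print(f"{group}: {status}")
```

The `which` parameter exists only so the lookup can be replaced in tests; by default the real `PATH` is searched.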

WP9 – Earth Observation Applications – n° 5
Results
- Application environment installation not straightforward
  - need to contact sites directly to verify / fix problems
- Perceived general instability of the testbed
  - high incidence of unrecoverable errors, intermittent errors
- Job submission commands, execution cycle basically OK
  - but need better support for handling multiple simultaneous jobs
  - not easy for apps to work with CLI
- Data management commands not easy to use
  - complex replication sequence
  - unreliable, intermittent working
  - difficult to diagnose cause of malfunctions

WP9 – Earth Observation Applications – n° 6
Meeting EO Requirements
- EO requirements have been surveyed and matched against the testbed functionality (D9.6 Scaling Study activity)
- Analysis of 45 basic requirements (several were grouped together)
  - 4 Satisfied
  - 19 Partially satisfied
  - 12 Expected in future releases
  - 4 Planned in future releases
  - 6 Need to be verified
- Overall basic functionality of job submission and data replication considered satisfied in TB1
- Although the commands work, the testbed has not yet reached the required production quality
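
As a quick consistency check on the survey figures above, the five categories can be summed and expressed as shares of the 45 requirements. The counts come from the slide; the percentages are derived here and are not part of the original presentation.

```python
# Requirement counts per status, as given on the slide.
counts = {
    "Satisfied": 4,
    "Partially satisfied": 19,
    "Expected in future releases": 12,
    "Planned in future releases": 4,
    "Need to be verified": 6,
}

total = sum(counts.values())

# Share of each status, in percent, rounded to one decimal place.
shares = {status: round(100 * n / total, 1) for status, n in counts.items()}

if __name__ == "__main__":
    print(f"total requirements: {total}")
    for status, share in shares.items():
        print(f"{status}: {counts[status]} ({share}%)")
```

The categories add up to the stated 45, and fewer than one in ten requirements were rated fully satisfied.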

WP9 – Earth Observation Applications – n° 7
EO requirements priorities
- Reliability
  - Reduce job failure rate under high load conditions
  - Increase robustness and fault tolerance
  - Improve configuration / installation
  - Remove / avoid single points of failure
- Documentation
  - Needs constant revision to keep up with software changes
  - Installation manual, user manual should be mandatory in delivered RPMs
  - Verified and approved by quality control / testing measures

WP9 – Earth Observation Applications – n° 8
EO requirements priorities
- Usability
  - New data management s/w not yet very well understood
  - Overlapping command sets
    - GDMP
    - Replica Manager
    - Replica Catalog
    - Interface to MSS
  - Need for clear procedures / instructions
  - Cryptic error messages
  - Need better error recovery (core dumps are a show-stopper)
  - Need built-in error handling and fault tolerance
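
Until error handling is built into the middleware, an application layer might compensate for intermittent failures with its own retry logic. The sketch below shows one common pattern, retry with an increasing delay. It is not part of EDG, and `run_with_retries` and its parameters are hypothetical names chosen for illustration.

```python
import time


def run_with_retries(operation, attempts=3, delay_s=1.0, backoff=2.0, sleep=time.sleep):
    """Call operation() until it succeeds or the attempts run out.

    Returns the operation's result. If every attempt fails, the last
    error is re-raised so unrecoverable problems stay visible.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    wait = delay_s
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except Exception as error:
            if attempt == attempts:
                raise
            print(f"attempt {attempt} failed: {error}; retrying in {wait:.1f}s")
            sleep(wait)
            wait *= backoff  # wait longer before each further attempt
```

A wrapper like this only hides intermittent errors. It does not help with the unclear error messages or core dumps noted above, which still need fixing in the middleware itself.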

WP9 – Earth Observation Applications – n° 9
EO requirements priorities
- Application interfaces
  - low-level commands require an application functional layer and need to be designed appropriately
  - middleware command interfaces subject to change
  - apps will need some minimum backward compatibility
- Site uniformity
  - a job should produce the same result regardless of where it executes
  - it should not require hard-coded values for a specific site
  - avoid end-users having to contact sites directly

WP9 – Earth Observation Applications – n° 10
EO requirements priorities
- Automatic job decomposition based on input dataset
  - need use case / examples
- Brokerinfo use cases
  - method to locate the replicated files locally on the CE
- SE storage (for data & scratch space) management
  - query amount of space available
  - advance reservation
- Create / destroy VOs
  - + groups within VOs
- Integration of EO archives and catalogue systems
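
One simple reading of "job decomposition based on input dataset" is to split the list of input files into fixed-size batches, one batch per job. The sketch below illustrates that reading only. The slide does not specify a decomposition strategy, and the function `decompose` and the file names in the usage example are hypothetical.

```python
def decompose(input_files, files_per_job):
    """Split an ordered list of input files into per-job batches.

    Every file appears in exactly one batch; the last batch may be smaller.
    """
    if files_per_job < 1:
        raise ValueError("files_per_job must be at least 1")
    return [
        input_files[start:start + files_per_job]
        for start in range(0, len(input_files), files_per_job)
    ]


if __name__ == "__main__":
    # Hypothetical logical file names, for illustration only.
    files = [f"gome_level1_{n:03d}" for n in range(10)]
    for job_number, batch in enumerate(decompose(files, 4), start=1):
        print(f"job {job_number}: {len(batch)} files, first = {batch[0]}")
```

A real broker would likely also weigh where replicas are stored, which this sketch ignores.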

WP9 – Earth Observation Applications – n° 11
EO Use Case File Numbers
1 Year of Gome data

Data                       Number of files to be stored and replicated   Size
Level 1                    4,724                                         15 Mb
Level 2 NNO (ESA)          9,448,000                                     10 kb
Level 2 Opera (KNMI)       9,448,000                                     12 kb
Validation Lidar (IPSL)                                                  Mb
Total:                     18,900,                                       Gbyte

- Gome has a data set of 5 years
- Gome is relatively small (in both size and number of files)
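
The legible rows of the table allow a rough volume estimate. The sketch below assumes that the rows read as 4,724 Level 1 files and 9,448,000 files for each Level 2 product, that "Size" is the size of a single file, that "Mb" and "kb" mean megabytes and kilobytes, and that units are decimal. The Lidar row and the slide's own total are incomplete in this transcript, so they are left out. The figures computed here are therefore not a reconstruction of the slide's total.

```python
# Values read from the legible table rows (see assumptions in the text above).
LEVEL1_FILES = 4_724
LEVEL1_FILE_MB = 15
LEVEL2_FILES_PER_PRODUCT = 9_448_000
LEVEL2_FILE_KB = {"NNO (ESA)": 10, "Opera (KNMI)": 12}

# Assumption: decimal units, 1 Gbyte = 1,000 Mb = 1,000,000 kb.
level1_gb = LEVEL1_FILES * LEVEL1_FILE_MB / 1_000
level2_gb = {
    product: LEVEL2_FILES_PER_PRODUCT * size_kb / 1_000_000
    for product, size_kb in LEVEL2_FILE_KB.items()
}

# Number of Level 2 files per Level 1 file, for each Level 2 product.
level2_per_level1 = LEVEL2_FILES_PER_PRODUCT // LEVEL1_FILES

if __name__ == "__main__":
    print(f"Level 1: {level1_gb:.2f} Gbyte")
    for product, gb in level2_gb.items():
        print(f"Level 2 {product}: {gb:.2f} Gbyte")
    print(f"Level 2 files per Level 1 file: {level2_per_level1}")
```

Under these assumptions each Level 1 file corresponds to exactly 2,000 Level 2 files per product, so the file count rather than the byte volume is what stresses the replica catalogue.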

WP9 – Earth Observation Applications – n° 12
EO requirements for Testbed2
- SE storage management policies
  - need for standard, even automatic, procedures for freeing space on the SE
- Capability to store user-defined application metadata in RC
  - use metadata keys as an alternative to LFN to describe input data
- RB support for data pipelining
  - ability to specify input data which will be produced as a result of a previous processing step
- Retrieval of QoS measures for data access, storage and processing (lost data at RAL!)
- Per VO / user quotas on job submission, RC & SE usage
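
The data pipelining requirement amounts to ordering jobs so that each one runs after the jobs that produce its inputs. The sketch below shows that idea with a topological sort from the Python standard library. It is an illustration of the requirement, not a description of how the Resource Broker works, and `pipeline_order` and the job and data names are hypothetical.

```python
from graphlib import TopologicalSorter


def pipeline_order(jobs):
    """Order jobs so that producers run before their consumers.

    jobs maps a job name to {"inputs": [...], "outputs": [...]}.
    Inputs that no job produces are assumed to exist already.
    """
    producer = {
        output: name for name, spec in jobs.items() for output in spec["outputs"]
    }
    # For each job, the set of jobs whose outputs it consumes.
    depends_on = {
        name: {producer[item] for item in spec["inputs"] if item in producer}
        for name, spec in jobs.items()
    }
    return list(TopologicalSorter(depends_on).static_order())


if __name__ == "__main__":
    jobs = {
        "validate": {"inputs": ["level2"], "outputs": ["report"]},
        "process": {"inputs": ["level1"], "outputs": ["level2"]},
    }
    print(pipeline_order(jobs))
```

If two jobs depend on each other's outputs, `static_order()` raises `graphlib.CycleError`, which is a reasonable way to reject an impossible pipeline.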

WP9 – Earth Observation Applications – n° 13
Quality Objectives
- Should be carefully selected to make a rapid impact on production Testbed stability and reliability
- Minimal disruption to the production testbed during upgrades, patches & site / service outages or reconfigurations
- Reduce priority of new functionality until existing infrastructure is stable and proven
  - priority to bug fixes and basic system enhancements
- Develop & apply documentation suite & standards
- Acceptance test procedures

WP9 – Earth Observation Applications – n° 14
Quality Objectives
- QA representatives should actively promote use of fault tolerance, defensive programming techniques, diagnostic facilities, etc.
  - sw development cycle should include testing and validation plans for each unit
  - make reliability a major design objective
- Quality control checklist for single RPMs to include
  - testing and verification details log
  - comprehensive installation and user manual
  - automatic installation and configuration scripts
  - test and verification scripts

WP9 – Earth Observation Applications – n° 15
A few suggestions
- Documentation suite and standards
  - dedicated document writers
- Reference test suite
- Acceptance tests – automatic procedures
- Towards automatic monitoring & anomaly detection
  - background information gathering by test probe jobs
- Testbed status / news / info update
  - e.g. like the login banner which reports current status of sites & services down, critical bugs, etc.
- Clear instructions on when to test and what to test