EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operational Tools M2 Update James Casey.

Slides:



Advertisements
Similar presentations
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Operations Dashboard Workplan Cyril.
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PoW for the second year Transition to EGI.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Deployment and Support in EGEE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
EGEE-III INFSO-RI Enabling Grids for E-sciencE Operations Automation Team KoM, May ROC VIEW (SWE)‏ Javier Lopez Cacheiro/
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation in EGEE-III What does.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks DSA1.4 – Objectives and Status Ioannis Liabotis.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Dashboard Cyril L’Orphelin - CNRS/IN2P3.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE GStat Work Plans for EGEE-III Joanna Huang, ASGC/OPS EGEE SA1 F2F Meetings, Abingdon.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Task tracking SA3 All Hands Meeting Prague.
Julia Andreeva on behalf of the MND section MND review.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Operations procedures: summary for round table Maite Barroso OCC, CERN
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks COD-17
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ian Bird All Activity Meeting, Sofia
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Pole 2 : Restructuration of the OPS Manual.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Implementing product teams Oliver Keeble.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid is a Bazaar of Resource Providers and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROC model assessment AP ROC ShuTing Liao.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Update on Service Availability Monitoring (SAM) Marian Babik, David Collados,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IT ROC: Vision for EGEE III Tiziana Ferrari.
TSA1.4 Infrastructure for Grid Management Tiziana Ferrari, EGI.eu EGI-InSPIRE – SA1 Kickoff Meeting1.
Monitoring BOF, 23 rd Jan 2007 Grid Service Monitoring Working Group Monitoring WG BOF, January 2007 James Casey/Ian Neilson.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks An insight into GOCDB for ROD Operators.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyEGEE David Horat (
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Transition to EGI PSC-06 Istanbul Ioannis Liabotis Greece GRNET
James Casey, CERN IT-GD WLCG Workshop 1st September, 2007
NGI and Site Nagios Monitoring
John Gordon STFC OMB 26 July 2011
POW MND section.
Introduction to OAT presentations
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Security Monitoring in a Nagios world
Advancements in Availability and Reliability computation Introduction and current status of the Comp Reports mini project C. Kanellopoulos GRNET.
March Availability Report for EGEE Sites based on Nagios
Operations & Coordination Tools
Maite Barroso, SA1 activity leader CERN 27th January 2009
Monitoring in EGEE Automatisierung & Regionalisierung im Hinblick auf EGI Torsten Antoni (KIT), James Casey (CERN), Sabine Reißer (KIT)
Solutions for federated services management EGI
Monitoring of the infrastructure from the VO perspective
Kashif Mohammad Deputy Technical Co-ordinator (South Grid) Oxford
Presentation transcript:

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operational Tools M2 Update James Casey SA1 Management Meeting Barcelona

Enabling Grids for E-sciencE EGEE-III INFSO-RI Summary of milestone timeline 2 We are here…

Enabling Grids for E-sciencE EGEE-III INFSO-RI Pending M1 Features - April 2009 Configuration repositories –First version of Aggregate Topology Provider (ATP)  What resources should I test ? ROC level Nagios based monitoring available –‘SAM Portal’ level of visualization complete Full Nagios testing of all resources in grid running –Used to validate equivalence to SAM 3 DONE

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - Regional Dashboard ‘Regionalized’ dashboards interfaced with regional Nagios Raising alarms based on Nagios Nagios notifications are generated + sent to message bus Lavoisier can pick them up, and insert in DB New regional dashboard can display them –No ‘Alarm DB’ masking done Current regionalized dashboard at IN2P3 cannot display Nagios alerts 4 NOT DONE IN PROGRESS

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - Configuration repositories Metric Description Database MDDB exists, and schema seems now finalized –Many iterations to get it to work with metric store and availability calculator –Profiles are now supported in the MDDB  We have a ‘ROC’ and a ‘Site’ profile No good UI yet for adding new metrics, profiles –We do DB inserts to get us bootstrapped –This is a general statement for most of the configuration repositories – any UI that exists is just in ‘demo’ form 5 IN PROGRESS

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - SLA Calculation Multiple simultaneous availability calculations for a VO Design of availability calculation system is done –Based on profiles and the status changes calculated in the metric store Implementation not started yet by Gridview team 6 NOT DONE

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - GOCDB Write functions for Programmatic Interface (XML over HTTP) available –XML over HTTP API available Prototype interface to new GOCDB4 available First regions deployed communicating with Central project-level GOCDB ‘addDowntime’ added as only needed write method GOCDB distributed by RPM –No public demo instance of central GOCDB available yet Work starting with HGSM developers on region 3 example 7 NOT DONE DEMO DONE

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 – ROC level Nagios Now publishes to new central metric store Submission framework fully uses ATP Central metric store result visualization (SAM Portal/Gridview) Waiting on ATP programmatic interface MyEGEE now at ‘demo’ quality –Need to get into ROC Nagios bundle for testing 8 DONE NOT DONE DEMO

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - QR Reporting Portal Added reports for operations and user support use cases –Some still missing (operations.{1,2}, size.2) Operations Portal now scheduled for release in Dec 09 9 POSTPONED

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - Accounting Tested ActiveMQ transport with some selected sites Patch submitted for gLite certification APEL now just ready for testing phase –Currently internal testing has been done –Not deployed at ay sites yet Need to integrate security configuration into main broker network 10 NOT DONE

Enabling Grids for E-sciencE EGEE-III INFSO-RI M2 - GGUS MSG interface for ticket submission/update available Interfaces defined, messages components written on both side –But no complete ticket flow demoed yet Need to integrate security configuration into main broker network 11 IN PROGRESS

Enabling Grids for E-sciencE EGEE-III INFSO-RI Other work done Gstat 2.0 –Very good progress, ahead of schedule – Operations Dashboard –Work done on new regional dashboard –Complete re-write, which is also for the central component GOCDB –Testing with regions –Chasing down people to turn off APIs 12

Enabling Grids for E-sciencE EGEE-III INFSO-RI My personal viewpoint Perhaps some focus lost –We understood the Nagios/SAM area well, but had not yet defined the areas so well at the start of the project –And didn’t take time to re-evaluate where we were Regional flavoured developments are creeping in –Before we have central components to replace the in-production central components ones –New developments are always more attractive ;( We’re now at the point for most components that we have ‘demo’ quality components –Works on laptop – sort-of –It takes 6-12 months to turn this into production  And we don’t have that much time ! 13

Enabling Grids for E-sciencE EGEE-III INFSO-RI What to do ? Accept our milestones have slipped Focus on delivering a replacement set of products by end of year –In the same model as current components –Ease back on regionalization where necessary Add more people to OAT management level –Tracking progress, project management Make clearer the separation of the EGI futures part from the delivery of something in EGEE-III Get something delivered that works for the ROCs –The work from the OAT teams is good, we just need to get it released, evaluated and into production. 14

Enabling Grids for E-sciencE EGEE-III INFSO-RI Resources Architecture and components Milestone tracking 15

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks OAT mandate till the end of EGEE III Maite Barroso, SA1 activity leader CERN 23 rd September 2009

Enabling Grids for E-sciencE EGEE-III INFSO-RI To cha nge : Vie w - > Hea der and Foo ter 17 Past OAT’s focus in the first year has been –Distributed/regional architecture –Implementation plan with well defined milestones –distributed monitoring (which was more advanced, and was the first step for a distributed regional support)

Enabling Grids for E-sciencE EGEE-III INFSO-RI To cha nge : Vie w - > Hea der and Foo ter 18 Priorities We know where we are today, late with the milestones (we knew we were optimistic ) We have 7 months to go There are two priorities in the operation tool area for the next months: –Have a solution based on new architecture + components that is functionally equivalent to the current one (and nothing else added) –Deploy one component of this in production, really replacing the central instance, get experience with it. The best candidate today is SAM/Nagios + associated databases with interfaces to the rest of the operation tools central instances

Enabling Grids for E-sciencE EGEE-III INFSO-RI To cha nge : Vie w - > Hea der and Foo ter 19 Future Refocus OAT and operation tools teams Reinforce OAT mgt with more people Split into present (deployment) and future (requirements from NGIs)

Enabling Grids for E-sciencE EGEE-III INFSO-RI To cha nge : Vie w - > Hea der and Foo ter 20 proposal OAT main responsible and chair: James Casey Mandate: reporting to the activity management, sit on both the teams and have the global view of both parallel activities OAT development and deployment (present) Mandate: –revise (update), track milestones defined at the beginning of EGEE-III, track the developments committed for these milestones and deploy them in production (M1, M2, M3 and M4) –Complete the definition of deployment/release procedures, and implement them, including verification criteria, early testers, and early SLAs with operation product teams –Own the ops tool interfaces (and finish its definition) Membership: all operational tools in EGEE Chair: Nick Thackray

Enabling Grids for E-sciencE EGEE-III INFSO-RI proposal OAT advisory (future) Mandate: –Act as advisory committee for all operation tools, replacing the present per-tool advisory committees –do a round of requirements gathering and define new set of milestones for regionalization of operation tools, with input from stakeholders (all ROCs and NGIs), to be implemented in the EGI era (post-M4) –With the egee III experience, propose a set of milestones for the next years, including timeline, interaction between tools and effort estimation –Reference regional implementation Membership: operational tools with dev effort in EGI, NGIs, ROCs, related infrastructure projects, VOs? Chair: wait for EGI bidding result To cha nge : Vie w - > Hea der and Foo ter 21

Enabling Grids for E-sciencE EGEE-III INFSO-RI Comments ? 22