Ops Portal New Requirements (EGI-InSPIRE RI-261323, www.egi.eu)

Increased Availability

Availability: Despite the efforts to keep the portal up and running, it is not always available and queries sometimes fail.

We had different types of problems:
1) Database problems, which occurred last year and affected the whole IN2P3-CC site. The situation should be more stable now.
2) Various glitches, especially with the Operations Dashboard. These should be solved with the next version of the dashboard.
3) Please contact us or open a ticket when the portal is not available. It is difficult to find out what happened a month ago.

Broadcast Messages and Accounting Statistics

Messages received more than once: the Ops Portal cannot check which mailing lists people are registered on, and people are registered on several lists!

In the VO management tools, there will be a connection between the accounting portal and the VO, so that usage statistics can be seen from a short path (direct link) in each VO action list.
Needs discussion: short paths are currently not implemented in the accounting portal.

Security Dashboard

Every time a Security Nagios probe fails, an alarm is shown in the dashboard. In my opinion, the Security Dashboard should only show real security threats, while Security Nagios probe failures should be caught by other operations monitoring tools (ROD Dashboard?), since they report a normal job submission failure and not a security problem.
The CSIRT group recently proposed a solution with a mapping on the important issues. The mapping has been in place since February 16th.

Glitches of the Security Dashboard.
We are not aware of any; please file a GGUS ticket when this happens.
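The slide does not describe how the CSIRT mapping works. As one possible reading only, a mapping could classify each probe as a real security check or as an operational check, and route the resulting alarm to the matching dashboard. In the sketch below the probe names, the category labels, and the dashboard identifiers are all hypothetical.

```python
# Hypothetical mapping: probe names and categories are invented for
# illustration and are not taken from the actual CSIRT mapping.
PROBE_CATEGORY = {
    "security-vulnerability-check": "security",
    "security-job-submission": "operations",
}


def route_alarm(probe, status):
    """Return the dashboard that should display an alarm, or None.

    Probes missing from the mapping are treated as security-relevant,
    so that an unknown failure is never hidden from the security view.
    """
    if status == "OK":
        return None  # nothing to raise
    category = PROBE_CATEGORY.get(probe, "security")
    if category == "security":
        return "security_dashboard"
    return "operations_dashboard"
```

Defaulting unknown probes to the security view is a design choice of this sketch, made so that a missing mapping entry errs on the side of visibility.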

Inconsistency and documentation

Occasional problems with inconsistency between regional and central installations still exist.
The situation is much better. The integration of the virtual queues should solve the remaining issues.

Better advertisement of newly added features. Better documentation of the Security Dashboard should be provided.
New releases are announced with the broadcast tool, and the release notes are available on the Portal. The documentation will be provided soon with the help of the CSIRT team.

Improvements

1) Automatic alert masquerading (hierarchy)
Needs a detailed RT ticket.

2) Flapping detection
Internal discussion is ongoing on whether this should be done on the Nagios side or on the Ops Portal side. On the SAM side it can be implemented for simple tests, but not for complex ones like CE. CE has its own state machine, so switching on the Nagios flapping mechanism would just cause problems (tested).

3) GUI unintuitive, slow, too complicated
The slowness will be reduced in the Operations Dashboard with the next version. We will take into account some of the remarks about the GUI in the next version. Some other requirements should be refined (e.g. "too complicated").
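To make the flapping discussion concrete, here is a minimal sketch of how flap detection for a simple test could look if it were done on the portal side. It is a simplified, unweighted variant of the idea (share of state changes within a sliding window, with two thresholds for hysteresis). It is not the exact Nagios algorithm and not the portal's implementation; the class name, window size, and thresholds are assumptions.

```python
from collections import deque


class FlapDetector:
    """Flags a test as flapping when its state changes too often.

    Simplified sketch: the percentage of state changes is computed over
    the last `window` results. Flapping starts at `high` percent and
    stops only once the value drops below `low` percent (hysteresis).
    """

    def __init__(self, window=21, high=50.0, low=25.0):
        self.history = deque(maxlen=window)
        self.high = high
        self.low = low
        self.flapping = False

    def percent_change(self):
        states = list(self.history)
        if len(states) < 2:
            return 0.0
        changes = sum(1 for a, b in zip(states, states[1:]) if a != b)
        return 100.0 * changes / (len(states) - 1)

    def record(self, state):
        """Store a new result and return the current flapping flag."""
        self.history.append(state)
        # Wait for a full window before judging, to avoid false alarms
        # on the first few results.
        if len(self.history) < self.history.maxlen:
            return self.flapping
        pct = self.percent_change()
        if not self.flapping and pct >= self.high:
            self.flapping = True
        elif self.flapping and pct < self.low:
            self.flapping = False
        return self.flapping
```

With `FlapDetector(window=5)`, five alternating OK/CRITICAL results set the flag, and it clears only after the window has filled up with stable results again. As the slide notes, such a mechanism suits simple tests; a test with its own state machine, like CE, would need different handling.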

Optimized for mobile access

Foreseen in next year's plan.

Ops Portal Current Developments and Roadmap

Security Dashboard

New version online since February 16th.

Authentication
- Authentication model: authorization is applied based on GOC DB and EGI SSO
- Automatic loading of the list of sites / NGIs depending on the scope

Overview
- Visualize security problems: summarized by NGI, by site, or by test
- Historical details provided in a chart
- Sort problems by any column
- Permalinks to access the desired information directly
- 3 types of view: monitoring (normal view), history view (with recent "ok" statuses), debug (for the CSIRT group)

Notepads / Tickets
- Notepad with a mail to the Site Security Officer, using a template adapted to the current problems on the site
- Possibility to visualize the status of the related problems
- Possibility also to create a ticket against sites

Metrics
- Generate metrics dynamically, with a choice of format (table or charts), NGI or site, and test name
- Possibility to save charts (csv, pdf, jpg)

Events
- Possibility to declare / delete events: rotation declaration, monitoring downtimes …
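As an illustration of the "summarized by NGI, by site, or by test" view and the CSV export, the following sketch groups a list of problem records by one field and writes the counts as CSV. The record layout, field names, and sample values are hypothetical; the slides do not describe the portal's data model.

```python
import csv
import io
from collections import Counter

# Hypothetical problem records; field names and values are made up.
problems = [
    {"ngi": "NGI_A", "site": "SITE-1", "test": "probe-x"},
    {"ngi": "NGI_A", "site": "SITE-2", "test": "probe-y"},
    {"ngi": "NGI_B", "site": "SITE-3", "test": "probe-x"},
]


def summarize(records, key):
    """Count problems per value of `key` ('ngi', 'site' or 'test')."""
    if key not in ("ngi", "site", "test"):
        raise ValueError(f"unsupported grouping key: {key}")
    return Counter(record[key] for record in records)


def to_csv(summary, key):
    """Render a summary as CSV text, sorted by the grouping value."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([key, "problems"])
    for value, count in sorted(summary.items()):
        writer.writerow([value, count])
    return buffer.getvalue()
```

Calling `to_csv(summarize(problems, "ngi"), "ngi")` yields one row per NGI with its problem count; the same two functions cover the site and test views by changing the key.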

VO Oriented Dashboard

Pilot version online since February 16th.

Goal: provide a tool allowing quick and easy identification of resources failing automatic VO SAM tests (dedicated VO Nagios box). VO experts can then access this tool to validate results and alert infrastructure providers about how to mitigate the issues.

Authentication
- Authentication model: authorization is applied based on GOC DB and CIC DB
- Automatic loading of the list of sites / NGIs depending on the scope

Overview
- Visualize Nagios issues: summarized by NGI or site / tests
- Historical details provided in a chart
- Sort problems by any column
- Permalinks to access the desired information directly
- Possibility to create / update notepads or tickets, using a template adapted to the current problems on the site
- Possibility to visualize the status of the related problems

Administration
- Add VO staff and VO shifters (restricted to VO Managers)

Events
- Possibility to declare / delete events: rotation shift, monitoring downtimes...

Roadmap

Tasks                                              Planned completion time
Security dashboard: production version             February 2012
VO Operations Dashboard: pilot version             February 2012
VO Operations Dashboard: production version        April 2012
Major upgrade of the regional package              May 2012
Refactoring of the Operations Dashboard            July-August 2012
Availability / reliability module                  October-November 2012
Mobile version                                     March-April 2013