Presentation is loading. Please wait.

Presentation is loading. Please wait.

NE-ROC Nordics Operations

Similar presentations


Presentation on theme: "NE-ROC Nordics Operations"— Presentation transcript:

1 NE-ROC Nordics Operations
Nordic NE ROC Face 2 Face Meeting 15 June 2009 Vera Hansper CSC/NDGF/NE ROC

2 Joint Operations NDGF and SNIC operate jointly for T1 and EGEE operations monitoring. Core SNIC team Thomas Bellman Michaela Lechner Gilbert Netzer Roger Oscarsson Åke Sandgren Zeeshan Ali Shah Core NDGF team Vera Hansper Jens Larsson Tore Mauset Leif Nixon

3 Joint Operations Weekly shift on a 6 week rotation between the two teams – only one person on shift Duty operator covers both NDGF Operator on Duty (OoD) and EGEE Regional Operator on Duty (ROD) tasks. NE ROC ROD duty covers sites which fall under the NE ROC Nordics region. These include Baltic Grid Finland Norway Sweden

4 EGEE Operations EGEE operations have moved from a centrally managed system (COD) to a regional managed model (ROD). NE ROC has been in the regional model since the beginning of 2009 and has been instrumental in the creation process of the structure of the model. There are various layers to the Regional Model Site Administrators 1st Line Support ROD C-COD Monitors site availability through SAM tests Managed through the CIC portal The Regional Dashboard provides the dashboard and tools for ROD.

5 NE ROC Nordics Operations
Monitor alarms on the CIC portal Contact sites directly if alarms are <24 hours old Create tickets for sites for alarms that are >24 hours old A ticket is NOT a punishment BUT should be acted on ASAP! Ideally, the tickets should have a quick turnover – a matter of days! If the problem can't be solved the site should go into downtime Tickets > 30 days, or unhandled tickets are automatically escalated to C-COD This is a team effort – ROD needs to keep abreast of the status and site admins need to be actively responsive

6 NDGF-T1 Operations NDGF-T1 is a distributed T1!
NDGF Operations cover sites which are running the ARC middleware. These include Denmark Finland Norway Slovenia Sweden The NDGF team also takes on On Call duties during weekends and public holidays. T1s are expected to have 247 monitoring, and this will be fully implemented in the Nordic region by the 1st of July. The NDGF On Call is supplemented by Anders Rhod Gregersen Mattias Wadenstein

7 NDGF-T1 Monitoring NDGF uses a mix of tools and dashboards to monitor the health of the sites NAGIOS DCACHE dashboard SAM tests GRIDMAP GRIDVIEW GANGLIA Other dashboards – ie. FTS monitoring NDGF has it's own ticketing system for announcing downtimes and internal logging. There will be a demo of this later

8 Communication NE ROC (Nordics) operators have regular weekly phone meetings Several Face 2 Face meetings a year NDGF have regular weekly CHAT meetings (jabber) 3 – 4 Face 2 Face meetings a year All operators communicate directly via the CHAT room provided by NDGF. There are two mailing lists for sites to request support. Used more for NDGF operations Used more for NE ROC (Nordics) operations

9 What can site admins do? Site admins are encouraged to subscribe to the NDGF ticketing system Small volume list, mainly to notify admins about central (NDGF-T1) service maintenances Site admins should subscribe to EGEE alarm notifications Can be done on a site or node basis Be proactive The faster a problem is solved, the better the overall availability of the site

10 We are here to help Operators can issue downtime in the GOCDB on your behalf The mailing list is actively read Please feel free to use it to communicate with the operators and admins Developers also read this list Ask us for help – advice, training, etc. Some of the operators are cheap – you only need to mention beer! No question is too trivial

11 Questions? ?


Download ppt "NE-ROC Nordics Operations"

Similar presentations


Ads by Google