CERN IT Department CH-1211 Genève 23 Switzerland www.cern.ch/i t DIP Service, status, recent issues and plans for the future Mathias Dutour 28 April 2008.

Slides:



Advertisements
Similar presentations
1Copyright © 2010, Printer Working Group. All rights reserved. PWG Plenary Status Report Workgroup for Imaging Management Solutions (WIMS/PMP) Printer.
Advertisements

IBM SMB Software Group ® ibm.com/software/smb Maintain Hardware Platform Health An IT Services Management Infrastructure Solution.
PVSS and JCOP Framework Organization, Support & News Oliver Holme IT-CO.
ServiceDesk Plus Product Overview Presented by ManageEngine 1.
Computer Monitoring System for EE Faculty By Yaroslav Ross And Denis Zakrevsky Supervisor: Viktor Kulikov.
Lecturer: Sebastian Coope Ashton Building, Room G.18 COMP 201 web-page: Lecture.
Lesson 1: Configuring Network Load Balancing
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle and Streams Diagnostics and Monitoring Eva Dafonte Pérez Florbela Tique Aires.
L. Granado Cardoso, F. Varela, N. Neufeld, C. Gaspar, C. Haen, CERN, Geneva, Switzerland D. Galli, INFN, Bologna, Italy ICALEPCS, October 2011.
Problem Management Overview
CERN IT Department CH-1211 Genève 23 Switzerland t Evolution of telecom services GTPA November 27 th 2014 IT Dept. CS group.
CERN - IT Department CH-1211 Genève 23 Switzerland t Service-Now UDS training [Jan 2011] - 1 Service-now training for UDS Service-now training.
Evaluations and recommendations for a user support toolkit Christine Cahoon George Munroe.
CERN IT Department CH-1211 Genève 23 Switzerland t Next generation of virtual infrastructure with Hyper-V Michal Kwiatek, Juraj Sucik, Rafal.
CERN IT Department CH-1211 Genève 23 Switzerland t Some Hints for “Best Practice” Regarding VO Boxes Running Critical Services and Real Use-cases.
Client Management. Introduction In a typical organization there are a lot of client machines used for day to day operations Client management is a necessary.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES WLCG operations: communication channels Andrea Sciabà WLCG operations.
CERN IT Department CH-1211 Genève 23 Switzerland t Virtualization with Windows at CERN Juraj Sucik, Emmanuel Ormancey Internet Services Group.
Emergency Management Information System - EMIS
Project Overview Application Migration for Client Name Concealed 2nd Client Name Concealed InnovaIT Web Services.
Current Job Components Information Technology Department Network Systems Administration Telecommunications Database Design and Administration.
1. There are different assistant software tools and methods that help in managing the network in different things such as: 1. Special management programs.
Scaling Up PVSS Phase II. 2 Purpose of this talk Start a discussion about the next phase of the Scaling Up PVSS Project. Start a discussion about the.
CERN IT Department CH-1211 Genève 23 Switzerland t Service Management GLM 15 November 2010 Mats Moller IT-DI-SM.
Working in the IT Industry.  On completion of this unit a learner should: 1. Know the characteristics that are valued by employers in the IT industry.
CERN IT Department CH-1211 Genève 23 Switzerland t EIS section review of recent activities Harry Renshall Andrea Sciabà IT-GS group meeting.
Update on Database Issues Peter Chochula DCS Workshop, June 21, 2004 Colmar.
CERN IT Department CH-1211 Genève 23 Switzerland t ITIL at CERN Tony Cass HEPiX LBL, 29 th October 2009.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES PhEDEx Monitoring Nicolò Magini CERN IT-ES-VOS For the PhEDEx.
CERN IT Department CH-1211 Genève 23 Switzerland t Windows Desktop Applications Life-cycle Management Sebastien Dellabella, Rafal Otto Internet.
Keeping Network Monitoring Current using Automated Nagios Configurations (WIP) Greg Wickham APAN July 2005.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Update on Windows 7 at CERN & Remote Desktop.
Virtualised Worker Nodes Where are we? What next? Tony Cass GDB /12/12.
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CERN Computer Centre Upgrade Project Wayne Salter HEPiX November.
CERN - IT Department CH-1211 Genève 23 Switzerland t OIS Deployment of Exchange 2010 mail platform Pawel Grzywaczewski, CERN IT/OIS HEPIX.
Exchange Deployment Planning Services Exchange 2010 Complementary Products.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Automatic server registration and burn-in framework HEPIX’13 28.
CERN IT Department CH-1211 Genève 23 Switzerland t 24x7 Service Support Tony Cass LCG GDB, 24 th November 2009.
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS First look at the Mobile Framework Ivan Deloose,
CERN IT Department CH-1211 Genève 23 Switzerland t DM Database Monitoring Tools Database Developers' Workshop CERN, July 8 th, 2008 Dawid.
CERN IT Department CH-1211 Genève 23 Switzerland t MSG Status update Daniel Rodrigues.
High Availability Technologies for Tier2 Services June 16 th 2006 Tim Bell CERN IT/FIO/TSI.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF Agile Infrastructure Monitoring HEPiX Spring th April.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT Dept Presentation [September 2009] - 1 User Support - Future Changes in Policy and.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Andrea Sciabà Hammercloud and Nagios Dan Van Der Ster Nicolò Magini.
Axios Systems IT Service Management Solutions TM Information Management Good Information Makes the Users Efficient and Positive Brian.
Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Drupal at CERN Juraj Sucik Jarosław Polok.
CERN - IT Department CH-1211 Genève 23 Switzerland t A Quick Overview of ITIL John Shade CERN WLCG Collaboration Workshop April 2008.
3 rd Party Registration & Account Management SMT Update To AMWG February 23, 2016.
CERN - IT Department CH-1211 Genève 23 Switzerland t Operating systems and Information Services OIS Proposed Drupal Service Definition IT-OIS.
1 CERN IT Department CH-1211 Genève 23 Switzerland t Risk of network incident during the last LHC run CERN, 10 January 2013
A Service-Based SLA Model HEPIX -- CERN May 6, 2008 Tony Chan -- BNL.
CERN IT Department CH-1211 Genève 23 Switzerland t SL(C) 5 Migration at CERN CHEP 2009, Prague Ulrich SCHWICKERATH Ricardo SILVA CERN, IT-FIO-FS.
CERN - IT Department CH-1211 Genève 23 Switzerland CASTOR F2F Monitoring at CERN Miguel Coelho dos Santos.
CERN - IT Department CH-1211 Genève 23 Switzerland t OIS Update on the anti spam system at CERN Pawel Grzywaczewski, CERN IT/OIS HEPIX fall.
Computing Facilities CERN IT Department CH-1211 Geneva 23 Switzerland t CF CC Monitoring I.Fedorko on behalf of CF/ASI 18/02/2011 Overview.
JCOP Framework and PVSS News ALICE DCS Workshop 14 th March, 2006 Piotr Golonka CERN IT/CO-BE Outline PVSS status Framework: Current status and future.
CERN - IT Department CH-1211 Genève 23 Switzerland t CERN - IT Department CH-1211 Genève 23 Switzerland t SharePoint 2007 deployment.
CERN IT Department CH-1211 Genève 23 Switzerland t Bamboo users meeting IT-CS-CT.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES Author etc Alarm framework requirements Andrea Sciabà Tony Wildish.
CERN - IT Department CH-1211 Genève 23 Switzerland t ASM and Oracle Service Availability Monitoring LCG 3D Workshop CERN, January 26 th,
CERN IT Department CH-1211 Genève 23 Switzerland t EIS Section input to GLM For GLM attended by Director for Computing.
CERN IT Department CH-1211 Geneva 23 Switzerland t OIS Operating Systems & Information Services CERN IT Department CH-1211 Geneva 23 Switzerland.
Marc Dobson On behalf of CMS DAQ Team
Based on work by DoIT Network Services, UW-Madison
Presentation transcript:

CERN IT Department CH-1211 Genève 23 Switzerland t DIP Service, status, recent issues and plans for the future Mathias Dutour 28 April 2008

CERN IT Department CH-1211 Genève 23 Switzerland t DIP - 2 Plan DIP service overview Recent DNS issues Findings and actions Q/A

DIP service overview: DIP “DIP is a system which allows relatively small amounts of real-time data to be exchanged between very loosely coupled heterogeneous systems. […].” DIP - 3

DIP service overview: DNS 1/2 DIP DNS role: –Establish link between DIP Data subscribers and publishers. –Maintains list of available Publications. DNS location: –ITCO maintains 2 DNS on the GPN, best effort. –Located in the computing center (on 2 separated Linux PCs under (limited) Lemon monitoring. –Procedures for on –shift operators in case of a problem on the sub cluster. –Fallback mechanism. DIP - 4

DIP service overview: DNS 2/2 DIP DNS monitoring: –Monitored locally to diagnose issues, restarting the DNSs on the fly if necessary. PERL + C programs Alerts by s to service managers –Monitored (as a standard user) from Lxplus for SLS monitoring. Updated every 15 minutes for the 2 DNS, + total number of Publications DIP - 5

DIP service overview: Issues 1/2 DNS1 degraded: –(new) connections affected: Delays for new registrations or connections refused. During recovery period, scattered situation with Publishers and Subscribers on different DNSs. Actions: –DNS probing: Improve technical monitoring on the DNS machines, using more stringent DNS probing ( current lack of sensibility). Improve DNS technical feedback toward ITCO service supporters (better technical logging + SMS alerts). –Redundancy: Review fallback mechanism, upgrade DNSs. DIP - 6

DIP service overview: Issues 2/2 Communication not sufficient: –DNS status awareness: SLS page provides the current status as a snapshot, does not tell details on the failure. Lack of feedback on resolution progress. Actions: –User awareness: Provide more details on the SLS webpage about the DNS health and ongoing actions. Report toward users via DIP mailing list. –For the users: Report issues to prevent interpersonal reporting, often DIP - 7

DIP service overview: Future actions Other actions ongoing: –Prepare migration of the DNS on the TN (mid 2008) Prepare switchover in parallel to current GPN solution, announce in advance, provide support for the switch preparation. In depth (load, robustness) testing of new mechanisms prior switchover from GPN to TN. Renew hardware and improve Lemon monitoring for DNS cluster. –Other improvements: Investigate full redundancy and hardware abstraction to improve service availability. Integrate DNS monitoring in PVSS JCOP Framework (under discussion). DIP - 8

Last words DIP - 9 The issues that occurred revealed some weaknesses (monitoring, procedure for recovery, communication) There are ongoing actions to address these concerns. More information: –DIP Service Level Agreement (SLA): –DIP SLS web page: –DIP DNS fallback details: –DIP Usage recommendations:

Questions? Q/A TOTEM Roman Pots control - 10