CERN Site Report
Helge Meinhard / CERN-IT
HEPiX Spring 2009, Umeå, 25 May 2009
CERN IT Department, CH-1211 Genève 23, Switzerland, www.cern.ch/it



Structure: Changes on 01-Jan-2009
CERN Directorate
  – Rolf Heuer (Director General)
  – Sergio Bertolucci (Research and Computing)
  – Stephen Myers (Accelerators and Technology)
  – Sigurd Lettow (Administration and General Infrastructure)
IT Department
  – Department head: Frédéric Hemmer
  – Deputy: David Foster
Two former IT groups changed department
  – CO (now in Engineering Department as Industrial Controls and Electronics)
  – AIS (now in General Infrastructure Services Department)
CERN Site Report for HEPiX Spring 2009 – Helge Meinhard at cern.ch

LHC
First circulating beams in LHC: 10 September 2008
  – Very large press coverage
  – Web servers hit 20 times more often than usual
Incident on 19 September 2008
  – During attempt to ramp beam energy up to 7 TeV, a leak occurred in the cold mass, causing significant loss of helium
Repair work is ongoing
  – Instrumentation for detecting this kind of problem being added
Schedule: beam end September 2009, collisions end October 2009, running until autumn 2010
  – Collision energy: TeV
  – Short technical stop at Christmas 2009
Work on schedule

Databases and Engineering Support
TWiki
  – Started at CERN in 2003
  – Now 5000 active users, topics, monthly updates
  – ATLAS and CMS are heavy users
  – Security enhancements: write access for CERN accounts only, campaign to disable anonymous read
Subversion
  – A better version control system
  – Following successful beta test, went into production in January 2009
  – Will coexist with CVS for quite some time

Fabric Infrastructure and Operation (1)
Linux
  – Certification etc.
      – SLC4 is current production version, mostly x86_64 on servers
          – Laptops problematic since 2006, desktops since a little later
      – SLC5 ongoing, formal certification delayed because of CERNLIB
      – Because of demand for SLC5, lxplus and (very sizeable) lxbatch facilities established, publicly open now
      – Recommendation for experienced users without any known dependency on SLC4 is SLC5
  – DNS incidents:
      – Found that with RH5, the DNS lookup conforms more closely to RFC 3484
      – No longer the first IP, or a random choice, but the address deemed closest to the client (longest matching prefix) is returned first
  – Change of responsibility: Jan van Eldik takes over from Jan Iven
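The RFC 3484 behaviour mentioned under DNS incidents can be illustrated with a small sketch. Rule 9 of the RFC's destination address selection prefers the candidate that shares the longest bit prefix with the client's own address, which is why a resolver following it no longer returns the first or a random address. The code below illustrates only that one rule, not the full RFC algorithm or the RH5 resolver itself; the function names and the example addresses are hypothetical.

```python
import ipaddress
from typing import List


def common_prefix_len(a: str, b: str) -> int:
    """Return the number of leading bits shared by two IPv4 addresses."""
    diff = int(ipaddress.IPv4Address(a)) ^ int(ipaddress.IPv4Address(b))
    # The highest set bit of the XOR marks the first differing bit.
    return 32 - diff.bit_length()


def sort_by_longest_prefix(source: str, candidates: List[str]) -> List[str]:
    """Order candidates so that the longest match with `source` comes first.

    sorted() is stable, so candidates with equal prefix length keep
    the order in which the DNS answer listed them.
    """
    return sorted(candidates, key=lambda dst: -common_prefix_len(source, dst))


# Hypothetical client address and DNS answer, for illustration only.
client = "10.1.2.3"
servers = ["192.0.2.10", "10.200.0.5", "10.1.9.9"]
ordered = sort_by_longest_prefix(client, servers)
print(ordered)  # ['10.1.9.9', '10.200.0.5', '192.0.2.10']
```

With such an ordering, every client in the same address range is steered to the same server, so a DNS alias intended for round-robin load sharing stops spreading load evenly.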

Fabric Infrastructure and Operation (2)
Infrastructure
  – New computer centre
      – New directorate very favourable towards CC project
          – Green solution: link new CC to heating of respective site (Prévessin)
      – 4 detailed design studies delivered in 2008
      – Tender for client advisor
      – Subsequently (unique) tender for construction
  – Total of 60 water-cooled racks of 10 kW each:
      – Infrastructure ready
      – Delivery of CPUs and disks in September 2008 will use them
Backup
  – Growth remains higher than expectations (~ 35%)
      – Users don't know what they want…
  – Alex Iribarren took over from David Asbury as service manager

Fabric Infrastructure and Operation (3)
Procurements (1)
  – Change of LHC running schedule required procurement planning to be adapted
      – Initially: 2 big rounds for 2009, 2 big rounds for 2010
      – After September 19th: Only one round for 2009, one for 2010
      – February: Emergency procedure added to have additional equipment in October 2009
  – Tenders open or in preparation

Type                         Production   Volume
CPU servers                  October      HEP-SPEC06 (about 500 systems)
Small disk servers           October      systems
Large disk servers           October      PB usable (about 200 systems)
Servers and iSCSI backends   October      PB usable
Oracle servers               October      systems
Midrange servers             > October    systems
FC disk arrays               > October    systems
CPU servers                  March        HEP-SPEC06 (about 1500 systems)
Large disk servers           March        PB usable (about 500 systems)

CERN CC currently (March 2009)
5700 systems, processing cores
CPU servers, disk servers, infrastructure servers
TB usable on disk drives
TB on tape cartridges (56000 slots), 160 tape drives

Fabric Infrastructure and Operation (4)
Procurements (2) – Fun points
  – Defective memory modules
  – Yet another supplier went out of business in January
      – > 100 systems in CC under warranty no longer maintained
  – Memory manufacturer insolvent
  – Late deliveries
  – Spurious error messages due to defective firmware
      – Almost 1000 systems upgraded: BIOS and BMC
  – Disk drive problems: thousands of drives upgraded
      – Drives dropped out of RAID under heavy load
      – I/O errors on disk drives
      – Corruptions on disk drives
      – Risk of making drives inoperable
      – Default expiration date...
  – RAID controller / SAS expander incompatibilities
      – Causing data corruptions, required firmware fix for RAID controller
  – I/O errors on disk array in JBOD mode
  – Dropped connections (or none at all) via BMC SOL
  – Unphysical (aka crazy) CPU accounting
  – Loss of network connection to BMC

Fabric Infrastructure and Operation (5)
Tapes
  – Following successful tests, drives upgraded to newest generation supporting ~1 TB, media being repacked
  – After many years of dealing with tapes, Charles Curran retired
      – Vlado Bahyl has agreed to take over
R&D
  – Lustre pilot project (see Arne Wiebalck's talk)
  – iSCSI tests (see Andras Horvath's talk)

Internet Services (1)
Windows services
  – Terminal services remain very popular, number steeply increasing (now ~ 60 in production, i.e. 40% growth over 2008)
      – Ivan Deloose taking management over from Michael Kwiatek
Windows clients
  – Vista default for sufficiently powerful machines (e.g. 2 GB or more), but XP remains a choice. Issue on Technical Network because of activation
  – Michael Kwiatek taking over from Rafal Otto

Internet Services (2)
Mail services
  – Migration of mailing lists and archives to e-groups completed
  – Mailboxes being moved to Exchange 2007
  – Plagued by Thunderbird bug exhausting IMAP connections

User Support
Preparing for re-tendering the desktop / helpdesk contract
  – New contract as of 01-Jan-2010
  – Current helpdesk workflow in Remedy being re-implemented by September 2009
Printing under consideration
  – (Far) Too many printers, too many different printer models, high (but mostly hidden) cost
  – Proposal for small number of multi-function devices, pay-by-sheet

Security
Incidents include
  – Well-known attack on network protocol on Linux machines back
  – Increased phishing attacks – some users really give their passwords away…
  – Users posting sensitive information on open channels (mailing lists, Web including TWiki etc.)
Measures taken include
  – Tor added to list of blocked applications
  – Access to remote DNS servers blocked
Skype: Trial ongoing under certain conditions
  – Tolerated as long as it is configured correctly and no more problems than expected occur
Training courses for users and developers well received

ITIL
Information Technology Infrastructure Library
CERN-IT getting serious about it…
In some places not far off, but rigour sometimes missing
  – Meetings are working well, but invested effort can't be compared with measurable improvements of services
  – No clear separation yet of change management, incident management, service level management
Hope is to reduce the stress and heroism needed to run services
KPIs difficult in our environment
Aim is to demonstrate that best practices are applied
IT Service Review meeting being instantiated (role of DTF changes)
  – Need to unify approach of service review, avoid multiple meetings

Miscellaneous
CERN Openlab
  – Phase III started
  – Siemens joined as a partner for automation and controls
  – Oracle, Intel and HP renewed membership
SURE monitoring retired after 15 years