CVS Service at CERN status and LCG-dedicated service CERN IT/PS/UI October 2003.

Slides:



Advertisements
Similar presentations
Express5800/ft series servers Product Information Fault-Tolerant General Purpose Servers.
Advertisements

Report on CVS Services : Central and LCG-dedicated services CERN IT/PS/UI May 2004.
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Torsten Antoni – LCG Operations Workshop, CERN 02-04/11/04 Global Grid User Support - GGUS -
Copyright Hub Software Engineering Ltd 2010All rights reserved Hub Document Manager Product Overview.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
Highly Available Central Services An Intelligent Router Approach Thomas Finnern Thorsten Witt DESY/IT.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 8: Implementing and Managing Printers.
Cambodia-India Entrepreneurship Development Centre - : :.... :-:-
Installing software on personal computer
Implementing High Availability
Microsoft Load Balancing and Clustering. Outline Introduction Load balancing Clustering.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 1: Introduction to Windows Server 2003.
CERN - IT Department CH-1211 Genève 23 Switzerland t SVN Pilot: CVS Replacement Manuel Guijarro Jonatan Hugo Hugosson Artur Wiecek David.
Chapter 2: Installing and Upgrading to Windows Server 2008 R2 BAI617.
Database Services for Physics at CERN with Oracle 10g RAC HEPiX - April 4th 2006, Rome Luca Canali, CERN.
Chapter-4 Windows 2000 Professional Win2K Professional provides a very usable interface and was designed for use in the desktop PC. Microsoft server system.
Module 13: Configuring Availability of Network Resources and Content.
Abstract The automated multi-platform software nightly build system is a major component in the ATLAS collaborative software organization, validation and.
Technology Overview. Agenda What’s New and Better in Windows Server 2003? Why Upgrade to Windows Server 2003 ?  From Windows NT 4.0  From Windows 2000.
INSTALLING MICROSOFT EXCHANGE SERVER 2003 CLUSTERS AND FRONT-END AND BACK ‑ END SERVERS Chapter 4.
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
7/2/2003Supervision & Monitoring section1 Supervision & Monitoring Organization and work plan Olof Bärring.
1 Linux in the Computer Center at CERN Zeuthen Thorsten Kleinwort CERN-IT.
October, Scientific Linux INFN/Trieste B.Gobbo – Compass R.Gomezel - T.Macorini - L.Strizzolo INFN - Trieste.
Yannick Patois – CVS and Autobuild tools at CCIN2P3 – hepix - October, n° 1 CVS setup at CC-IN2P3 and Datagrid edg- build tools CVS management,
CSU - DCE Internet Security... Privacy Overview - Fort Collins, CO Copyright © XTR Systems, LLC Setting Up & Using a Site Security Policy Instructor:
Csi315csi315 Client/Server Models. Client/Server Environment LAN or WAN Server Data Berson, Fig 1.4, p.8 clients network.
Module 2: Installing and Maintaining ISA Server. Overview Installing ISA Server 2004 Choosing ISA Server Clients Installing and Configuring Firewall Clients.
Managing the Oracle Application Server with Oracle Enterprise Manager 10g.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
CERN Physics Database Services and Plans Maria Girone, CERN-IT
Large Farm 'Real Life Problems' and their Solutions Thorsten Kleinwort CERN IT/FIO HEPiX II/2004 BNL.
Deployment work at CERN: installation and configuration tasks WP4 workshop Barcelona project conference 5/03 German Cancio CERN IT/FIO.
Chapter 10 Chapter 10: Managing the Distributed File System, Disk Quotas, and Software Installation.
An Agile Service Deployment Framework and its Application Quattor System Management Tool and HyperV Virtualisation applied to CASTOR Hierarchical Storage.
Lemon Monitoring Miroslav Siket, German Cancio, David Front, Maciej Stepniewski CERN-IT/FIO-FS LCG Operations Workshop Bologna, May 2005.
OSIsoft High Availability PI Replication
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
Installing, running, and maintaining large Linux Clusters at CERN Thorsten Kleinwort CERN-IT/FIO CHEP
Chapter 2 Securing Network Server and User Workstations.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
VMware vSphere Configuration and Management v6
CERN IT Department CH-1211 Genève 23 Switzerland PES 1 Ermis service for DNS Load Balancer configuration HEPiX Fall 2014 Aris Angelogiannopoulos,
HEPiX 2 nd Nov 2000 Alan Silverman Proposal to form a Large Cluster SIG Alan Silverman 2 nd Nov 2000 HEPiX – Jefferson Lab.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
David Foster LCG Project 12-March-02 Fabric Automation The Challenge of LHC Scale Fabrics LHC Computing Grid Workshop David Foster 12 th March 2002.
Maria Girone CERN - IT Tier0 plans and security and backup policy proposals Maria Girone, CERN IT-PSS.
CNAF Database Service Barbara Martelli CNAF-INFN Elisabetta Vilucchi CNAF-INFN Simone Dalla Fina INFN-Padua.
R. Krempaska, October, 2013 Wir schaffen Wissen – heute für morgen Controls Security at PSI Current Status R. Krempaska, A. Bertrand, C. Higgs, R. Kapeller,
A Service-Based SLA Model HEPIX -- CERN May 6, 2008 Tony Chan -- BNL.
Status of tests in the LCG 3D database testbed Eva Dafonte Pérez LCG Database Deployment and Persistency Workshop.
BNL dCache Status and Plan CHEP07: September 2-7, 2007 Zhenping (Jane) Liu for the BNL RACF Storage Group.
CERN 13-Jun-2002 Andreas Pfeiffer, CERN/IT-API, Development Infrastructure Andreas Pfeiffer CERN IT/API
Replicazione e QoS nella gestione di database grid-oriented Barbara Martelli INFN - CNAF.
Platform & Engineering Services CERN IT Department CH-1211 Geneva 23 Switzerland t PES Improving resilience of T0 grid services Manuel Guijarro.
9 Copyright © 2004, Oracle. All rights reserved. Getting Started with Oracle Migration Workbench.
Service Level Status Overview project Sebastian Lopienski CERN, IT/FIO HEPiX meeting, Jefferson Lab, October 10 th, 2006.
OSIsoft High Availability PI Replication Colin Breck, PI Server Team Dave Oda, PI SDK Team.
INFSO-RI Enabling Grids for E-sciencE Running reliable services: the LFC at CERN Sophie Lemaitre
Monitoring and Fault Tolerance
Status of Fabric Management at CERN
Database Services at CERN Status Update
WP4-install status update
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
GGUS Partnership between FZK and ASCC
Software Version System Part1: Subversion at CERN
Presentation transcript:

CVS Service at CERN status and LCG-dedicated service CERN IT/PS/UI October 2003

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 Outline CVS Service for LCG The challenge Architecture Failure recovery Advantages, disadvantages Plans for the future CERN Central CVS Service Overview What does it offer? Projects Architecture Status Interactions with users Failures and fail-over Tools

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Overview central service hosting CERN-related software projects created following a service request - collection of user requirements - architecture proposals - implementation based on assigned resources in production since the end of August 2002 currently hosting around 45 projects, 3 GB of data (major part is source code)

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Projects AB-CO FPGA designs AB-RF-CS Section Software ABs Common-Build ATLAS TDAQ RobIn Anaphe Atlas (migrating) Automatic Driver Code Generation BIC surveillance project CASTOR project CCDB CERN Central CVS Service CERN Oracle Grid Data Management Services CESAR Project CMW administration tool CVS test Computer Centre - Operational Procedures Management Cyan Jaguar Diana Network device information analysing and monitoring Project EJB Platform Project Front End Software Architecture GUI Platform project HEPiX scripts IT-FIO software IT-PS-UI software Itcobe Java Api for Parameter Control Java Data Viewer LCG Oracle AS + DB Tier 1 Deployment LHC Alarm Service Project LHC Logging Project LHC Orbit feedback LHCb software Machine Component State Manager Macsy MammoGrid project Middle Tier Server for CMW NA48 analysis software NA60 Offline Project OASIS software OPERA project PHOS Trigger Router Unit PS LICMON Software QUATTOR components for CERN Section Activities Administration System...

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service What does it offer? secure and robust CVS service with up-to-date server software data integrity (mirror every hour, daily archiving) several access methods: Kerberos IV, SSH, pserver service support through Remedy automatic CVS lock monitoring and reporting good performance Web interfaces: CVSWeb, ViewCVS BUT: the service is not a project management tool

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Architecture automatic and transparent load-balancing and fault tolerance (via an ISS DNS alias that distributes CVS requests among a farm of four servers) dependency on AFS and DNS monitoring availability every 10 min. service availability higher than 99.98% so far

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Failures and fail-over On average, one out of four nodes is often down, due to: - software upgrades - 3ware disk controllers failures - hardware tests performed to investigate hangs (lxcvs02 has hung more than 20 times this year) 4 disks failed (no problem since they are mirrored) Automatic fail-over made the disruptions mentioned above transparent to CVS users Total down-time of the service this year < 12 hours (est.): - Computer Centre power cuts - several short network interruptions (mainly at non-working hours) - some others partial interruptions: ·AFS problems - affecting only some users ·xinetd configuration (wrong pserver port number, limit on kserver processes)

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Status Software upgrades: -operating system → Red Hat 7.3 -CVS → (newest feature version) -applying patches when necessary Two additional nodes (SEIL 2x2.4GHz) have been added to the cluster – currently four servers Fully automated installation and configuration of the machines using WP4 tools (as in lxbatch): -CDB templates -kickstart files -... but still some SUE features to be translated into WP4 components

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Interactions with users Almost 200 user requests or questions received and answered so far - around 90 on the Remedy system - others via Full documentation on the Web ( ) - user documentation - manuals - howto's - list of CVS books - technical documentation for administrators Web tools for users: - configuring access type for web interfaces: Public, Restricted or None - modified CVSWeb and ViewCVS - encrypting passwords for pserver access

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CERN Central CVS Service Tools A series of tools were developed for service maintenance: detecting and increasing AFS volume size for projects which are about to get full CVS lock detection and reporting to librarians ISS statistics cluster nodes' information (availability; load; up-time; ISS information and enabling and disabling a node) cvsserver SUE feature (for automatic machine configuration) scripts for creating and deleting projects setup, availability and backup checking scripts etc.

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003

CVS Service for LCG The LCG challenge LCG (LHC Computing Grid – group has requested a non AFS-based CVS Service Several proposals were prepared to meet this demand One solution was chosen, implemented and is to be evaluated by LCG The old CVS Service will remain in production, available for non LCG-related projects

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CVS Service for LCG Architecture An „N+1” cluster: N active nodes with repositories on local file systems additional passive node („slave server”) – backup for all active servers data replication – repositories copied to the slave server for each repository there is a DNS alias pointing to the node hosting this repository when a node is down, aliases are redirected to the slave server currently 3+1 machines

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CVS Service for LCG Architecture repositories copies of repositories slave X.cvs.cern.ch Project X 12N servers

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CVS Service for LCG Advantages and disadvantages Advantages fastest possible access to the repositories (local file system) independent of AFS regular PC as servers (no special hardware bought) Disadvantages: constant mirroring may affect performance load-balance is on repository level (not request level) slave server down => no fault tolerance fail-over requires human intervention (for the time being) Plans for the future: Automatic fail-over (system decides when to switch to the slave server)

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 CVS Service for LCG Other information data replication is done by CVSup access to repositories and user home directories on a file-system level via NFS Web interface: CVSWeb (sample URL: instant DNS update CVS Service for LCG web page:

CVS Service – M. Guijarro, S. Lopienski (CERN IT/PS/UI) – October 2003 Thank you! Any questions? More information at: