AliEn central services Costin Grigoras. Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps.

Slides:



Advertisements
Similar presentations
Internet Information Services 7.0 and Internet Information Services 7.5 Infrastructure Planning and Design Published: June 2008 Updated: November 2011.
Advertisements

During the last three years, ALICE has used AliEn continuously. All the activities needed by the experiment (Monte Carlo productions, raw data registration,
ALICE G RID SERVICES IP V 6 READINESS
Pankaj Kumar Qinglan Zhang Sagar Davasam Sowjanya Puligadda Wei Liu
MONITORING WITH MONALISA Costin Grigoras. M ONITORING WITH M ON ALISA What is MonALISA ? MonALISA communication architecture Monitoring modules ApMon.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
GGF Toronto Spitfire A Relational DB Service for the Grid Peter Z. Kunszt European DataGrid Data Management CERN Database Group.
Lesson 1: Configuring Network Load Balancing
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 8 Introduction to Printers in a Windows Server 2008 Network.
CERN - IT Department CH-1211 Genève 23 Switzerland t Oracle and Streams Diagnostics and Monitoring Eva Dafonte Pérez Florbela Tique Aires.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
VTS INNOVATOR SERIES Real Problems, Real solutions.
ALICE Operations short summary and directions in 2012 Grid Deployment Board March 21, 2011.
ALICE Operations short summary LHCC Referees meeting June 12, 2012.
ALICE Operations short summary and directions in 2012 WLCG workshop May 19-20, 2012.
1 Status of the ALICE CERN Analysis Facility Marco MEONI – CERN/ALICE Jan Fiete GROSSE-OETRINGHAUS - CERN /ALICE CHEP Prague.
Monitoring Scale-Out with the MySQL Enterprise Monitor Andy Bang Lead Software Engineer MySQL-Sun, Enterprise Tools Team Wednesday, April 16, :15.
G RID SERVICES IP V 6 READINESS
US ATLAS Western Tier 2 Status and Plan Wei Yang ATLAS Physics Analysis Retreat SLAC March 5, 2007.
Online Monitoring with MonALISA Dan Protopopescu Glasgow, UK Dan Protopopescu Glasgow, UK.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES P. Saiz (IT-ES) AliEn job agents.
INDIACMS-TIFR Tier 2 Grid Status Report I IndiaCMS Meeting, April 05-06, 2007.
Status of the production and news about Nagios ALICE TF Meeting 22/07/2010.
CERN IT Department CH-1211 Geneva 23 Switzerland t Daniel Gomez Ruben Gaspar Ignacio Coterillo * Dawid Wojcik *CERN/CSIC funded by Spanish.
Using Virtual Servers for the CERN Windows infrastructure Emmanuel Ormancey, Alberto Pace CERN, Information Technology Department.
Microsoft Azure SoftUni Team Technical Trainers Software University
Panda Grid Status Kilian Schwarz, GSI on behalf of PANDA GRID Group (slides to a large extend from Radoslaw Karabowicz)
PROOF Cluster Management in ALICE Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
N EWS OF M ON ALISA SITE MONITORING
A Brief Documentation.  Provides basic information about connection, server, and client.
NETS UPS REPLACEMENT. NETS has UPS’s in all of our Communications Rooms Provide clean, reliable, AC power to protect from power blackouts, brownouts,
Site operations Outline Central services VoBox services Monitoring Storage and networking 4/8/20142ALICE-USA Review - Site Operations.
Overview of ALICE monitoring Catalin Cirstoiu, Pablo Saiz, Latchezar Betev 23/03/2007 System Analysis Working Group.
 Load balancing is the process of distributing a workload evenly throughout a group or cluster of computers to maximize throughput.  This means that.
KOLKATA Grid Site Name :- IN-DAE-VECC-02Monalisa Name:- Kolkata-Cream VO :- ALICECity:- KOLKATACountry :- INDIA Shown many data transfers.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES A. Abramyan, S. Bagansco, S. Banerjee, L. Betev, F. Carminati,
Monitoring with MonALISA Costin Grigoras. What is MonALISA ?  Caltech project started in 2002
High Availability in DB2 Nishant Sinha
Xrootd Monitoring and Control Harsh Arora CERN. Setting Up Service  Monalisa Service  Monalisa Repository  Test Xrootd Server  ApMon Module.
The CMS Top 5 Issues/Concerns wrt. WLCG services WLCG-MB April 3, 2007 Matthias Kasemann CERN/DESY.
JAliEn Java AliEn middleware A. Grigoras, C. Grigoras, M. Pedreira P Saiz, S. Schreiner ALICE Offline Week – June 2013.
Infrastructure availability and Hardware changes Slides prepared by Niko Neufeld Presented by Rainer Schwemmer for the Online administrators.
+ AliEn site services and monitoring Miguel Martinez Pedreira.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES L. Betev, A. Grigoras, C. Grigoras, P. Saiz, S. Schreiner AliEn.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
Status of AliEn2 Services ALICE offline week Latchezar Betev Geneva, June 01, 2005.
Data transfers and storage Kilian Schwarz GSI. GSI – current storage capacities vobox LCG RB/CE GSI batchfarm: ALICE cluster (67 nodes/480 cores for batch.
ALICE computing Focus on STEP09 and analysis activities ALICE computing Focus on STEP09 and analysis activities Latchezar Betev Réunion LCG-France, LAPP.
Plesk 8 for Linux/UNIX Server Automation SWSOFT GLOBAL HOSTING SUMMIT 2006 Todd L. Crumpler May 30-June 1, 2006.
Pledged and delivered resources to ALICE Grid computing in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
WLCG Transfers monitoring EGI Technical Forum Madrid, 17 September 2013 Pablo Saiz on behalf of the Dashboard Team CERN IT/SDC.
Storage discovery in AliEn
Virtual machines ALICE 2 Experience and use cases Services at CERN Worker nodes at sites – CNAF – GSI Site services (VoBoxes)
Availability of ALICE Grid resources in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
Federating Data in the ALICE Experiment
Jean-Philippe Baud, IT-GD, CERN November 2007
Consulting Services JobScheduler Architecture Decision Template
ALICE & Clouds GDB Meeting 15/01/2013
Database Replication and Monitoring
Torrent-based software distribution
Consulting Services JobScheduler Architecture Decision Template
Report PROOF session ALICE Offline FAIR Grid Workshop #1
Torrent-based software distribution
Conditions Data access using FroNTier Squid cache Server
Storage elements discovery
Simulation use cases for T2 in ALICE
Load Balancing: List Scheduling
AliEn central services (structure and operation)
Publishing ALICE data & CVMFS infrastructure monitoring
Load Balancing: List Scheduling
Presentation transcript:

AliEn central services Costin Grigoras

Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps uplinks T1/T2 workshop: AliEn central services

AliEn services Several instances under a common DNS alias  Authen  Proxy  PackMan  Optimizers (Jobs, Transfers, Catalogue,PackMan…)  API servers T1/T2 workshop: AliEn central services

DNS load balancing of central services  Each machine reports through ML to the central repository the full status of each machine, including:  Operational status of each service (tested every 15m)  Load on the machine, CPU, memory and swap utilization  No. of connected sockets  A weighted score is generated based on the parameters above, updating every minute the CERN DNS aliases with the IP addresses of the machines that are not overloaded.  The IP aliases are queried by users or site services when connecting to the central services; by using them we distribute the load evenly between the active machines and limit the damage that can be caused to the central services T1/T2 workshop: AliEn central services

DNS load balancing in action Wed Jul 9 07:23:24 CEST 2008 : alice-proxy Thu Jul 10 13:40:38 CEST 2008 : alice-proxy Thu Jul 10 13:44:52 CEST 2008 : alice-proxy T1/T2 workshop: AliEn central services

Information sources  LDAP  Services’ configuration  Users & Roles  MySQL  Transfer Queue: 3M transfers  Task Queue: 27.7M jobs Users (sync with LDAP): 700  Catalogue: 85M entries  MySQL Backup (replication) T1/T2 workshop: AliEn central services

Build servers  AliEn & AliROOT  32 and 64 bit  SLC4 and SLC5  Most of them virtual machines (VirtualBox)  Other build machines:  SLC4 on Itanium  OSX in 32 and 64 bits T1/T2 workshop: AliEn central services

Monitoring  MonALISA Repository  Storage client  Web interface  One database backend  Two more database backends  Redundancy  Load sharing T1/T2 workshop: AliEn central services

More services  6TB storage  Shared AliEn installation  Backup (configuration & DB)  alien.cern.ch website  Xrootd global redirector (more details from Fabrizio later)  ALICE::CERN::SE redirector  BitTorrent tracker and seeder (see Pablo’s talk)  ALICE Offline Project Management & Shift Management T1/T2 workshop: AliEn central services

Plans T1/T2 workshop: AliEn central services10  Installing more power to the room  Scheduled for today, should be transparent  3 new racks  Next week we are planning to move the hardware to them  One day downtime, will be announced  2 new 16-cores (Nehalem) machines  We’ll try to replace several old machines with virtual machines or services on the base machine