TCD Site Report Stuart Kenny*, Stephen Childs, Brian Coghlan, Geoff Quigley.

TCD
Two roles:
  – UKI grid site
  – Grid-Ireland operations centre
18 sites centrally managed by operations team (8 members, soon to be 7)
Responsible for TCD site and Grid-Ireland central services
Quattor deployed and managed
  – Extensive use of Xen VMs

Hardware
Dell 2950 gateway host [16GB DRAM + 6TB RAID6]
  – Xen host (CE, UI, R-GMA MON, test WNs)
Dell 2950 SE host [16GB DRAM + 6TB RAID6]
96 x Dell 1950 WNs [16GB DRAM + 500GB]
  – 50 x U/G lab Condor pool WNs
8 x Dell 2950 central server hosts [16GB DRAM + 16TB RAID6]
  – host01: webserver + rt
  – host02: repository
  – host03: VOMS, myproxy, gLite WMS
  – host04: BDII, R-GMA, WMS
  – host05: monitoring server, oracle server
  – host06: portal servers
  – host07: datamgt servers
  – host08: alternate middleware
8 x Dell 2950 redundant central server hosts [16GB DRAM + 16TB RAID6]
1 GbE networking, with 3 x 10 GbE uplinks

Storage
TCD already had some
  – Dell PowerEdge 2950 (2x Quad Xeon)
  – Dell MD1000 (SAS - JBOD)
After procurement data store has total
  – 8x Dell PE2950
  – 30x MD1000, each with 15x 1TB disks
      ~11.6 TiB each after RAID6 and XFS format (~348 TiB total)
  – Dell Blade Chassis with 8x M600 blades
  – Dell tape library (24x Ultrium 4 tapes)
  – HP ExDS9100 with 4 capacity blocks of 82x 1TB disks and 4 blades
      ~233 TiB total available for NFS/http export
Storage Workshop - Geoff Quigley Thurs 13:50
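The MD1000 capacity figures can be checked with a little arithmetic. The sketch below assumes each MD1000 is configured as a single 15-disk RAID6 set (the slide does not state the layout) and that "1TB" means a decimal terabyte. The function name `raid6_usable_tib` is illustrative only. The 11.6 TiB per-shelf figure after XFS formatting is taken from the slide, not computed.

```python
TB = 10**12   # decimal terabyte, as used for disk sizes
TIB = 2**40   # binary tebibyte


def raid6_usable_tib(disks: int, disk_tb: float) -> float:
    """Usable capacity of one RAID6 set in TiB, before filesystem overhead.

    RAID6 spends the equivalent of two disks on parity.
    """
    if disks < 4:
        raise ValueError("RAID6 needs at least 4 disks")
    return (disks - 2) * disk_tb * TB / TIB


# Assumption: one 15-disk RAID6 set per MD1000 shelf.
per_shelf_raw = raid6_usable_tib(15, 1.0)

# Figure quoted on the slide for one shelf after RAID6 and XFS format.
per_shelf_formatted = 11.6
total = 30 * per_shelf_formatted

print(f"RAID6 usable per MD1000: {per_shelf_raw:.2f} TiB")
print(f"Total over 30 shelves:   {total:.0f} TiB")
```

Under these assumptions the RAID6 set yields about 11.8 TiB before formatting, which is consistent with the quoted ~11.6 TiB per shelf and ~348 TiB over 30 shelves.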

Infrastructure
Room needed upgrade
  – Another cooler
  – UPS maxed out
New high-current AC circuits added
2x 3kVA UPS per rack acquired for Dell equipment
ExDS has 4x 16A 3Ø - 2 on room UPS, 2 raw
10 GbE to move data!
Storage Workshop - Geoff Quigley Thurs 13:50

Redundant Operations Centre
Aim is to keep up-to-date replicas of core server VMs to allow failover in case of network or hardware failures
Design decisions
  – Replicate storage “underneath” Xen VMs
  – Replicate at block level: avoid need for service-specific replication policies
  – Manual failover initially
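The design decisions above can be illustrated with a toy in-memory model. This is only a sketch: the slide does not name the replication technology, and the classes `BlockStore` and `ReplicatedVolume`, the 4096-byte block size, and the synchronous mirroring are all assumptions made for illustration. The point it shows is that a replica kept at block level needs no knowledge of the service running inside the VM.

```python
class BlockStore:
    """Toy block device: a fixed number of fixed-size blocks held in memory."""

    def __init__(self, n_blocks: int, block_size: int = 4096):
        self.block_size = block_size
        self.blocks = [bytes(block_size) for _ in range(n_blocks)]

    def write(self, index: int, data: bytes) -> None:
        if len(data) != self.block_size:
            raise ValueError("data must be exactly one block long")
        self.blocks[index] = bytes(data)

    def read(self, index: int) -> bytes:
        return self.blocks[index]


class ReplicatedVolume:
    """Mirrors every block write to a standby store; failover is manual."""

    def __init__(self, primary: BlockStore, replica: BlockStore):
        self.active = primary
        self.standby = replica

    def write(self, index: int, data: bytes) -> None:
        # Assumption: synchronous mirroring, both copies written per request.
        self.active.write(index, data)
        self.standby.write(index, data)

    def read(self, index: int) -> bytes:
        return self.active.read(index)

    def failover(self) -> None:
        """Operator-triggered switch to the standby copy."""
        self.active, self.standby = self.standby, self.active


# The "VM" only ever sees a block device; it is unaware of the replica.
volume = ReplicatedVolume(BlockStore(8), BlockStore(8))
payload = b"x" * 4096
volume.write(3, payload)
volume.failover()
recovered = volume.read(3)
print(recovered == payload)
```

Because the mirror sits below the filesystem, the same mechanism covers every service VM without per-service replication rules, which is the trade-off the slide describes.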

Monitoring
A lot of work recently on monitoring configuration
  – Want to configure as much as possible from common Quattor templates
Nagios
  – Submitting local WLCG grid probes for G-I VOs
Lemon
Ganglia
Also used
  – Weathermap
  – Cacti
  – ASI (Security Day talk)
  – …

Grid-Ireland Setup
[Diagram. Labels, in order of appearance: Monitoring server, EGEE SAM, GI SAM, Quattor templates, Site admins, Get site status, Issue alarms, TCD Site, Nagios, NSCA, Lemon Agent, Lemon Host Check, Nagios NRPE, gridui, GI Sites]

Lemon-Nagios Integration
Lemon service added
Additional lemon metrics added to hosts
Cron executes lemon-host-check
  – Output sent to nagios via nsca
Exception results in Lemon service failure
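A rough sketch of the last two steps: turning a host check outcome into a passive result for Nagios. The NSCA client reads lines of the form host, service description, return code and plugin output, separated by tabs. Everything else here is an assumption: the output format of lemon-host-check is not given on the slide, so the list of exception names, the host name `wn001.example.org`, the service description `Lemon` and the mapping of any exception to CRITICAL are hypothetical.

```python
# Standard Nagios plugin return codes.
NAGIOS_OK, NAGIOS_WARNING, NAGIOS_CRITICAL, NAGIOS_UNKNOWN = 0, 1, 2, 3


def passive_result(host: str, service: str, exceptions: list[str]) -> str:
    """Build one NSCA input line: host<TAB>service<TAB>code<TAB>output."""
    if exceptions:
        # Assumption: any Lemon exception marks the service as failed.
        code = NAGIOS_CRITICAL
        output = "Lemon exceptions: " + ", ".join(exceptions)
    else:
        code = NAGIOS_OK
        output = "No Lemon exceptions"
    return f"{host}\t{service}\t{code}\t{output}\n"


# Hypothetical host, service and exception names, for illustration only.
healthy = passive_result("wn001.example.org", "Lemon", [])
failing = passive_result("wn001.example.org", "Lemon", ["example_exception"])
print(healthy, end="")
print(failing, end="")
```

In a cron job, a line like this would be piped to the send_nsca client, which forwards it to the NSCA daemon on the Nagios server.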

Lemon-Nagios Integration

Monitoring - Weathermap

Active Security
Existing Grid security activities focused on prevention
  – Authentication, authorization
Active security focused on
  – Detection
  – Reaction
3 components
  – Security monitoring
  – Alert Analysis
  – Control Engine
Security Day – Stuart Kenny Wed 10:15

Active Security - Report