RAL Site Report John Gordon HEPiX/HEPNT Catania 17th April 2002.



John Gordon - HEPiX - n° 2
Outline
- Recent hardware changes
  - CPU
  - Disk
  - Tape
  - Network & Wireless
  - Videoconferencing
- Recent service changes
  - TierA Centre for BaBar
  - EDG Testbed1
- Issues

Recent hardware changes
- CPU
- Disk
- Tape
- Network

CPU
- Recently started buying 1U rack-mounted dual-CPU boxes
  - 14 dual 1GHz for EDG Testbed, late 2001
  - 156 dual 1.4GHz PIII in March 2002
- Large increase on existing 250 CPUs
  - Speeds from 450MHz to 1GHz

CPU
- Recent purchase from Compusys
- Result of competitive EU tender
- Chose PIII when we defined the spec last October
  - Worried about P4 compiler support and performance
- Kickstarted into existing infrastructure, so running within days


CPU
- GHz Pentium III Tualatin CPUs
- 1GB of memory per dual
- Internal 40GB Maxtor Viper disk
- Tyan S2518 Serverworks LE motherboard
- 100Mbit Ethernet NIC
  - Connected to switch with multiple Gb uplinks

Recent Tender for Disk
- Delivered March 2002, so no running experience yet
- Compusys the supplier
- Chose SCSI/IDE RAID solution
  - IDE disks in RAID controller
  - SCSI connection to host
  - No modification to host operating system
  - Quicker to replace host, reconfigure, add extra arrays
- 26 rack-mounted Linux servers with 52 RAID arrays
  - = ~50,000GB raw, 45TB RAID5


RAID Arrays & Servers

RAID Arrays
  RAID controller:          Zero-D X-3I-R
  Drives per controller:    12
  Drive size/manufacturer:  80GB Maxtor Viper
  Speed:                    7200rpm

RAID Servers
  Processors:               2 x 1.266GHz
  Motherboard:              Tyan S2518 Serverworks LE
  Memory:                   1GB ECC PC133 SDRAM
  NIC:                      Intel pro/1000T
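
The raw and RAID5 capacities quoted for the disk tender can be cross-checked against the array specification above. The following is a minimal sketch, not taken from the slides: it assumes every one of the 52 arrays is fully populated with 12 x 80GB drives, that RAID5 costs one drive's worth of parity per array, and that no hot spares are reserved. The function names are illustrative only.

```python
# Cross-check of the quoted disk capacities. Assumptions (not stated on the
# slides): every array is fully populated with 12 x 80GB drives, RAID5 costs
# one drive's worth of parity per array, and no hot spares are reserved.

ARRAYS = 52
DRIVES_PER_ARRAY = 12
DRIVE_GB = 80


def raw_capacity_gb(arrays, drives_per_array, drive_gb):
    """Total capacity of all drives, ignoring any RAID overhead."""
    return arrays * drives_per_array * drive_gb


def raid5_capacity_gb(arrays, drives_per_array, drive_gb):
    """Usable capacity when each array gives up one drive to parity."""
    return arrays * (drives_per_array - 1) * drive_gb


raw_gb = raw_capacity_gb(ARRAYS, DRIVES_PER_ARRAY, DRIVE_GB)
raid5_gb = raid5_capacity_gb(ARRAYS, DRIVES_PER_ARRAY, DRIVE_GB)

print(f"raw:   {raw_gb} GB")
print(f"RAID5: {raid5_gb} GB")
```

Under these assumptions the raw total is 49,920GB and the RAID5 total is 45,760GB, in line with the "~50,000GB raw, 45TB RAID5" quoted for the tender.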

Performance
- Benchmarked several of the offered solutions (IOZONE shown)
  - Can't disclose benchmark results of other suppliers
- Zero-D RAID controller showed the best performance, particularly in sequential read, where we have the most pressing requirement
- Infotrends 6300 also offers good performance
- The remaining systems offer only tolerable or poor performance
- Benchmarking was most illuminating regarding the suppliers' knowledge of the equipment, technical expertise, ability to cope under pressure and ability to provide support on their product
  - This information was also fed into the tender evaluation

Benchmark Results (Read, Single Array)
- Single thread sequential read (KB/s) at varying record size
- Throughput test (aggregate KB/s) at fixed record size (32K), by number of threads
- Stride throughput test: 32K read, 15 skip, by number of threads
- Random I/O read
(table values not preserved in this transcript)

RAID0 across 2 RAID5 Arrays
- Single thread: write and read by record size
(table values not preserved in this transcript)

Throughput test (aggregate KB/s) at Fixed Record Size (32K)
- Write and read by number of threads
(table values not preserved in this transcript)

Stride Test
- CMS data read under Objectivity mimic: 1000MB file, 32K read, 15 record skip; read by number of threads
- Stride reader (30 record skip): 6958
- Random I/O test: 1000MB file, 32K read; write and read
(remaining table values not preserved in this transcript)

IDE RAID
- Dual 1GHz PIII system
- Two 3ware Escalade 7810 controllers (64bit/66MHz PCI)
- 8 x 100GB Maxtor disks per controller: 7 per RAID 5 set plus 1 hot spare, giving 1.1TB usable filespace in total
- Bonnie performance (kernel ), MB/s: block write and block read by RAID level (table values not preserved in this transcript)
- Good RAID5 read performance but not good write performance
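
One way to arrive at the 1.1TB usable figure is sketched below. The slide does not say how the number was derived, so this is an assumption: each of the two controllers runs one 7-disk RAID5 set (one disk's worth of parity), the hot spare contributes nothing, and the disks' 100GB is decimal while the reported filespace is in binary units.

```python
# Hypothetical reconstruction of the 1.1TB usable figure for the 3ware setup.
# Assumptions: one 7-disk RAID5 set per controller, one disk's worth of parity
# per set, hot spare excluded, 100GB meaning 100 * 10**9 bytes.

CONTROLLERS = 2
DISKS_PER_RAID5_SET = 7
DISK_GB = 100

# Data disks per set = total disks in the set minus one for parity.
usable_gb = CONTROLLERS * (DISKS_PER_RAID5_SET - 1) * DISK_GB

# Convert decimal gigabytes to binary terabytes (2**40 bytes).
usable_tib = usable_gb * 10**9 / 2**40

print(f"usable: {usable_gb} GB decimal, {usable_tib:.2f} TB binary")
```

With these assumptions the two sets give 1200GB decimal, which is about 1.09 in binary terabytes and rounds to the quoted 1.1TB.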

Disk Services
- Still need to investigate how best to run new disk servers
- RAID0 vs RAID5 vs RAIDn
- Filesystems?
- Will probably try different setups for data files/databases/scratch

Tape
- Single STK Powderhorn robot
- 5632 slots, not full
- 5 IBM 3590 drives (10GB)
- 5 STK 9940 drives (60GB)
- Currently: 3160x x9940 = 81TB
- If full of 9940s = 330TB
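
The "if full of 9940s" figure can be reproduced from the slot count and cartridge size. This is a small sketch under one assumption that the slide does not state: that 1TB is counted as 1024GB. The cartridge count behind the current 81TB is garbled in the transcript, so it is not reconstructed here.

```python
# Capacity of the robot if every slot held a 60GB 9940 cartridge.
# Assumption: the slide counts 1TB as 1024GB.

SLOTS = 5632
CARTRIDGE_9940_GB = 60
GB_PER_TB = 1024  # assumed convention

full_gb = SLOTS * CARTRIDGE_9940_GB
full_tb = full_gb / GB_PER_TB

print(f"full of 9940s: {full_gb} GB = {full_tb:.0f} TB")
```

With decimal terabytes the same slots would come to roughly 338TB, so the 330TB on the slide suggests the binary convention was used.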

Network
- Nortel Gbit infrastructure for whole site
- 3 x Summit 7i for HEP services
- WAN: 622Mb to local WAN, 2.5Gb UK backbone
- 2.5Gb to RAL summer 2002, when backbone goes to 10Gb
- 100s of CPUs accessing 10s of disk servers (Gb) needs:
  - Very big switch as interconnect, OR
  - Clustering of disk servers and CPU clusters
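
To give a rough sense of scale for the interconnect question, the sketch below adds up the nominal link speeds on each side. It is an illustration only and uses just the newly delivered hardware mentioned earlier in the talk (156 dual-CPU nodes with 100Mbit NICs, 26 disk servers with Gb NICs). It assumes every link could run at line rate at once, which real workloads will not do.

```python
# Nominal aggregate bandwidth on each side of the CPU/disk interconnect.
# Assumptions: only the newly delivered nodes are counted, and every NIC is
# taken at its nominal line rate simultaneously (a worst-case upper bound).

CPU_NODES = 156
CPU_NIC_MBIT = 100

DISK_SERVERS = 26
DISK_NIC_MBIT = 1000

cpu_side_mbit = CPU_NODES * CPU_NIC_MBIT
disk_side_mbit = DISK_SERVERS * DISK_NIC_MBIT

print(f"CPU side:  {cpu_side_mbit / 1000:.1f} Gbit/s")
print(f"disk side: {disk_side_mbit / 1000:.1f} Gbit/s")
```

Both totals are well beyond a single Gb uplink, which is why the choice is between one switch with a large backplane and grouping disk servers with the CPU clusters that use them.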

Wireless Networking
- Installing wireless access ports in conference rooms
- DHCP gives IP numbers on separate class C ‘visitors network’
  - Outside firewall
- Staff and other approved people can use PPTP to enter RAL site from wireless

Video Conferencing
- Conference room ISDN
- Personal VRVS
  - UK reflector at RAL
  - Ad-hoc use by bigger meetings
- H.323
  - Regular use in UK HEP
  - Most groups have Zydacron room-based systems or Polycom ViaVideo personal systems
  - No production MCUs yet; using several test services

Other Services
I have concentrated on changes; we still continue to run:
- 2 Compaq AlphaSC systems with Quadrics switches
- AMD-based Beowulf clusters
- Sun CPU and disk for BaBar
- IBM batch farm and fibre-based disk server for CDF

Recent service changes
- TierA Centre for BaBar
- EDG Testbed1
- More about this on Friday

Issues
- Network layout: CPU vs disk
  - Big switch with high-end backplane vs localised use
- Disk reliability
  - Many problems with IBM 75GB IDE
  - Got a batch of 60 replaced by supplier
- AFS
  - How long can we get away with AFS?
  - What platform should we use for OpenAFS?