Test Results of the EuroStore Mass Storage System Ingo Augustin CERNIT-PDP/DM Padova
2Ingo Augustin, CERN EuroStore EC funded ESPRIT IV project A High Performance Storage Project EP March August 2000 Participants: CERN (HEP) DESY (HEP) QSW (Super Computer Manufacturer) HCSA(Space Applications) AMC(Private Hospitals) HNMS(Meteorological Service) TERA(Medical Foundation)
3Ingo Augustin, CERN CERN Role Definition of User Requirements LHC Central Data Recording LHC data storage, administration and handling Prototype Assessment at CERN scalability performance reliability manageability Liaison with HPSS synergy in data exchange (lack of interest of the HPSS consortium)
4Ingo Augustin, CERN User Requirements Real Time Capabilities dedicated disks, tape drives for CDR dynamic allocation of resources quota and priorities for users and groups file size and number only limited by OS ?high reliability and availability tape striping General ?“multi-vendor” ?easy manageability tape vaulting
5Ingo Augustin, CERN The EuroStore Software
6Ingo Augustin, CERN Parallel File System Extended CS-2 like file system Distributed over multiple nodes and disks POSIX compliant UNIX based initial implementation on SUN Solaris, later on DECUnix Resource Management System Planned RAID 5 extension
7Ingo Augustin, CERN Hierarchical Storage Manager IEEE MSS model JAVA based native THREADS (multi-threaded) easily portable Objectivity V5 as metadata DB Communication via ssh with Blowfish encryption Configurable priorities, quotas, allocations, migration
8Ingo Augustin, CERN Prototype Platform Hardware, PFS and HSM in April 99 at CERN Platform (QM-1): 4 dual-processor SUN Enterprise 450 connection via ELAN/ELITE network (~ 250 MB/s) 4 x 4 8GB Cheetah SCSI-disks Gigabit Ethernet interfaces Prototype restrictions movers on QM-1 nodes STK robotics with Redwood or 9840 only
9Ingo Augustin, CERN Standard Test Configuration
10Ingo Augustin, CERN Interest of CERN Not the PFS one HSM demon per PFS only efficient with “non-commodity” network with optimized kernel
11Ingo Augustin, CERN Networking A PFS is only useful with optimized TCP- stack ELAN is far too expensive for large usage (~10 kCHF/port)
12Ingo Augustin, CERN Storage Mangement Common problems in existing systems: Manageability no static environment (new equipment almost all the time) monitoring (efficiency, problem detection) few operators Scalability number of demons database access Reliability hundreds of tape and disk servers fault tolerance Distributed performance (PC/Linux?) heterogeneous? regional centers?
13Ingo Augustin, CERN EuroStore HSM Status of “Proof of principle” Fault tolerant (eg. mover or PVR dead) Java works Data rates are limited by hardware only Data is safe (only one incident of data loss, cause has been fixed) Very good monitoring (jobs can be traced throughout the system) All problems found up to now were bugs, not design faults Several components still to be implemented
14Ingo Augustin, CERN
15Ingo Augustin, CERN Future EuroStore will finish in August 2000 implementation of missing features stress testing Deployment at DESY (see next talk) Proposal for follow-up submitted to EC in Jan commodity Linux, PC, low end storage (DVD-RAM-Jukebox…) network scalability additional clients SAN support (by STK)