Status: Central Storage Services CD/LSC/CSI/CSG June 26, 2007.

Slides:



Advertisements
Similar presentations
Archive Task Team (ATT) Disk Storage Stuart Doescher, USGS (Ken Gacke) WGISS-18 September 2004 Beijing, China.
Advertisements

Data Storage Solutions Module 1.2. Data Storage Solutions Upon completion of this module, you will be able to: List the common storage media and solutions.
XenData SX-520 LTO Archive Servers A series of archive servers based on IT standards, designed for the demanding requirements of the media and entertainment.
XenData SXL-3000 LTO Archive System Turnkey video archive system with near-line LTO capacities scaling from 150 TB to 750 TB, designed for the demanding.
Cloud Computing: Theirs, Mine and Ours Belinda G. Watkins, VP EIS - Network Computing FedEx Services March 11, 2011.
© 2005 DataCore Software Corp SANmelody™ 2.0 IP/SAN FC/SAN Application Support Services Thin Provisioning Automation & Disk Resource Consolidation DataCore,
Application Delivery to the Enterprise Jeremy Holt Director of Information Services.
1 Storage Today Victor Hatridge – CIO Nashville Electric Service (615)
Vorlesung Speichernetzwerke Teil 2 Dipl. – Ing. (BA) Ingo Fuchs 2003.
Internet Backup Michael White Ross Schneider Jordan Divine.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice HP StorageWorks LeftHand update Marcus.
How to Cluster both Servers and Storage W. Curtis Preston President The Storage Group.
Module – 7 network-attached storage (NAS)
Data Storage Willis Kim 14 May Types of storages Direct Attached Storage – storage hardware that connects to a single server Direct Attached Storage.
Virtual Network Servers. What is a Server? 1. A software application that provides a specific one or more services to other computers  Example: Apache.
Copyright Tim Antonowicz, This work is the intellectual property of the author. Permission is granted for this material to be shared for non- commercial,
© Hitachi Data Systems Corporation All rights reserved. 1 1 Det går pænt stærkt! Tony Franck Senior Solution Manager.
Storwize V7000 IP Replication solution explained
© 2010 IBM Corporation Kelly Beavers Director, IBM Storage Software Changing the Economics of Storage.
BACKUP/MASTER: Immediate Relief with Disk Backup Presented by W. Curtis Preston VP, Service Development GlassHouse Technologies, Inc.
IBM TotalStorage ® IBM logo must not be moved, added to, or altered in any way. © 2007 IBM Corporation Break through with IBM TotalStorage Business Continuity.
1 © Copyright 2009 EMC Corporation. All rights reserved. Agenda Storing More Efficiently  Storage Consolidation  Tiered Storage  Storing More Intelligently.
Storage Area Networks The Basics. Storage Area Networks SANS are designed to give you: More disk space Multiple server access to a single disk pool Better.
Storage Survey and Recent Acquisition at LAL Michel Jouvin LAL / IN2P3
SANPoint Foundation Suite HA Robert Soderbery Sr. Director, Product Management VERITAS Software Corporation.
Module 10 Configuring and Managing Storage Technologies.
Best Practices for Backup in SAN/NAS Environments Jeff Wells.
Meeting the Data Protection Demands of a 24x7 Economy Steve Morihiro VP, Programs & Technology Quantum Storage Solutions Group
Planning and Designing Server Virtualisation.
Ray Pasetes, Andy Romero CSS Storage Services CD/CSS Central Services Activities Coordination Meeting 9/19/2006.
NOAA WEBShop A low-cost standby system for an OAR-wide budgeting application Eugene F. Burger (NOAA/PMEL/JISAO) NOAA WebShop July Philadelphia.
Get More out of SQL Server 2012 in the Microsoft Private Cloud environment Steven Wort, Xin Jin Microsoft Corporation.
Virtualization for Storage Efficiency and Centralized Management Genevieve Sullivan Hewlett-Packard
Confidential1 Introducing the Next Generation of Enterprise Protection Storage Enterprise Scalability Enhancements.
Paul Scherrer Institut 5232 Villigen PSI HEPIX_AMST / / BJ95 PAUL SCHERRER INSTITUT THE PAUL SCHERRER INSTITUTE Swiss Light Source (SLS) Particle accelerator.
SCF/FEF Virtualization Strategy Jason Allen August 12, 2009.
Storage Trends: DoITT Enterprise Storage Gregory Neuhaus – Assistant Commissioner: Enterprise Systems Matthew Sims – Director of Critical Infrastructure.
1 U.S. Department of the Interior U.S. Geological Survey Contractor for the USGS at the EROS Data Center EDC CR1 Storage Architecture August 2003 Ken Gacke.
RAL PPD Computing A tier 2, a tier 3 and a load of other stuff Rob Harper, June 2011.
21 st October 2002BaBar Computing – Stephen J. Gowdy 1 Of 25 BaBar Computing Stephen J. Gowdy BaBar Computing Coordinator SLAC 21 st October 2002 Second.
IST Storage & Backup Group 2011 Jack Shnell Supervisor Joe Silva Senior Storage Administrator Dennis Leong.
April 25, 2001HEPiX/HEPNT FERMI SITE REPORT Lisa Giacchetti.
Using NAS as a Gateway to SAN Dave Rosenberg Hewlett-Packard Company th Street SW Loveland, CO 80537
Site-Wide Backup Briefing Ray Pasetes Core Support Services April 16, 2004.
Panoptic Capacity Planning Presented by. "Scotty, I need warp speed in 3 minutes or we're all dead!” (William Shatner - Star Trek II ‘The Wrath of Khan’)
BACKUP/MASTER: Strategies for Archiving Dianne McAdam Senior Analyst and Partner Data Mobility Group.
O AK R IDGE N ATIONAL L ABORATORY U.S. D EPARTMENT OF E NERGY Facilities and How They Are Used ORNL/Probe Randy Burris Dan Million – facility administrator.
CD FY10 Budget and Tactical Plan Review FY10 Tactical Plans for Database Services Nelly Stanfield October 7, 2009 Database Services3425-v1.
CASPUR Site Report Andrei Maslennikov Lead - Systems Amsterdam, May 2003.
Forschungszentrum Karlsruhe in der Helmholtz-Gemeinschaft Implementation of a reliable and expandable on-line storage for compute clusters Jos van Wezel.
TiBS Fermilab – HEPiX-HEPNT Ray Pasetes October 22, 2003.
Queensland University of Technology CRICOS No J VMware as implemented by the ITS department, QUT Scott Brewster 7 December 2006.
1 D0 Taking Stock By Anil Kumar CD/LSCS/DBI/DBA June 11, 2007.
 The End to the Means › (According to IBM ) › 03.ibm.com/innovation/us/thesmartercity/in dex_flash.html?cmp=blank&cm=v&csr=chap ter_edu&cr=youtube&ct=usbrv111&cn=agus.
CERN Computer Centre Tier SC4 Planning FZK October 20 th 2005 CERN.ch.
CERN - IT Department CH-1211 Genève 23 Switzerland t High Availability Databases based on Oracle 10g RAC on Linux WLCG Tier2 Tutorials, CERN,
RHIC/US ATLAS Tier 1 Computing Facility Site Report Christopher Hollowell Physics Department Brookhaven National Laboratory HEPiX Upton,
Lisa Giacchetti AFS: What is everyone doing? LISA GIACCHETTI Operating Systems Support.
1 CEG 2400 Fall 2012 Network Servers. 2 Network Servers Critical Network servers – Contain redundant components Power supplies Fans Memory CPU Hard Drives.
The RAL PPD Tier 2/3 Current Status and Future Plans or “Are we ready for next year?” Chris Brew PPD Christmas Lectures th December 2007.
Extending Auto-Tiering to the Cloud For additional, on-demand, offsite storage resources 1.
U N C L A S S I F I E D LA-UR Leveraging VMware to implement Disaster Recovery at LANL Anil Karmel Technical Staff Member
Enterprise Vitrualization by Ernest de León. Brief Overview.
Open-E Data Storage Software (DSS V6)
Storage Area Networks The Basics.
Integrating Disk into Backup for Faster Restores
iSCSI Storage Area Network
SAN and NAS.
How to prepare for the End of License of Windows Server 2012/R2
Storage Trends: DoITT Enterprise Storage
Presentation transcript:

Status: Central Storage Services CD/LSC/CSI/CSG June 26, 2007

Storage Services 1. File Based Storage NFS/CIFS (BlueArc) – Fast on-site access AFS – Global access, authenticated filesystem 2. Block Based Storage Fibre-Channel connect to SAN 3. Archival Storage Backups 1. File Based Storage NFS/CIFS (BlueArc) – Fast on-site access AFS – Global access, authenticated filesystem 2. Block Based Storage Fibre-Channel connect to SAN 3. Archival Storage Backups

NAS Status Newest Service 2 Production clusters 1. Fermi-Blue (1 st generation cluster) 2. RHEA (2 nd generation cluster) Newest Service 2 Production clusters 1. Fermi-Blue (1 st generation cluster) 2. RHEA (2 nd generation cluster)

NAS Status 3/06 – NAS heads ordered (Fermi-Blue) 5/06 – Pilot deployment SLF, DSG, KITS, PPD and FESS department servers Year 1 projection: 10TB deployed storage 3/06 – NAS heads ordered (Fermi-Blue) 5/06 – Pilot deployment SLF, DSG, KITS, PPD and FESS department servers Year 1 projection: 10TB deployed storage

NAS Status Phase 1 Phase 2 Phase 3 Year 2 Year 3 Year 1 Department Servers, Array Consolidation Rollout to Farms servers Rollout to Farms Workers Projected Rollout

NAS Status Phase 1 Phase 2 Phase 3 Year 2 Year 3 Year 1 Department Servers, Array Consolidation Rollout to Farms servers Rollout to Farms Workers Actual Rollout

NAS Status Actual Year 1 Deployment Q : Pilot Program Early adopters Timing CMS “home” area evaluation Fermigrid NFS issues Actual Year 1 Deployment Q : Pilot Program Early adopters Timing CMS “home” area evaluation Fermigrid NFS issues

NAS Status Actual Year 1 Deployment (cont) Q3 2006: Production Phase 1 in full production CMS + Fermigrid go production (Phase 2) Additional NAS heads purchase (RHEA) Year 1 projection revised to 200TB deployed storage Actual Year 1 Deployment (cont) Q3 2006: Production Phase 1 in full production CMS + Fermigrid go production (Phase 2) Additional NAS heads purchase (RHEA) Year 1 projection revised to 200TB deployed storage

NAS Status Actual Year 1 deployment (cont) Q4/2006 CMS and Fermigrid deploy to worker nodes (Phase-3) Q1/Q D0/CDF/Miniboone begin consolidation of servers into central NAS service Requests for space from LHC, ILC and SDSS Actual Year 1 deployment (cont) Q4/2006 CMS and Fermigrid deploy to worker nodes (Phase-3) Q1/Q D0/CDF/Miniboone begin consolidation of servers into central NAS service Requests for space from LHC, ILC and SDSS

NAS Status NAS Storage Growth Year 1

NAS Status Q storage TB

NAS Status Current customers Experiments CMS, CDF, D0, FermiGrid/OSG, Miniboone ILC, LHC, SDSS, Sciboone(?) Departments CD, Directorate, FESS, ES&H, PPD, VMS Services Scientific Linux (FERMI), CVS, KITS, Alphaflow, Enstore Current customers Experiments CMS, CDF, D0, FermiGrid/OSG, Miniboone ILC, LHC, SDSS, Sciboone(?) Departments CD, Directorate, FESS, ES&H, PPD, VMS Services Scientific Linux (FERMI), CVS, KITS, Alphaflow, Enstore

NAS Status Benefits Stability -- Savings multiplier Effort re-directed towards supporting application Reduced downtime Increased productivity Consolidation (30+ servers/storage arrays) Reduce equipment support costs Reduce power + cooling Benefits Stability -- Savings multiplier Effort re-directed towards supporting application Reduced downtime Increased productivity Consolidation (30+ servers/storage arrays) Reduce equipment support costs Reduce power + cooling

NAS Status Benefits (cont) Ease of use Familiar storage solution – minimal training Flexible Choice of storage tiers, price points Benefits (cont) Ease of use Familiar storage solution – minimal training Flexible Choice of storage tiers, price points

NAS Status Challenges Growth higher than expected – Lun limit Each cluster is limited to 256 luns Each lun limited to 2TB Upgrade to 64TB lun support expected EOY 2008 Criticality of service Central location Offsite DR required? Challenges Growth higher than expected – Lun limit Each cluster is limited to 256 luns Each lun limited to 2TB Upgrade to 64TB lun support expected EOY 2008 Criticality of service Central location Offsite DR required?

NAS Status Challenges (cont) Backup of large data an issue Large data areas >5TB Millions of files Logistics Power Floor space Challenges (cont) Backup of large data an issue Large data areas >5TB Millions of files Logistics Power Floor space

NAS Status FY08 Plans Expansion of service Participate in Tier 3 evaluation Development of better reporting tools FY08 Plans Expansion of service Participate in Tier 3 evaluation Development of better reporting tools

NAS Status More info: Questions? More info: Questions?

SAN Status 272 Fibre-Channel ports 128 ports added to fabric in ‘07 (CMS contribution) Qlogic switches 2Gb Fibre Channel Connections 272 Fibre-Channel ports 128 ports added to fabric in ‘07 (CMS contribution) Qlogic switches 2Gb Fibre Channel Connections

SAN Status 23 storage arrays 12 centrally managed Database Array (3PAR) purchased and tested D0ora2 deployment 7/2/2007 Start retiring 1 st Generation Tier 2 storage arrays (Infortrend) 11 externally managed 23 storage arrays 12 centrally managed Database Array (3PAR) purchased and tested D0ora2 deployment 7/2/2007 Start retiring 1 st Generation Tier 2 storage arrays (Infortrend) 11 externally managed

SAN Status 346TB 156TB centrally managed 190TB externally managed 346TB 156TB centrally managed 190TB externally managed

SAN Status SAN fabric opened up to external members CMS, CDF, D0, Miniboone Must retire LSI storage array End of support (year end 2007) Impacts IMAP/POP, AFS, DSG(CDF) SAN fabric opened up to external members CMS, CDF, D0, Miniboone Must retire LSI storage array End of support (year end 2007) Impacts IMAP/POP, AFS, DSG(CDF)

SAN Status FY07 Plans Additional HDS array NAS storage for SDSS, Windows Migration, DSG Block storage for LSI migration FY07 Plans Additional HDS array NAS storage for SDSS, Windows Migration, DSG Block storage for LSI migration

SAN Status FY07 Plans (Cont) Purchase 2 Nexsan SATAbeasts Replace 4 Infortrend arrays Backup cache disk, DSG RMAN disks Test as possible tier 3 candidates FY07 Plans (Cont) Purchase 2 Nexsan SATAbeasts Replace 4 Infortrend arrays Backup cache disk, DSG RMAN disks Test as possible tier 3 candidates

SAN Status FY08 Plans Additional capacity for 3PAR For sparing DSG migration Additional capacity for NAS Decommission remaining Infortrend arrays Other tier 3 alternatives (nexgen HDS, DDN) Virtualization across arrays FY08 Plans Additional capacity for 3PAR For sparing DSG migration Additional capacity for NAS Decommission remaining Infortrend arrays Other tier 3 alternatives (nexgen HDS, DDN) Virtualization across arrays

SAN Status Questions?

Site Backup Status Service entering 4 th year 10/07 2 Backup Servers Chasm (infrastructure and business) Canyon (experiment) 1 Library (600 slots) 8 SAIT-1 Tape Drives 2 Infortrend Storage arrays TiBS Backup Software Service entering 4 th year 10/07 2 Backup Servers Chasm (infrastructure and business) Canyon (experiment) 1 Library (600 slots) 8 SAIT-1 Tape Drives 2 Infortrend Storage arrays TiBS Backup Software

Site Backup Status 22TB+ data 12,700+ backup volumes 5,506 UNIX/Windows, 7171 AFS, 25 NDMP 452+ clients 18.5% increase in past 6 months (3.7TB) No single volume > 100GB 22TB+ data 12,700+ backup volumes 5,506 UNIX/Windows, 7171 AFS, 25 NDMP 452+ clients 18.5% increase in past 6 months (3.7TB) No single volume > 100GB

Site-Backup Status Typical Daily Backup Timeline (canyon) Incr/NetworkMergesRetry Debug 6:00PM2:00AM1:00PM1:40PM 24 hour window

Site-Backup Status Issues Resolving client backup issues High client volatility Reconfiguration/Renaming/Reinstalls Large delta in data Contacting admins Slow client network performance Issues Resolving client backup issues High client volatility Reconfiguration/Renaming/Reinstalls Large delta in data Contacting admins Slow client network performance

Site-Backup Status Issues (cont) Merge problems Can be difficult to debug Tape drive/Software or combination Cache disk Multiple disk failures Issues (cont) Merge problems Can be difficult to debug Tape drive/Software or combination Cache disk Multiple disk failures

Site-Backup Status Issues (cont) SAIT-1 drive performance issues Tapes written on one drive are slow to read on another Long debug time > 1 hour Usually requires multiple replacements Sony and Spectra investigating Too few Issues (cont) SAIT-1 drive performance issues Tapes written on one drive are slow to read on another Long debug time > 1 hour Usually requires multiple replacements Sony and Spectra investigating Too few

Site-Backup Status FY07 Plans Chasm Canyon IP Disk Cache SAIT-1 Drives SAN Migrate more backups to NDMP Relieve pressure on chasm Migrate clients from canyon to chasm Relieve pressure on canyon LTO-4 NDMP

Site-Backup Status FY07 Plans (cont) Upgrade cache disks Replace aging Infortrend disks Higher performing array RAID 6 FY07 Plans (cont) Upgrade cache disks Replace aging Infortrend disks Higher performing array RAID 6

Site-Backup Status Challenges Desire from users to expand backups Larger backup volumes Larger backup sets Challenges Desire from users to expand backups Larger backup volumes Larger backup sets

Site-Backup Status FY08 Plans Upgrade Servers to Solaris 10 Faster IP stack and Filesystem Upgrade server hardware Faster bus speed Utilize faster cache disk Take advantage of faster filesystem Feed faster tape drives Migrate canyon backups to LTO-4 FY08 Plans Upgrade Servers to Solaris 10 Faster IP stack and Filesystem Upgrade server hardware Faster bus speed Utilize faster cache disk Take advantage of faster filesystem Feed faster tape drives Migrate canyon backups to LTO-4

Site-Backup Status FY08 Plans (cont) Investigate Disk-based library TiBS specific implementation Use common disks as a disk library Synchronous copy to tape (also) Faster restores, possibly backups May increase overall backup system throughput FY08 Plans (cont) Investigate Disk-based library TiBS specific implementation Use common disks as a disk library Synchronous copy to tape (also) Faster restores, possibly backups May increase overall backup system throughput

Site-Backup Status FY08 Plans (cont) Investigate Virtual Tape Library Agnostic solution (not TiBS specific) Asynchronous copy to tape Emulate tape drives and libraries Faster restores and backups Will increase overall backup system throughput Some systems have data-deduplication Inline or post-process FY08 Plans (cont) Investigate Virtual Tape Library Agnostic solution (not TiBS specific) Asynchronous copy to tape Emulate tape drives and libraries Faster restores and backups Will increase overall backup system throughput Some systems have data-deduplication Inline or post-process

Site-Backup Status More information: Questions? More information: Questions?

AFS Status 12 AFS servers ~17TB storage Largest customers: Minos and Web Roughly 8-10% increase per year (Based off number of volumes) Must migrate servers off of LSI storage array and onto HDS Tier 2 storage. 12 AFS servers ~17TB storage Largest customers: Minos and Web Roughly 8-10% increase per year (Based off number of volumes) Must migrate servers off of LSI storage array and onto HDS Tier 2 storage.

AFS Status FY07 Plans Migrate data to HDS Tier 2 disks Migration partially complete (1.8TB installed) Tier 2 storage re-allocated to NAS due to high demand Test Solaris 10 AFS server with ZFS FY07 Plans Migrate data to HDS Tier 2 disks Migration partially complete (1.8TB installed) Tier 2 storage re-allocated to NAS due to high demand Test Solaris 10 AFS server with ZFS

AFS Status FY08 Plans Upgrade Servers to Solaris 10 Faster OS – filesystem and IP stack Newer CPUs – low power Dual Power Supply Upgrade OpenAFS Multi-domain support Support for > 2GB files Promote RO copies to RW copies FY08 Plans Upgrade Servers to Solaris 10 Faster OS – filesystem and IP stack Newer CPUs – low power Dual Power Supply Upgrade OpenAFS Multi-domain support Support for > 2GB files Promote RO copies to RW copies

AFS Status More information: Questions? More information: Questions?