Implementation Review1 Moving Archive Data to the EMC Storage Array March 14, 2003 Faith Abney.

Slides:



Advertisements
Similar presentations
SolidWorks Enterprise PDM Data Loading Strategies
Advertisements

Dec 2, 2014 The HST Online Cache: Changes to the User Experience Karen Levay Faith Abney Mark Kyprianou.
Validata Release Coordinator Accelerated application delivery through automated end-to-end release management.
1 HST Pipeline Project Review March 14, Review Objectives Re-familiarize Project (and others) with production data processing done by STScI Familiarize.
Chapter 12 File Management Systems
Microsoft Dynamics AX Technical Conference 2013
Module 12: Backup and Recovery. Overview Backup and recovery methods available in Oracle and SQL Server 2008 Types of failure Types of recovery Formulating.
Module 5 Understanding SQL Server 2008 R2 Recovery Models.
Implementation Review1 Moving Pre-Archive Pipeline Processing March 14, 2003 Forrest Hamilton/OPUS Ops.
November 2009 Network Disaster Recovery October 2014.
Upcoming Enhancements to the HST Archive Mark Kyprianou Operations and Engineering Division Data System Branch.
Design Completion A Major Milestone System is Presented to Users and Management for Approval.
1 Chapter 12 File Management Systems. 2 Systems Architecture Chapter 12.
Agenda Teams Responsibilities Timeline Homework and next steps.
CAA/CFA Review | Andrea Laruelo | ESTEC | May CFA Development Status CAA/CFA Review ESTEC, May 19 th 2011 European Space AgencyAndrea Laruelo.
Sofia, Bulgaria | 9-10 October SQL Server 2005 High Availability for developers Vladimir Tchalkov Crossroad Ltd. Vladimir Tchalkov Crossroad Ltd.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
Module 7. Data Backups  Definitions: Protection vs. Backups vs. Archiving  Why plan for and execute data backups?  Considerations  Issues/Concerns.
MASSACHUSETTS INSTITUTE OF TECHNOLOGY NASA GODDARD SPACE FLIGHT CENTER ORBITAL SCIENCES CORPORATION NASA AMES RESEARCH CENTER SPACE TELESCOPE SCIENCE INSTITUTE.
Archiving 40+ years of Planetary Mission Data - Lessons Learned and Recommendations K. E. Simmons LASP, University of Colorado, Boulder, CO
Data Management Subsystem Jeff Valenti (STScI). DMS Context PRDS - Project Reference Database PPS - Proposal and Planning OSS - Operations Scripts FOS.
© Dennis Shasha, Philippe Bonnet – 2013 Communicating with the Outside.
Implementation Review1 Deriving Architecture Requirements March 14, 2003.
Chapter 16 Methodology – Physical Database Design for Relational Databases.
Databases March 14, /14/2003Implementation Review2 Goals for Database Architecture Changes Simplify hardware architecture Improve performance Improve.
©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Disaster Recovery Michael Leonard.
7202ICT – Database Administration
©2006 Merge eMed. All Rights Reserved. Energize Your Workflow 2006 User Group Meeting May 7-9, 2006 Planning for Expansion Steve Nevermann.
Project Management Part 6 Project Control. Part 6 - Project Control2 Topic Outline: Project Control Project control steps Measuring and monitoring system.
Reliable Data Movement using Globus GridFTP and RFT: New Developments in 2008 John Bresnahan Michael Link Raj Kettimuthu Argonne National Laboratory and.
Information: Policy, Strategy and Systems Module Overview
Frontiers in Massive Data Analysis Chapter 3.  Difficult to include data from multiple sources  Each organization develops a unique way of representing.
1/14/2005Yan Huang - CSCI5330 Database Implementation – Storage and File Structure Storage and File Structure.
08/30/05GDM Project Presentation Lower Storage Summary of activity on 8/30/2005.
Integrating JASMine and Auger Sandy Philpott Thomas Jefferson National Accelerator Facility Jefferson Ave. Newport News, Virginia USA 23606
NEW FOR 2009 Faster, Easier, Friendlier. Before you start Any student, staff, or faculty member can file an accident/incident report. Accident reporting.
McLean HIGHER COMPUTER NETWORKING Lesson 15 (a) Disaster Avoidance Description of disaster avoidance: use of anti-virus software use of fault tolerance.
Distributed Backup And Disaster Recovery for AFS A work in progress Steve Simmons Dan Hyde University.
Week 7 : Chapter 7 Agenda SQL 710 Maintenance Plan:
SPACE TELESCOPE SCIENCE INSTITUTE Operated for NASA by AURA WFC3 and StarView
CLASS Information Management Presented at NOAATECH Conference 2006 Presented by Pat Schafer (CLASS-WV Development Lead)
SMS Software Distribution. Overview  Explaining How SMS Distributes Software  Managing Distribution Points  Configuring Software Distribution and the.
The concept of RAID in Databases By Junaid Ali Siddiqui.
Data Integrity Issues: How to Proceed? Engineering Node Elizabeth Rye August 3, 2006
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
Chapter 8 System Management Semester 2. Objectives  Evaluating an operating system  Cooperation among components  The role of memory, processor,
20 Copyright © 2008, Oracle. All rights reserved. Cache Management.
Unit – I Presentation. Unit – 1 (Introduction to Software Project management) Definition:-  Software project management is the art and science of planning.
Implementation Review1 Archive Ingest Redesign March 14, 2003.
Microsoft ® Official Course Module 6 Managing Software Distribution and Deployment by Using Packages and Programs.
1 Future Directions in HST Data Processing 19 November 2004.
Hands-On Microsoft Windows Server 2008 Chapter 7 Configuring and Managing Data Storage.
Data Storage and Querying in Various Storage Devices.
Advanced Higher Computing Science
Design Completion A Major Milestone
HST and JWST Pipelines and Reference Files
DADS Ingest and Distribution Support for WFC3 Daryl Swade
Distributed Database Management Systems
Technology for Long-Term Digital Preservation
Monitoring and Controlling the Project
Systems Analysis and Design
Mean Value Analysis of a Database Grid Application
ProtoDUNE SP DAQ assumptions, interfaces & constraints
RAID RAID Mukesh N Tekwani
O.S Lecture 13 Virtual Memory.
Data Systems Environment at SM4
System Construction and Implementation
Managing Work in the New Computing Environment March 14, 2003
RAID RAID Mukesh N Tekwani April 23, 2019
TOTAL COST CONTROL ON CONSTRUCTION PROJECTS
Presentation transcript:

Implementation Review1 Moving Archive Data to the EMC Storage Array March 14, 2003 Faith Abney

Implementation Review2

3 Motivation for Migration EMC has higher availability and throughput  Less downtime  Data gets to users quicker More online processing space available on EMC  Current processing areas not large enough to process all retrieval requests when they are submitted  Requests are routinely queued and held until processing space is available

Implementation Review4 Portion of EMC array to be used for faster access to Archive data Currently all data is on MO platters in jukeboxes  Limited by number of platters mounted  Slow read/write access  Greatly affected by hardware downtime Moving data to the EMC array will eliminate these bottlenecks which are due to data being in a jukebox

Implementation Review5 Portion of EMC array to be used for faster access to Archive data Calling this area the “Data Depot” to avoid confusion with other staging areas DADS will check this area first (DADS 10.1)  if files are available will use them  if not, it will get them from MO Ingest software will be updated to write appropriate data to the depot after write to MO (DADS 10.1)  Data to depot will come from read of MO platter, verifying the write to MO

Implementation Review6 Data going to the Depot All POD files (raw science data) for instruments with OTFR – 1.5 TB  ACS, NICMOS, STIS, WFPC2 All legacy data -.5TB  GHRS, FOS, FOC, WFPC, HSP, AST Calibration reference files -.15TB FUSE data -.3 TB Jitter files, Metadata - <.1TB Mission scheduling data < 50 GB  SMS, MTL, MSC, ORB

Implementation Review7 Data not going to the Depot Some data is rarely requested and does not need faster access:  Calibrated data for which OTFR is available  Raw Engineering data  Log and Ancillary data

Implementation Review8 Methodology: Requirements Any movement of Data from MO platter to EMC will compete with both Ingest and Distribution for Jukebox access.  Migration should not prevent operations from meeting performance goals.  Migration of data off jukebox can proceed as background task  Task becomes high priority if jukebox failure rate begins to impact performance goals in operations.  Migration should allow for selection of highly requested data classes to be segregated and moved first.  POD files (raw science) for OTFR prime candidate to reduce overall retrieval times.

Implementation Review9 Methodology: Options MMM server within NSA  Used to populate MO from older Sony media  Required manual intervention for simultaneous efficient operations and transfer performance  Well understood operationally Bulk copy using primary platters  Same copy mechanism used for other mirror archives (CADC, ECF)  Not a routine operational procedure, manual intervention DADS 10.* beta test environment  V280R loading uncertain  Rapid Test and Build cycle for beta test environment Copy from safe store platters using DADS 9.* test environment.  Highly manual (requires platter loads)  No operational impact (impacts testing schedule only)

Implementation Review10 Methodology: Prescription Perform trade studies to evaluate the quickest and least intrusive method. Develop contingencies to accelerate transfer should jukebox failures become more frequent. Initiate transfer and develop a good understanding of maximum transfer rate before predicting a completion date.  Lessons learned from MO migration  Note that DADS 10.1 delivery makes transfer progress non linear!

Implementation Review11 Schedule DADS 10.1 – scheduled for Spring – contains:  Software to populate Depot on ingest  Software to be able to read from the Depot Tools to migrate existing data to Depot already written  Migration can start as soon as EMC Depot is connected to operational system  Migration completion estimate will be prepared after we complete acceptance testing and get actual performance numbers using the EMC and our tools