Implementation Review: Archive Ingest Redesign, March 14, 2003

Presentation transcript:

Slide 1: Archive Ingest Redesign, March 14, 2003

Slide 2: Archive Ingest Redesign high-level requirements
- Port Ingest system from OpenVMS to Unix
  - Ingest will be the last remaining back-end function on OpenVMS.
  - Ingest will run under Solaris on the 15k.
- Make Ingest scalable for future increase in data volume post-SM4
- Improve throughput and reliability
  - Decouple Ingest from Distribution software for ease of operation and maintenance
- Improve system maintainability
  - Facilitate Ingest changes that are driven by changes in data structure during science instrument lifetimes.

Slide 3: Current OPUS data processing / DADS Ingest interface
- Historically, data processing and archive systems have developed independently.
  - Data processing system went from PODPS to OPUS.
  - Archive system went from DMF to ST-DADS.
  - In the past, these systems have not even operated within the same security environment.
- This paradigm does not work with the current archive philosophy.
  - On-the-Fly Reprocessing (OTFR) requires integration of data processing and archive distribution functionality.
  - Enhanced data processing, particularly database catalogs, requires closer coupling of data processing and archive systems.
- To address this change, software maintenance for data processing and archive systems is now in one branch.

Slide 4: Current OPUS data processing / DADS Ingest interface (cont.)

Slide 5: Ingest Functionality
- Extract metadata from data header keyword values and populate archive science catalog
- Write data files to archive storage media
- Catalog location and properties of files in archive database
- Validate integrity of data files
- Set proprietary status of data files
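The Ingest functions listed on this slide can be illustrated with a minimal Python sketch. It is not the DADS or OPUS implementation: the simplified header-card format, the SHA-256 integrity check, the 365-day proprietary period, the `archive/` location prefix, and all function and field names are assumptions made for illustration. Writing to archive storage media is left out.

```python
import hashlib
from datetime import date, timedelta


def parse_header(cards):
    """Extract KEYWORD = value pairs from simplified FITS-style header cards.

    Simplification (assumption): a '/' always starts the card comment,
    so values themselves must not contain '/'.
    """
    meta = {}
    for card in cards:
        if "=" not in card:
            continue  # skip END, COMMENT, and blank cards
        key, _, rest = card.partition("=")
        value = rest.split("/", 1)[0].strip().strip("'").strip()
        meta[key.strip()] = value
    return meta


def checksum(payload):
    """Hex digest used here as a stand-in for the integrity check."""
    return hashlib.sha256(payload).hexdigest()


def ingest(name, cards, payload, expected_sha256, proprietary_days=365):
    """Validate one file and build its catalog record.

    proprietary_days is a hypothetical default, not a value from the slides.
    """
    if checksum(payload) != expected_sha256:
        raise ValueError(f"integrity check failed for {name}")
    meta = parse_header(cards)
    observed = date.fromisoformat(meta["DATE-OBS"])
    return {
        "file": name,
        "location": f"archive/{name}",  # hypothetical storage location
        "size_bytes": len(payload),
        "sha256": expected_sha256,
        "instrument": meta.get("INSTRUME"),
        "release_date": observed + timedelta(days=proprietary_days),
    }


if __name__ == "__main__":
    cards = [
        "INSTRUME= 'WFPC2   '           / instrument name",
        "DATE-OBS= '2003-03-14'",
        "END",
    ]
    data = b"example payload"
    print(ingest("example.fits", cards, data, checksum(data)))
```

Keeping the integrity check ahead of the metadata extraction means a corrupted file never produces a catalog record in this sketch.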

Slide 6: Goals of Ingest Redesign project
- Make Ingest more compatible with current science instrument design
  - It is almost impossible to enhance the fragile OpenVMS DADS system for new science instruments without breaking existing functionality.
- Bring Ingest requirements up to date
  - No longer support GEIS format in archive
  - Create final archive for HST first-generation science instruments
  - No ingest of raw engineering data or subset engineering data
  - CCS is now the HST engineering data archive
- Improve operator control of the system

Slide 7: Status of Ingest Redesign project
- Ingest Ops Concept complete and distributed on February 20, 2003
- Requirement definition in progress

Slide 8: Highlights of Ingest Ops Concept
- Represents a significant simplification in the data system architecture
- Deploy Ingest as a natural extension of data processing pipelines.
- Build Ingest on OPUS architecture
  - OPUS software system has over 7 years of operational experience on HST
  - Risk mitigated by using a proven architecture
  - Time to deployment will be reduced
- Consistent with JWST concept for data processing and archive systems
  - Same software will be used for both HST and JWST

Slide 9: Highlights of Ingest Ops Concept (cont.)

Slide 10: Highlights of Ingest Ops Concept (cont.)
- Reduces amount of data shuffling and conversions between different software systems
  - E.g., current WFPC2 science data processing pipeline

Slide 11: Highlights of Ingest Ops Concept (cont.)
- Reduces amount of data shuffling and conversions between different software systems (cont.)
  - Future WFPC2 science data processing pipeline

Slide 12: Benefits of Ops Concept
- All operations on data handled in a single data flow.
  - Create FITS file, populate header keyword values, extract metadata from keyword values, populate science component of archive catalog
  - No duplication of development effort or functionality
  - Consistent development, testing, and operations help ensure quality of archive catalog
- Facilitates easier delivery of header changes
  - Keyword changes can be built, tested, and deployed within a single subsystem
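The single data flow described on this slide might look roughly like the following Python sketch, in which header population, metadata extraction, and catalog population are stages of one pipeline. The table name `science`, the keyword-to-column mapping, the raw input fields, and the use of an in-memory SQLite database are all assumptions for illustration; the slides do not describe the actual catalog schema or database.

```python
import sqlite3

# Hypothetical mapping from header keyword to catalog column.
CATALOG_COLUMNS = {
    "ROOTNAME": "dataset",
    "INSTRUME": "instrument",
    "EXPTIME": "exptime",
}


def populate_header(raw):
    """Stage 1: derive header keyword values from raw observation fields."""
    return {
        "ROOTNAME": raw["id"].upper(),
        "INSTRUME": raw["instrument"],
        "EXPTIME": float(raw["exposure_s"]),
    }


def extract_metadata(header):
    """Stage 2: pick the catalog columns out of the header keywords."""
    return {column: header[keyword] for keyword, column in CATALOG_COLUMNS.items()}


def populate_catalog(conn, row):
    """Stage 3: insert one row into the science catalog table."""
    conn.execute(
        "INSERT INTO science (dataset, instrument, exptime) "
        "VALUES (:dataset, :instrument, :exptime)",
        row,
    )


def run_pipeline(conn, raw):
    """Run all stages in one flow so header and catalog cannot drift apart."""
    header = populate_header(raw)
    populate_catalog(conn, extract_metadata(header))
    return header


def new_catalog():
    """Create an in-memory catalog with the hypothetical science table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE science (dataset TEXT PRIMARY KEY, instrument TEXT, exptime REAL)"
    )
    return conn


if __name__ == "__main__":
    conn = new_catalog()
    run_pipeline(conn, {"id": "example01", "instrument": "WFPC2", "exposure_s": "300"})
    print(conn.execute("SELECT dataset, instrument, exptime FROM science").fetchall())
```

Because the catalog row is derived from the same header dictionary that would be written to the file, a keyword change in this sketch is made in one place and flows through to the catalog, which is the point the slide makes about building, testing, and deploying keyword changes within a single subsystem.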

Slide 13: Benefits of Ops Concept (cont.)
- Decouples Ingest and Distribution software
  - Although both will utilize much of the same hardware, such as the Data depot, 15k, and database
- Provides opportunity for consolidation of OPUS- and DADS-based operator tools
- Provides opportunity to automate data validation

Slide 14: Ingest Redesign Schedule
- Ingest Operational Concept complete and distributed on February 20, 2003
- Requirement specification in progress
  - To be completed by April 15, 2003
- The remainder of the schedule is very preliminary pending requirement scoping and build planning
  - Design review: June 2003
  - Phased development in OPUS builds between June 2003 and March 2004
  - System tests: March – April 2004
  - Deploy system: May 2004

Slide 15: Summary of Data Systems software ports to Solaris
- Over the last few years, HST data processing systems have been ported from OpenVMS to Solaris:
  - OPUS infrastructure
    - Ported to Unix for FUSE – February 1998
    - Current version tested under Solaris
  - HST Science Instrument pipeline applications
    - Ported to Tru64 Unix – October 1999
    - Testing on Solaris in progress, minor changes anticipated
  - HST Engineering Data Processing pipelines
    - Ported to Solaris – February 2003

Slide 16: Summary of Data Systems software ports to Solaris (cont.)
- HST archive systems port from OpenVMS to Solaris in progress:
  - Data Distribution system
    - Completion expected in summer 2003
  - Archive Ingest system
    - Completion expected in spring 2004
- With completion of the Archive Ingest System redesign project, all data systems will be running under Solaris.
- No other major system enhancement projects expected through end of HST mission.