Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009.

Slides:



Advertisements
Similar presentations
Depositing e-material to The National Library of Sweden.
Advertisements

OASIS Reference Model for Service Oriented Architecture 1.0
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
7M701 1 Software Engineering Object-oriented Design Sommerville, Ian (2001) Software Engineering, 6 th edition: Chapter 12 )
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Understanding Metamodels. Outline Understanding metamodels Applying reference models Fundamental metamodel for describing software components Content.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
1/31 CS 426 Senior Projects Chapter 1: What is UML? Chapter 2: What is UP? [Arlow and Neustadt, 2005] January 22, 2009.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Applying the ISO RM-ODP Standard in e-Government B. Meneklis 1, A. Kaliontzoglou 2,3, D. Polemi 1, C. Douligeris 1 1 University of Piraeus, Department.
Information Retrieval in Distributed Environments Based on Context- Aware, Proactive Documents Current Research Information Systems (CRIS 2002) August.
Web Service Architecture Part I- Overview and Models (based on W3C Working Group Note Frank.
1 CS 426 Senior Projects Chapter 1: What is UML? Chapter 2: What is UP? [Arlow and Neustadt, 2002] January 26, 2006.
Architectural Design Establishing the overall structure of a software system Objectives To introduce architectural design and to discuss its importance.
Chapter 10 Architectural Design
Implementation Yaodong Bi. Introduction to Implementation Purposes of Implementation – Plan the system integrations required in each iteration – Distribute.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
An Introduction to Software Architecture
Ocean Observatories Initiative Sensing and Acquisition (SA) Subsystem Overview Michael Meisinger September 29, 2009.
Ocean Observatories Initiative Common Execution Infrastructure (CEI) Overview Michael Meisinger September 29, 2009.
OOI CyberInfrastructure: Technology Overview - Hyrax January 2009 Claudiu Farcas OOI CI Architecture & Design Team UCSD/Calit2.
Categories of Vocabulary Compatibility Dmitry Lenkov Oracle.
Lecture 9: Chapter 9 Architectural Design
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 10Slide 1 Architectural Design l Establishing the overall structure of a software system.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
FI-CORE Data Context Media Management Chapter Release 4.1 & Sprint Review.
SWIM-SUIT Information Models & Services
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
95-843: Service Oriented Architecture 1 Master of Information System Management Service Oriented Architecture Lecture 3: SOA Reference Model OASIS 2006.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger September 29, 2009.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Common Operating Infrastructure Subsystem Michael Meisinger Life Cycle.
Chapter 10 Analysis and Design Discipline. 2 Purpose The purpose is to translate the requirements into a specification that describes how to implement.
1 Open Ontology Repository: Architecture and Interfaces Ken Baclawski Northeastern University 1.
National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center Persistent Management of Distributed Data Reagan W. Moore.
OOI CyberInfrastructure: Data Management Architecture Specification Workshop June 30-July 1, 2008 Matthew Arrott, Ingolf Krueger, Claudiu Farcas, Emilia.
Ocean Observatories Initiative OOI Cyberinfrastructure Overview Matthew Arrott VMware Presentation March 5, 2010.
10/24/09CK The Open Ontology Repository Initiative: Requirements and Research Challenges Ken Baclawski Todd Schneider.
Database Environment Chapter 2. Data Independence Sometimes the way data are physically organized depends on the requirements of the application. Result:
Data Management Planning Session Kevin Gomes Michael Meisinger Arcot Rajasekar Michael Wan October 19, 2007.
1 Software Engineering: A Practitioner’s Approach, 6/e Chapter 10a: Architectural Design Software Engineering: A Practitioner’s Approach, 6/e Chapter 10a:
Ocean Observatories Initiative OOI Cyberinfrastructure Data Management Michael Meisinger & David Stuebe OOI Cyberinfrastructure Life Cycle Objectives Milestone.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Metadata “Data about data” Describes various aspects of a digital file or group of files Identifies the parts of a digital object and documents their content,
Slide 1 Service-centric Software Engineering. Slide 2 Objectives To explain the notion of a reusable service, based on web service standards, that provides.
REST By: Vishwanath Vineet.
OOI Cyberinfrastructure and Semantics OOI CI Architecture & Design Team UCSD/Calit2 Ocean Observing Systems Semantic Interoperability Workshop, November.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
Ocean Observatories Initiative Common Operating Infrastructure (COI) Overview Michael Meisinger, Munindar Singh September 29, 2009.
1 SOA Seminar Seminar on Service Oriented Architecture SOA Reference Model OASIS 2006.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. SOA-RM Overview and relation with SEE Adrian Mocan
Slide 1 Chapter 8 Architectural Design. Slide 2 Topics covered l System structuring l Control models l Modular decomposition l Domain-specific architectures.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Databases and DBMSs Todd S. Bacastow January 2005.
OAIS Producer (archive) Consumer Management
Flexible Extensible Digital Object Repository Architecture
CS 501: Software Engineering Fall 1999
Flexible Extensible Digital Object Repository Architecture
Data, Databases, and DBMSs
Database Systems Instructor Name: Lecture-3.
Metadata The metadata contains
OOI Cyberinfrastructure Integrated Elaboration Demo
Toward an Ontology-Driven Architectural Framework for B2B E. Kajan, L
Presentation transcript:

Ocean Observatories Initiative Data Management (DM) Subsystem Overview Michael Meisinger September 29, 2009

2 OOI CI Kick-Off Meeting Sept 9-11, 2009 Outline Subsystem Architecture Overview Scope of Release 1 Selected Components –Data Distribution based on the Exchange –Data Store as a Service

3 OOI CI Kick-Off Meeting Sept 9-11, 2009 Data Distribution w/ Exchange Context of DM within CI Exchange handles Data distribution

4 OOI CI Kick-Off Meeting Sept 9-11, 2009 Data Processing and Availability Multiple aspects of data management Data processing and analysis at various levels of abstraction Data distribution critical to global scientific research

5 OOI CI Kick-Off Meeting Sept 9-11, 2009 Requirements Focus on High risk requirements The CI shall implement an OOI-standard metadata model for resources The OOI-standard metadata model shall support a description of physical resource behavior The OOI-standard metadata model shall support a description of physical resource content The OOI-standard metadata model shall support a syntactic description for the content of an information resource The OOI-standard metadata model shall support a semantic description for the content of an information resource The OOI-standard metadata model shall support tracking of resource provenance The OOI-standard metadata model shall support tracking of quality The OOI-standard metadata model shall support tracking of context The OOI-standard metadata model shall support tracking of correspondence The OOI-standard metadata model shall support tracking of citation The OOI-standard metadata model shall support tracking of lineage The OOI-standard metadata model shall be extensible The CI shall provide semantic services to support ontological representations and relationships The semantic services shall utilize domain-specific vocabularies A user interface to define vocabulary terms shall be provided The vocabularies shall be extensible The semantic services shall recommend new terms to enter into the vocabulary The semantic services shall implement an ontological language The semantic services shall implement an ontological engine The CI shall provide persistent archive services The persistent archive services shall be data format agnostic The persistent archive services shall be subject to policy The persistent archive services shall preserve all associations between data and metadata The persistent archive services shall ingest data independent of delivery order The persistent archive services shall guarantee the integrity of archived data The persistent archive services shall support distributed data repositories The persistent archive services shall support federation The persistent archive services shall support data versioning The persistent archive services shall acknowledge requests for data and provide an estimate for response time

6 OOI CI Kick-Off Meeting Sept 9-11, 2009 Scope of Release 1 Common data and metadata model –Resource metadata, behavior, lifecycle, content, provenance, lineage, citation, quality, context, correspondence –Extensible vocabularies and ontologies –Data formats (syntax and semantics) Dynamic data distribution services –Pub/sub, topics, processing chaining, sequestration Data catalog and repository –Discovery, metadata management Persistent archive services –Repository management, common repository framework, ingestion services, long-term archival

7 OOI CI Kick-Off Meeting Sept 9-11, 2009 DM Functional Components DX Prototype Data Exchange (DX) prototype barely touches the Ingestion/Transformation/ Exchange/Preservation in the context of a Data distribution model DX strongly informs further refinements of the DM architecture and technology choices

8 OOI CI Kick-Off Meeting Sept 9-11, 2009 Information Container Model Encapsulates all kinds of information resources, such as: scientific data, user identities, process definitions, virtual machine images, etc. Multiple levels of meta-data Separation of concerns between Information services

9 OOI CI Kick-Off Meeting Sept 9-11, 2009 Ingestion Provides basic mechanisms for identifying the data streams and formats, parsing the content and identifying the associated meta-data, adding version information, and registering the streams with a ISN Repository

10 OOI CI Kick-Off Meeting Sept 9-11, 2009 Ingestion Service Data Model Relationship between the constituents of the Ingestion Service and the Information Container Model

11 OOI CI Kick-Off Meeting Sept 9-11, 2009 Transformation Service Data Model Relationship between the constituents of the Transformation Service and the Information Container Model

12 OOI CI Kick-Off Meeting Sept 9-11, 2009 Preservation Service Data Model Relationship between the constituents of the Preservation Service and the Information Repository Model

13 OOI CI Kick-Off Meeting Sept 9-11, 2009 Scientific Data Transport As DAP evolves, Unidata’s CDM may be its successor* –OpenDAP –netCDF –HDF5 * Comparison available at: Currently DAP as canonical form

14 OOI CI Kick-Off Meeting Sept 9-11, 2009 Data Store as Service Exchange makes data transport possible and physical location of data becomes transparent to application Storage mechanisms abstracted to improve flexibility Ability to choose the best technology for the available platform that fits the intended purpose Multiple different storage “back-ends” possible Attribute Store prototype as the predecessor to a storage architecture

15 OOI CI Kick-Off Meeting Sept 9-11, 2009 Attribute Store generic repository of information organized around key + value pairs intended to provide fast, reliable data storage and retrieval for lightweight data elements (not a full-blown SQL engine). Decomposition: –Command Processor – interfaces with other OOI entities and abstracts from Repository technology –Repository – stores the actual content in using the best technology available for the selected platform –Specification – describes Repository and how to store/retrieve/match elements to/from Repository

16 OOI CI Kick-Off Meeting Sept 9-11, 2009 Attribute Store - Design Fundamental Interaction Pattern Internal Interaction Pattern for the WRITE Cmd. Command Set

17 OOI CI Kick-Off Meeting Sept 9-11, 2009 Data Representation Data Representation/Encoding Standards –Processing –Transport –Storage Many choices… with overlapping capabilities

Technology Mapping

19 OOI CI Kick-Off Meeting Sept 9-11, 2009 Thanks !

20 OOI CI Kick-Off Meeting Sept 9-11, 2009 DM Components Base is DM FDR presentation Data Distribution based on the Exchange –Data Exchange architecture after services OV2 slide as example for a data distribution (vs storage model, the older model); real architecture has not been chosen; DX strongly informs. Covers Ingestion, Transformation, Preservation in the context of a Data distribution model –DAP as canonical form for transport of data. For given streams there are canonical forms (e.g. DAP), but not for the system in general (i.e. a database). That’s why we chose the new model. Be aware that the underlying data model of DAP is in evolution. Unidata CDM. Insert a few references to these models. –Reference to encoding formats, FIPA header –Query against the past (e.g. archive query) or the future (e.g. subscriptions). Pointer to SQLstream prototype Data Store as a Service –Attribute store as the predecessor to a storage architecture –Model, commands

21 OOI CI Kick-Off Meeting Sept 9-11, 2009 FIPA Provides valuable models for –Communication patterns –Message structure

22 OOI CI Kick-Off Meeting Sept 9-11, 2009 Subsystem Data and Information Access –Search & Navigation –External observatory access (IOOS, Neptune Canada, …) Transformation and Mediation –Attribution & Association –Aggregation –Syntactical Transformation –Ontology-based mediation between vocabularies Dynamic Data/Information Distribution –Persistent Archive –Information Catalog & Repository