Oceanographic Data Provenance Tracking with the Shore Side Data System Mike McCann, Kevin Gomes International Provenance and Annotation Workshop June 18,

Slides:



Advertisements
Similar presentations
MDS4 Roadmap Items Laura Pearlman USC Information Sciences Institute.
Advertisements

Copyright MBARI 2003 Managing Sensor Network Configuration and Metadata In Ocean Observatories Using Instrument Pucks Kent L. HEADLEY, Thomas C. O’REILLY,
Copyright MBARI 2004 MBARI-CIMT Mooring EARTH Workshop July 7, 2004 Kent Headley, Software Engineer, MBARI.
Software Modeling SWE5441 Lecture 3 Eng. Mohammed Timraz
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
ITIL: Service Transition
SENSORS and PUCK Duane Edgington, Bob Herlien, Kevin Gomes, Tom O’Reilly MBARI Duane Edgington, Bob Herlien, Kevin Gomes, Tom O’Reilly MBARI.
CoralReef:Analysis Tools platform for passive network monitoring collection of coral tools
Summary Role of Software (1 slide) ARCS Software Architecture (4 slides) SNS -- Caltech Interactions (3 slides)
Components of an Integrated Environmental Observatory Information System Cyberinfrastructure to Support Publication of Water Resources Data Jeffery S.
PAWN: A Novel Ingestion Workflow Technology for Digital Preservation
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 1: Introduction to Decision Support Systems Decision Support.
© , Michael Aivazis DANSE Software Issues Michael Aivazis California Institute of Technology DANSE Software Workshop September 3-8, 2003.
Ch 12 Distributed Systems Architectures
A Framework for Distributed Preservation Workflows Rainer Schmidt AIT Austrian Institute of Technology iPres 2009, Oct. 5, San.
SIMULATING ERRORS IN WEB SERVICES International Journal of Simulation: Systems, Sciences and Technology 2004 Nik Looker, Malcolm Munro and Jie Xu.
March 2004 At A Glance ITOS is a highly configurable low-cost control and monitoring system. Benefits Extreme low cost Database driven - ITOS software.
This chapter is extracted from Sommerville’s slides. Text book chapter
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
PerfSONAR Client Construction February 11 th 2010, APAN 29 – perfSONAR Workshop Jeff Boote, Assistant Director R&D.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
GRID job tracking and monitoring Dmitry Rogozin Laboratory of Particle Physics, JINR 07/08/ /09/2006.
INFORMATION SYSTEM APPLICATIONS System Development Life Cycle.
MBARI’s Shore Side Data System From Ships, ROVs, Moorings, AUVs, & ? To Bytes, Plots, Pictures, Samples, & Video.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
MBARI’s SSDS Data Management for Ocean Observatories Brian Schlining ブライアン シュリニング.
Kevin Gomes, SENSORS Overview and Architecture OOI Cyberinfrastructure: Sensing and Acquisition Workshop San Diego August 5-7, 2008.
An Introduction to Software Architecture
ECHO DEPository Project: Highlight on tools & emerging issues The ECHO DEPository Project is a 3-year digital preservation research and development project.
Components of Database Management System
Marine & Aquatic Sensor Arrays Ken Johnson Monterey Bay Aquarium Research Institute.
SSDS: Data for Science A Walkthrough of Proposed SSDS Capabilities 4 April 2002 John Graybeal.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
U.S. Department of the Interior U.S. Geological Survey Management of Oceanographic time-series data at the Woods Hole Coastal and Marine Science Center.
AUV CTD Time Series (900200) 09 June Project Goals Automate and sustain observations in Monterey Bay Increase the spatial resolution of regular.
ASI-Eumetsat Meeting Matera, 4-5 Feb CNM Context Matera, February 4-5, 20092ASI-Eumetsat Meeting.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Tom O’Reilly Monterey Bay Aquarium Research Institute.
MBARI Observatory Software Software Infrastructure and Applications for MOOS K. Headley January 2011.
MobileMAN Internal meetingHelsinki, June 8 th 2004 NETikos activity in MobileMAN project Veronica Vanni NETikos S.p.A.
CERN – European Organization for Nuclear Research Administrative Support - Internet Development Services CET and the quest for optimal implementation and.
Large Scale Nuclear Physics Calculations in a Workflow Environment and Data Provenance Capturing Fang Liu and Masha Sosonkina Scalable Computing Lab, USDOE.
Kevin Gomes and John Graybeal, MBARI MBARI’s SSDS OOI Cyberinfrastructure: San Diego June 30, 2008.
9 Systems Analysis and Design in a Changing World, Fourth Edition.
3/30/04 16:14 1 Lessons Learned CERES Data Management Presented to GIST 21 “If the 3 laws of climate are calibrate, calibrate, calibrate, then the 3 laws.
MOOS SSDS Data Access Features A Discussion with MBARI’s Science Data Users.
Project Database Handler The Project Database Handler is a brokering application that mediates interactions between the project database and the external.
1 Registry Services Overview J. Steven Hughes (Deputy Chair) Principal Computer Scientist NASA/JPL 17 December 2015.
Background Real-time environmental monitoring is a field garnering an ever-increasing amount of attention. The ability for sensors to make and publish.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
ClearQuest XML Server with ClearCase Integration Northwest Rational User’s Group February 22, 2007 Frank Scholz Casey Stewart
1 Chapter 12 Configuration management This chapter is extracted from Sommerville’s slides. Text book chapter 29 1.
Event and Feature Catalogs in the Virtual Solar Observatory Joseph A. Hourclé and the VSO Team SP54A-07 : 2008 May 30.
Kevin Gomes, MBARI MBARI Data Architecture OOI Cyberinfrastructure: Data Product Generation Workshop San Diego May 20-21, 2008.
MBARI Data Management Initiatives John Graybeal Information Applications Group Lead.
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
Working Group 4 Data and metadata lifecycle management  1. Policies and infrastructure for data and metadata changes  2. Supporting file and data formats.
1 CP586 © Peter Lo 2003 Multimedia Communication Multimedia Development Team.
Sponsored by the National Science Foundation 1 March 15, 2011 GENI I&M Update: I&M Service Types, Arrangements, Assembling Goals Architecture Overview.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
Maintaining and Updating Windows Server 2008 Lesson 8.
Analysis and Reporting Toolset (A&RT): Lessons on how to develop a system with an external partner David Smith AstraZeneca.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
Architecture Concept Documents
Connected Vehicle Reference Implementation Architecture (CVRIA)
An Introduction to Software Architecture
CoralReef:Analysis Tools platform for passive network monitoring collection of coral tools
Mark Quirk Head of Technology Developer & Platform Group
Presentation transcript:

Oceanographic Data Provenance Tracking with the Shore Side Data System Mike McCann, Kevin Gomes International Provenance and Annotation Workshop June 18, 2008

Outline Motivation Motivation Monterey Bay Aquarium Research Institute (MBARI) Projects: Monterey Bay Aquarium Research Institute (MBARI) Projects: –Monterey Ocean Observing System (MOOS) –Shore-Side Data System (SSDS) Data Model Data Model Application framework Application framework Operational details Operational details –Instrument configuration –Data processing software

MUSE data Diversity of platforms & sensors Diversity of platforms & sensors Post-experiment organization Post-experiment organization Document for later use => FGDC Document for later use => FGDC Motivation for a better design Motivation for a better design

Identifying Requirements Configuring Instruments Configuring Instruments –Many different instrument –Many different manufacturers –Varied hardware, communication and metadata interfaces –However, all must interact with infrastructure Instrument Have Lifecycles Instrument Have Lifecycles –Changed for normal maintenance, cleaning, failure –Can also be change configuration depending on science goal/experiment –In-situ re-configuration –Instrument-Infrastructure relationship must be kept intact and, in fact, tracked.

Identifying Requirements Metadata Must Tie To Data Metadata Must Tie To Data –Huge variation in data formats that users must handle –Traditionally added on after-the-fact –Not scalable and error prone

Identifying Requirements Instruments Can Cross Observatories Instruments Can Cross Observatories –Some instrument supplies are limited –Experiment configuration Metadata and Data Can Cross Observatories Metadata and Data Can Cross Observatories –Example: data processing for instruments should not have to be re-written

Software Middleware for MOOS Shore network MOOS moored network Shore Side Data System TCP/IP via satellite SurfaceBenthic-1Benthic-2 Instrument services Instrument services Instrument services Instrument GUI Telemetry retriever

SSDS: Metadata and Data Management Requirements for SSDS (partial list) Requirements for SSDS (partial list) –Capture observatory and instrument lifecycle data –Return instrument data in its native (“raw”) format –Simple analysis tools for viewing data –Capture and archive processed data products and associated metadata, maintaining known relationships between data sets – Convert data to common formats

SSDS: Metadata and Data Management Data Access Services Metadata Access Services Aggregate HTTP-based Services (SOA) Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Domain Logic API

SSDS Data Model

Data Container & Data Producer attributes Recording… Recording… –What –Where –When –Relations

Satisfying Requirements: Configuration Data Access Services Metadata Access Services Aggregate HTTP-based Services Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Business Logic API

Satisfying Requirements : Dynamic Lifecycle Data Access Services Metadata Access Services Aggregate HTTP-based Services Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Business Logic API

Satisfying Requirements : Resource Mgmt. Data Access Services Metadata Access Services Aggregate HTTP-based Services Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Business Logic API

Satisfying Requirements : Health Monitoring Data Access Services Metadata Access Services Aggregate HTTP-based Services Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Business Logic API Subject: SSDS: No recent data stream update from instruments: 1441 A problem has been encountered while checking on the status of the following data streams that SSDS is monitoring: Device ID :: Last update time (in hours) :: Device Name MSE Surface Node :: 2.9 :: Medusa Card

Satisfying Requirements : Metadata/Data Data Access Services Metadata Access Services Aggregate HTTP-based Services Wet SideShore Side Ingest XML Data Metadata , , ,91.1 Instrument Packets (+ infrastructure metadata) SSDS Business Logic API

- Perl Application Programming Interface

- Matlab (works with R2008a) % Import SSDS package import moos.ssds.services.metadata.* % Get Home interface home = moos.ssds.services.metadata.DataProducerAccessUtil.getHome(); % Get Access object dpAccess = home.create(); % Call methods on the Access object dList = dpAccess.findByName('Back', logical(0), 'id','ascending', logical(1)); it = dList.iterator; d = it.next; d.getDevice.getMfgSerialNumber ans = WL Application Programming Interface

SSDS data life cycle Instrument is defined by creating Device record. Instrument is defined by creating Device record. Instrument is configured for deployment by writing Deployment, DataContainer, RecordDescription, RecordVariable XML. Instrument is configured for deployment by writing Deployment, DataContainer, RecordDescription, RecordVariable XML. Instrument is deployed. XML metadata is ingested by SSDS, data packets flow into the Instrument Packets database. Instrument is deployed. XML metadata is ingested by SSDS, data packets flow into the Instrument Packets database. Automated DataStream processing software consumes the data packets producing a NetCDF file for each instrument’s data. A DataProducer record is created linking the input DataStream to the output DataFile. Automated DataStream processing software consumes the data packets producing a NetCDF file for each instrument’s data. A DataProducer record is created linking the input DataStream to the output DataFile.

SSDS data life cycle (cont.) Follow-on data processing runs consume instrument NetCDF DataContainers producing combined data sets and graphical products. Metadata from SSDS is extracted as needed to fully describe data in all the NetCDF data sets. Follow-on data processing runs consume instrument NetCDF DataContainers producing combined data sets and graphical products. Metadata from SSDS is extracted as needed to fully describe data in all the NetCDF data sets. User uses the data with all the needed information to assess its suitability for a particular use. User uses the data with all the needed information to assess its suitability for a particular use.

SSDS Explorer web application Drill down deployment tree Drill down deployment tree Drill down processing tree Drill down processing tree Used mainly by developers Used mainly by developers

Lessons Learned Solid Interface Definitions Key Solid Interface Definitions Key Web Services Great – Not Always SOAP Web Services Great – Not Always SOAP Policies Are As Important As Interfaces Policies Are As Important As Interfaces Identify The Critical Metadata Identify The Critical Metadata Consume The Critical Metadata Early And Often Consume The Critical Metadata Early And Often Testing Station/Simulator Testing Station/Simulator

Acknowledgements SSDS funded by the David and Lucille Packard Foundation SSDS funded by the David and Lucille Packard Foundation MOOS Leads: Mark Chaffey, Kent Headley MOOS Leads: Mark Chaffey, Kent Headley Operations: Paul Coenen, Ken Heller, Hans Thomas, Duane Thompson Operations: Paul Coenen, Ken Heller, Hans Thomas, Duane Thompson Science: Jim Barry, Francisco Chavez, Charlie Paull, Erich Rienecker, John Ryan Science: Jim Barry, Francisco Chavez, Charlie Paull, Erich Rienecker, John Ryan SSDS Development Team: Andrew Chase, Mike McCann, Brian Schlining, Rich Schramm SSDS Development Team: Andrew Chase, Mike McCann, Brian Schlining, Rich Schramm