MBARI Data Management Initiatives John Graybeal Information Applications Group Lead.

Slides:



Advertisements
Similar presentations
Tom Lewis Director, Academic & Collaborative Applications University of Washington.
Advertisements

MS CRM Integration WhosOn Service Integration Presentation MS CRM User Group.
SENSORS and PUCK Duane Edgington, Bob Herlien, Kevin Gomes, Tom O’Reilly MBARI Duane Edgington, Bob Herlien, Kevin Gomes, Tom O’Reilly MBARI.
Hydra Partners Meeting March 2012 Bill Branan DuraCloud Technical Lead.
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
1 Chapter 12 Working With Access 2000 on the Internet.
Team: Maya Zalcberg Diana Attar Levona Hershtik Academic advisor: Prof. Ehud Gudes ADD Presentation.
Jacob Boston Josh Pfeifer. Definition of HyperText Transfer Protocol How HTTP works How Websites work GoDaddy.com OSI Model Networking.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Component and Deployment Diagrams
Tools and Services for the Long Term Preservation and Access of Digital Archives Joseph JaJa, Mike Smorul, and Sangchul Song Institute for Advanced Computer.
Enterprise Search With SharePoint Portal Server V2 Steve Tullis, Program Manager, Business Portal Group 3/5/2003.
Mike Smorul Saurabh Channan Digital Preservation and Archiving at the Institute for Advanced Computer Studies University of Maryland, College Park.
Tutorial 11: Connecting to External Data
System Design/Implementation and Support for Build 2 PDS Management Council Face-to-Face Mountain View, CA Nov 30 - Dec 1, 2011 Sean Hardman.
Chapter 5 Application Software.
NETWORK CENTRIC COMPUTING (With included EMBEDDED SYSTEMS)
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
MBARI’s Shore Side Data System From Ships, ROVs, Moorings, AUVs, & ? To Bytes, Plots, Pictures, Samples, & Video.
Metadata Guides for Smarties Marine Metadata Initiative URL:
CPS120: Introduction to Computer Science The World Wide Web Nell Dale John Lewis.
MBARI’s SSDS Data Management for Ocean Observatories Brian Schlining ブライアン シュリニング.
1. 2 introductions Nicholas Fischio Development Manager Kelvin Smith Library of Case Western Reserve University Benjamin Bykowski Tech Lead and Senior.
Kevin Gomes, SENSORS Overview and Architecture OOI Cyberinfrastructure: Sensing and Acquisition Workshop San Diego August 5-7, 2008.
AON Data Questionnaire Results 21 Respondents Last Updated 27 March 2007 First AON PI Meeting Scot Loehrer, Jim Moore.
OASIS ebXML Registry Standard Open Forum 2003 on Metadata Registries 10:30 – 11:15 January 20, 2003 Kathryn Breininger The Boeing Company Chair, OASIS.
DM_PPT_NP_v01 SESIP_0715_AJ HDF Product Designer Aleksandar Jelenak, H. Joe Lee, Ted Habermann Gerd Heber, John Readey, Joel Plutchak The HDF Group HDF.
STOQS: The Spatial Temporal Oceanographic Query System Mike McCann Abstract Monterey Bay Aquarium Research Institute Architecture Postgres.
Oceanographic Data Provenance Tracking with the Shore Side Data System Mike McCann, Kevin Gomes International Provenance and Annotation Workshop June 18,
NEPTUNE Canada Workshop Oceans 2.0 Project Environment NEPTUNE Canada DMAS Team Victoria, BC February 16, 2009.
SSDS: Data for Science A Walkthrough of Proposed SSDS Capabilities 4 April 2002 John Graybeal.
Indo-US Workshop, June23-25, 2003 Building Digital Libraries for Communities using Kepler Framework M. Zubair Old Dominion University.
PLoS ONE Application Journal Publishing System (JPS) First application built on Topaz application framework Web 2.0 –Uses a template engine to display.
AUV CTD Time Series (900200) 09 June Project Goals Automate and sustain observations in Monterey Bay Increase the spatial resolution of regular.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Tom O’Reilly Monterey Bay Aquarium Research Institute.
MBARI Observatory Software Software Infrastructure and Applications for MOOS K. Headley January 2011.
© Paradigm Publishing Inc. 5-1 Chapter 5 Application Software.
ESIP & Geospatial One-Stop (GOS) Registering ESIP Products and Services with Geospatial One-Stop.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
“Confidential –Internal Halliburton Use Only. © 2004 Halliburton. All Rights Reserved.” Portal Brief OracleAS Portal A component of Oracle Application.
Kevin Gomes and John Graybeal, MBARI MBARI’s SSDS OOI Cyberinfrastructure: San Diego June 30, 2008.
Presented by Scientific Annotation Middleware Software infrastructure to support rich scientific records and the processes that produce them Jens Schwidder.
A radiologist analyzes an X-ray image, and writes his observations on papers  Image Tagging improves the quality, consistency.  Usefulness of the data.
OAI Overview DLESE OAI Workshop April 29-30, 2002 John Weatherley
OAIS Rathachai Chawuthai Information Management CSIM / AIT Issued document 1.0.
MOOS SSDS Data Access Features A Discussion with MBARI’s Science Data Users.
User Profiling using Semantic Web Group members: Ashwin Somaiah Asha Stephen Charlie Sudharshan Reddy.
Strictly Business Using “StrictlyFused” to Create an Extensible Knowledge Portal.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Preface IIntroduction Objectives I-2 Course Overview I-3 1Oracle Application Development Framework Objectives 1-2 J2EE Platform 1-3 Benefits of the J2EE.
Kevin Gomes, MBARI MBARI Data Architecture OOI Cyberinfrastructure: Data Product Generation Workshop San Diego May 20-21, 2008.
1 1 NOAA Office of Ocean Exploration End-to-End Data Management: A Success Story NOAA Tech Conference November 2005 Susan Gottfried National Coastal Data.
Rendering Syndicated Library Content in an Institutional Portal: Integrating MyLibrary into uPortal John Fereira: Cornell University Eric Lease Morgan:
Copyright 2007, Information Builders. Slide 1 iWay Web Services and WebFOCUS Consumption Michael Florkowski Information Builders.
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Physical Oceanography Distributed Active Archive Center THUANG June 9-13, 20089th GHRSST-PP Science Team Meeting GHRSST GDAC and EOSDIS PO.DAAC.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
Collection-Based Persistent Archives Arcot Rajasekar, Richard Marciano, Reagan Moore San Diego Supercomputer Center Presented by: Preetham A Gowda.
5/29/2001Y. D. Wu & M. Liu1 Content Management for Digital Library May 29, 2001.
A Semi-Automated Digital Preservation System based on Semantic Web Services Jane Hunter Sharmin Choudhury DSTC PTY LTD, Brisbane, Australia Slides by Ananta.
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
The CUAHSI Hydrologic Information System Spatial Data Publication Platform David Tarboton, Jeff Horsburgh, David Maidment, Dan Ames, Jon Goodall, Richard.
Session: Towards systematically curating and integrating
DIGITAL LIBRARY.
An ecosystem of contributions
Database Design Hacettepe University
SDMX IT Tools SDMX Registry
Presentation transcript:

MBARI Data Management Initiatives John Graybeal Information Applications Group Lead

Established in 1987 Monterey Bay Aquarium Research Institute David and Lucile Packard Foundation

Santa Cruz Monterey Monterey Canyon MBARI MBARI Location

Monterey Ocean Observing System Suitable for deep ocean or coastal studies Suitable for deep ocean or coastal studies low power, long term moorings and benthic nodes low power, long term moorings and benthic nodes low bandwidth communication links to shore low bandwidth communication links to shore Configurable, re-deployable instruments and platforms (using ships and ROVs) Configurable, re-deployable instruments and platforms (using ships and ROVs) Smart nodes on deployed platforms Smart nodes on deployed platforms some on-board data processing some on-board data processing facilitate autonomous event detection facilitate autonomous event detection perform on-board calculations/detections perform on-board calculations/detections handle responses from shore handle responses from shore

MOOS Concept of Operations Benthic Node Mooring Autonomous Underwater Vehicle (AUV) MBARI

Data Stream Challenge (MUSE)

Data Management Challenge Large number of data sources Large number of data sources Large variety of data sources Large variety of data sources Dynamic systems Dynamic systems  Data sources may appear and disappear  Devices & platforms reconfigured often  Interactions from shore and ship No standard data format No standard data format  Data can be instrument ‘native’  New sources coming on-line all the time  Streams or files, automated or manual

Example: Samples Database

Example: Video and Images 14 years, up to 300 dives/year 14 years, up to 300 dives/year video tapes, hours video tapes, hours frame grabs… => 900,000 annotations frame grabs… => 900,000 annotations How to manage this valuable repository? How to manage this valuable repository? Advanced annotation system Advanced annotation system Detailed knowledge base of concepts Detailed knowledge base of concepts Easy-to-use querying tool Easy-to-use querying tool

Video Annotation and Reference System (VARS)

Notes About SSDS: The Shore Side Data System A MOOS Development Project A MOOS Development Project Goals: low cost, flexible, expandable, reliable Goals: low cost, flexible, expandable, reliable Future systems beyond MOOS (e.g., MARS) Future systems beyond MOOS (e.g., MARS) Now in 3rd year, deploying initial elements Now in 3rd year, deploying initial elements Key Tenets of SSDS Development Key Tenets of SSDS Development Iterative development—improve it as we go Iterative development—improve it as we go Test with real data—new and archival Test with real data—new and archival Build for change—use modular interfaces Build for change—use modular interfaces

Shore Side Data System: Requirements Overview Ingest data in any described format and save it Ingest data in any described format and save it Capture, publish data descriptions (metadata) Capture, publish data descriptions (metadata) Provide standards-based access to data Provide standards-based access to data Raw data, and other common digital formats Raw data, and other common digital formats APIs for common visualization and analysis tools APIs for common visualization and analysis tools User-oriented web interfaces, quick-look plots User-oriented web interfaces, quick-look plots Merge data (different sources & time intervals) Merge data (different sources & time intervals) Support data visualization & quality control Support data visualization & quality control Provide data access security as needed Provide data access security as needed

Shore Side Data System: User Requirements Raw data via device ID pages? (sort of limited) Raw data via device ID pages? (sort of limited) Standard plots like OASIS quality controlled ones? Standard plots like OASIS quality controlled ones? Access data from applications via a DODS URLs? Access data from applications via a DODS URLs? Matlab, Ingrid, Live Access Server, Excel, IDV, Ferret Matlab, Ingrid, Live Access Server, Excel, IDV, Ferret And hopefully, Ocean Data View And hopefully, Ocean Data View Access data via returned data files (e.g., ASCII CSV w/headers) opened within desktop applications? Access data via returned data files (e.g., ASCII CSV w/headers) opened within desktop applications? Excel, ArcView, Ocean Data View Excel, ArcView, Ocean Data View Delivery of data directly into an application? Delivery of data directly into an application? Ability to subset data, for example by time window? Ability to subset data, for example by time window? Ability to merge data from different data sets? Ability to merge data from different data sets?

Data Management at MBARI: SSDS Efforts Infrastructure/model development Infrastructure/model development Ontologies Ontologies Metadata schema Metadata schema Metadata entry/correction/annotation Metadata entry/correction/annotation User interfaces User interfaces Data processing Data processing Visualizations Visualizations Federated access to MBARI data/metadata Federated access to MBARI data/metadata

More MBARI SSDS Tasks Legacy data migration Legacy data migration OASIS, expd etc., Samples, Waypoints, ? OASIS, expd etc., Samples, Waypoints, ? New data sources New data sources MTM II, AUV Sonar, CIMT, … MTM II, AUV Sonar, CIMT, … Outreach (integrating non-SSDS projects) Outreach (integrating non-SSDS projects) Documentation Documentation NEPTUNE NEPTUNE Education Education Operational support Operational support

MOOS/SSDS Architecture (shows data flow) MOOS/SSDS Architecture (shows data flow) Devices Deployed Platform Shore Side Data System User Applications (User Tools) Data Tracking Communications Applications/ Interfaces Archiving Data Presentation Data line 1 more data last data OceanSideShoreSide Portal

SSDS Elements Applications Data Presentation Data line 1 more data last data Ingest Archiving Arriving Data Data Tracker Data Catalog External Data Stores Data For Analysis Web I/F < Requests Data > Shared Descriptions < Requests Data > Metadata (Re)Processed and New Data Sets Automated Data Flow Internal Interfaces On-Demand Interactions

Example SIAM to SSDS Data Flow Portal Mooring SSDS

Example SIAM to SSDS Data Flow A device is connected to a platform, such as a Mooring. Portal Device Mooring SSDS

<RecordVariable name="time" columnIndex="1" format="double" longName="Time(GMT)" units="milliseconds since Jan 01, 1970"/> Example SIAM to SSDS Data Flow The mooring retrieves the metadata from the device. Portal Device Mooring SSDS

Metadata Packet Example SIAM to SSDS Data Flow The metadata is packaged and sent to a portal on shore before any data is sent to shore. Portal DeviceMooring SSDS

Example SIAM to SSDS Data Flow Metadata Packet The portal forwards the metadata to SSDS. Portal DeviceMooring SSDS

DB Example SIAM to SSDS Data Flow SSDS stores the metadata in a database. This allows applications to query for and use data. Portal DeviceMooring SSDS

DB Example SIAM to SSDS Data Flow SSDSPortal DeviceMooring

DB 34,56.234,0.0023,... Example SIAM to SSDS Data Flow The device produces a data record. Portal Device Mooring SSDS

DB Data Packet 34,56.234,0.0023,... Example SIAM to SSDS Data Flow The data is packaged and sent to SSDS. Portal DeviceMooring SSDS

DB VersionID, DeviceID, MetadataID, RecordType, PlatformID, SystemTime, SequenceNumber, DataBuffer(34,56.234,0.0023,…) Serialized Example SIAM to SSDS Data Flow SSDS uses information in the packet to sort and store the data in a ‘raw’ format. Portal DeviceMooring SSDS

DB netCDF Example SIAM to SSDS Data Flow Serialized VersionID, DeviceID, MetadataID, RecordType, PlatformID, SystemTime, SequenceNumber, DataBuffer(34,56.234,0.0023,…) The ‘raw’ data is parsed and stored as netCDF for easier access. Portal DeviceMooring SSDS

Software applications allow users to discover and obtain data in formats useful to the typical MBARI user. (netCDF, text, etc.) DB netCDF Example SIAM to SSDS Data Flow Serialized netcdf parosci { dimensions: time = UNLIMITED ; // (17761 currently) variables: double time(time) ; time:long_name = "Time (GMT)" ; time:units = "seconds since :00:00" ; double depth(time) ; depth:long_name = "depth" ; depth:units = "UNKNOWN" ; // global attributes: :title = "AUV data" ; :created = " T23:34:58Z" ; :history0 = ": Deployment information for parosci.log" ; :deploymentName = " " ; :instrumentId = "3699" ; } Portal DeviceMooring SSDS MBARI Software

DB netCDF Example SIAM to SSDS Data Flow Serialized Software applications also provide simple visual representations of data Portal DeviceMooring SSDS MBARI Software

DB netCDF Example SIAM to SSDS Data Flow Serialized Provide internet access Portal DeviceMooring SSDS MBARI Software Web Pages

Portal DeviceMooring DB SSDS netCDF Existing netCDF Software Example SIAM to SSDS Data Flow Serialized MBARI Software Web Pages Save development time by using existing software applications

SSDS Data Mgt Sequence

AUV Data Sequence Diagram

Metadata Approach (Credit: Dan Davis) XML suitable for MOOS metadata XML suitable for MOOS metadata Enables use of many other tools/software Enables use of many other tools/software But, it looks a little bit user-unfriendly But, it looks a little bit user-unfriendly Use XML-driven GUI technology to create forms to create and display metadata Use XML-driven GUI technology to create forms to create and display metadata Users don’t have to directly read XML Users don’t have to directly read XML It’s there and easy to access if they want it It’s there and easy to access if they want it Bind XML metadata to each device through its puck Bind XML metadata to each device through its puck

Sensor Puckto host computer serial interface During pre-deployment instrument configuration, and test, sensor driver and associated metadata is stored in compact flash memory in puck During pre-deployment instrument configuration, and test, sensor driver and associated metadata is stored in compact flash memory in puck Metadata stored in puck interface

Metadata Schema Design

Metadata User Form Design User interface designer uses schema to build a form for creation, display, access, of metadata instances User interface designer uses schema to build a form for creation, display, access, of metadata instances There may be different forms for different users (e.g. scientific, system, and operational) to create, and display metadata of interest There may be different forms for different users (e.g. scientific, system, and operational) to create, and display metadata of interest

Metadata Form Design

Instrument Configuration Metadata forms are used during device configuration to create metadata that is entered into device puck Metadata forms are used during device configuration to create metadata that is entered into device puck Similarly metadata forms are used during configuration of other system elements, such as platforms, and even communication links. This metadata is maintained in system nodes. Similarly metadata forms are used during configuration of other system elements, such as platforms, and even communication links. This metadata is maintained in system nodes.

Metadata Form Layout

SSDS— Metadata (Object View)

The data source. SSDS tracks: Software or hardware source Unique identifier Manufacturer information References to documentation SSDS— Metadata (Device)

SSDS— Metadata (Deployment) ‘Deployment’ information. SSDS tracks: Where the data was collected. When it was collected. What other data was used. Relation to other deployments

References to the data. SSDS tracks: The data storage location. How to access this data. The deployment that produced this data. SSDS— Metadata (DataContainer)

Format and contents of a DataContainer. SSDS tracks: The contents of a data set. The data format (to allow parsing by software). Descriptive info like units, scale, … SSDS— Metadata (Records)

Metadata and Access: Catalogs and Repositories View From the Shore View From the Shore Many data registries and models Many data registries and models GDC, OBIS, EarthRef, NVODS, … GDC, OBIS, EarthRef, NVODS, … Many standards Many standards Communications protocols: SOAP, OPeNDAP, OBIS, … Communications protocols: SOAP, OPeNDAP, OBIS, … Metadata formats (MIF, XML, NGDC, NetCDF…) Metadata formats (MIF, XML, NGDC, NetCDF…) Metadata ontologies and efforts Metadata ontologies and efforts NGDC, MarineXML, ESRI, Metadata Wranglers NGDC, MarineXML, ESRI, Metadata Wranglers Conclusion: Watch, Learn, Try (Iterate) Conclusion: Watch, Learn, Try (Iterate)

SSDS Data Access Desktop Application: HOOVES Desktop Application: HOOVES Data File Service Data File Service Quick Look Quick Look Metadata Access (and Validation) Metadata Access (and Validation) Metadata Editing Metadata Editing Networked API: Servlet / JSP Pages Networked API: Servlet / JSP Pages Application API (NetCDF): OPeNDAP Application API (NetCDF): OPeNDAP Web Access (NetCDF): Live Access Server Web Access (NetCDF): Live Access Server Archived Files: Direct Access (?) Archived Files: Direct Access (?)

HOOVES Help

HOOVES Mission View

HOOVES Mission Outputs View

HOOVES Mission Resources: Overview

HOOVES Mission Resources: Vehicle

HOOVES Instrument View

SSDS Schedule

Prime Areas for Collaboration Infrastructure/model development Infrastructure/model development Ontologies Ontologies Metadata schema Metadata schema Metadata entry/correction/annotation Metadata entry/correction/annotation User interfaces User interfaces Data processing Data processing Visualizations Visualizations Federated access to data/metadata Federated access to data/metadata Documentation Documentation

IAG Team Kevin Gomes Kevin Gomes John Graybeal John Graybeal Mike McCann Mike McCann Brian Schlining Brian Schlining Rich Schramm Rich Schramm And, a Mystery Guest (To Be Determined) And, a Mystery Guest (To Be Determined) Science Representative to SSDS John Ryan John Ryan