DMAC Data Integration What is it really? Why does it seem frozen in place? How do we get it moving? Steve Hankin (NOAA/PMEL) DMAC = Data Management and.

Slides:



Advertisements
Similar presentations
1 GlobModel The GlobModel study, initial findings and objectives of the day Zofia Stott 13 September 2007.
Advertisements

Integrated Ocean Observing System (IOOS) Data Management and Communication (DMAC) Standards Process Julie Bosch NOAA National Coastal Data Development.
Unisys Weather Information Services Presentation for NWS Partners Meeting Partner Perspective June 2010 Ron Guy, Director Unisys Weather
FP7-Infra : Design studies for European Research Infrastrutures 1st October 2011 – 31st December 2014 Duration 39 months – Periods : 2 (month.
The FI-WARE Project – Base Platform for Future Service Infrastructures OCTOBER 2011 Presentation at proposers day.
TAC Vista Security. Target  TAC Vista & Security Integration  Key customer groups –Existing TAC Vista users Provide features and hardware for security.
1 NODC, Russia GISC & DCPC developers meeting Langen, 29 – 31 March E2EDM technology implementation for WIS GISC development S. Sukhonosov, S. Belov.
Connecting People With Information Conclusions DoD Net-Centric Data Strategy (DS) and Community of Interest (COI) Training For further information .
Regional Weather Tracking Unit Portfolio Presentation Courtney Nielsen.
ICT and Civil ProtectionSenigallia, June 2007 A Service-Oriented Middleware for EU Civil Protection cooperation Regione Marche.
The HITCH project: Cooperation between EuroRec and IHE Pascal Coorevits EuroRec 2010 Annual Conference June 18 th 2010.
GSC16-OBS-03 ITU-T GSC – 16 Observer Presentation Karen Higginbottom, JTC 1 Chair.
Bielefeld Conference 2006: Academic Library and Information Services: New Paradigms for the Digital Age Hans Geleijnse Director of Library and IT Services.
Integrated Ocean Observing System Data Management and Communications March 2004 The US Integrated Ocean Observing System (IOOS) Plan for Data Management.
NOAA Metadata Update Ted Habermann. NOAA EDMC Documentation Directive This Procedural Directive establishes 1) a metadata content standard (International.
RDA Wheat Data Interoperability Working Group Outcomes RDA Outputs P5 9 th March 2015, San Diego.
Thomas Hacker Barb Fossum Matthew Lawrence Open Science Grid May 19, 2011.
Key integrating concepts Groups Formal Community Groups Ad-hoc special purpose/ interest groups Fine-grained access control and membership Linked All content.
The MyOcean Concept P.Bahurel, project coordinator.
Bennett Adelson. Microsoft Solution Center. Independence OH February 4, 2010 BENNETT ADELSON Microsoft® Solution Center.
Government IAM Ministerial Conference Participants Virtual Water Forum Water Voice Sessions interaction Ministerial Declaration Interaction Session Reports,
Larger GIS Community Can answer: –Local questions at small extents Spatial and temporal extents limited –Global questions at low resolution (while ignoring.
Joint WMO-IOC Technical Commission for Oceanography and Marine Meteorology Contributions to WIGOS David Meldrum, vice chair, JCOMM OPA.
Tools in Support of a National DMAC Derrick Snowden NERACOOS/ODP Annual Meeting 26 Sep 2012.
SAMOS-GOSUD Meeting. Boulder 2-4 May Potential collaboration between the Coriolis project and the Samos initiative L. Petit de la Villéon. Ifremer-France-
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
1 Launching a European Platform  Towards a common European policy  Jan Busschbach.
1 Global Systems Division (GSD) Earth System Research Laboratory (ESRL) NextGen Weather Data Cube Chris MacDermaid October, 2010.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
Data discovery and data processing for environmental research infrastructures Roberto Cossu ENVRI WP4 leader ESA.
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative External Observatory Integration Christopher Mueller, Matt Arrott, John Graybeal Life Cycle.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
CyberInfrastructure workshop CSG May Ann Arbor, Michigan.
Curriculum Report Card Implementation Presentations
EPA’s Role in the Global Earth Observation System of Systems (GEOSS)
AUKEGGS Architecturally Significant Issues (that we need to solve)
1 김 수 동 Dept. of Computer Science Soongsil University Tel Fax
Russ Hobby Program Manager Internet2 Cyberinfrastructure Architect UC Davis.
Observing System Monitoring Center (OSMC) Status Update April 2005 Steve Hankin – PMEL (co-PI) Kevin Kern – NDBC (co-PI)
OOI CyberInfrastructure: Data Management Architecture Specification Workshop June 30-July 1, 2008 Matthew Arrott, Ingolf Krueger, Claudiu Farcas, Emilia.
Policy Based Data Management Data-Intensive Computing Distributed Collections Grid-Enabled Storage iRODS Reagan W. Moore 1.
Center for Satellite Applications and Research (STAR) Review 09 – 11 March 2010 In Situ SST for Satellite Cal/Val and Quality Control Alexander Ignatov.
Cloud Networked Robotics Speaker: Kai-Wei Ping Advisor: Prof Dr. Ho-Ting Wu 2013/04/08 1.
Bob Keeley Marine Environmental Data Service Dept. of Fisheries and Oceans Ottawa, Canada Jun, 2006 SeaDataNet Meeting.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Kevin O’Brien University of Washington/JISAO NOAA/PMEL The Observing System Monitoring Center Steve Hankin, PMEL Ted Habermann, NGDC David Neufeld, NGDC.
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
JCOMM Services Program Area Working together beyond GODAE for Operational Oceanography Dr. Craig Donlon JCOMM Service Programme Area Coordinator The Met.
The Unified Access Framework for Gridded Data … the 1 st year focus of NOAA’s Global Earth Observation Integrated Data Environment (GEO-IDE) Steve Hankin,
Observing System Monitoring Center (OSMC) Work in progress in brief June 2005 Steve Hankin, Kevin O’Brien – PMEL.
Integrating ocean climate observations Perspectives for consideration by the COSC Steve Hankin, NOAA/PMEL.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Implementing Marine XML for NOAA Observing Data Nazila Merati and Eugene Burger NOAA/Pacific Marine Environmental Laboratory Seattle, WA.
Google Earth INTEGRATING GLOBAL THINKING. Why Use Virtual Tours? Flexible Tool: History, Science, Math, English, etc. An Interactive Way to Explore Supports.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
1 SIMDAT Simdat Project –GTD. Meteo Activity – SIMDAT Meteo Activity OGF June 2008 Barcelona Marta Gutierrez, Baudouin Raoult, Cristina.
1 The Argo project 21st Century in-situ Ocean Observing System M. Belbeoch, Argo Technical Coordinator with inputs from D. Roemmich, Argo Steering Team.
The FI-WARE Project – Base Platform for Future Service Infrastructures FI-WARE OCTOBER 2011 Presentation at proposers day.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
1. 2 NOAA’s Mission To describe and predict changes in the Earth’s environment. To conserve and manage the Nation’s coastal and marine resources to ensure.
Using Twitter to Share What Keeps You Well: Virtual Asset Mapping Nancy Greig and Lesley Roome, Health and Social Care Alliance Scotland ( the ALLIANCE)
Education Portal Solutions for Higher Education Education portals create a common gateway to the data and services that the people throughout your university.
Data Browsing/Mining/Metadata
Integrating Data and Information Across Observing System
C2CAMP (A Working Title)
Candyce Clark JCOMM Observations Programme Area Coordinator
An ecosystem of contributions
Prepared by: Jennifer Saleem Arrigo, Program Manager
GODAE Quality Control Pilot Project
Presentation transcript:

DMAC Data Integration What is it really? Why does it seem frozen in place? How do we get it moving? Steve Hankin (NOAA/PMEL) DMAC = Data Management and Communications subsystem of the US Integrated Ocean Observing System (IIOS) []

June '07 OCO Annual Review 2 Part 1. A Short Digression (begging your indulgence …) What’s new in the Observing System Monitoring Center (OSMC)

June '07 OCO Annual Review 3

June '07 OCO Annual Review 4

June '07 OCO Annual Review 5 under the hood … Metadata feeds from NOAAPort & GODAE GODAE QC fields to be added next … A feed from NCEP ? Goal: –Compare QC strategies. –Compare GTS filters and feeds.

June '07 OCO Annual Review 6 Part 2. DMAC Data Integration (DMAC = Data Management and Communications subsystem of IOOS) Just what is DMAC “data integration” ? (and what is it not ?) Start with a taxonomy thru examples … What is it really? Why does it seem frozen in place? How do we get it moving?

June '07 OCO Annual Review 7 What is “DMAC integration”? Case study 1: Numerical Weather Prediction Consider FNMOC: Pull observations from GTS Convert disparate formats to single format Apply global QC  An “integrated” data product for assimilation.

June '07 OCO Annual Review 8 Surely this is “integration” … but it is only available to one project –Integration for a narrowly focused purpose Call this “project integration” Note: GODAE Server distributes the FNMOC product … a step in the right direction … a step in the right direction

June '07 OCO Annual Review 9 What is “DMAC integration”? Case study 2: Web Theme Pages Pull together images, documents & links Put care into presentation. “Friendly”

June '07 OCO Annual Review 10 Surely this is “integration” NO! This is a useful service. But it is not data integration.

June '07 OCO Annual Review 11 What is “DMAC integration”? Case study 3: Argo program Internationally planned A single agreed upon format Data openly shared QC and metadata carefully managed Distribution infrastructure (DACs & GDACs)

June '07 OCO Annual Review 12 Surely this is “integration” … but it is only applicable to one platform –Unique formats & distribution infrastructure Call this “platform integration” “Project integration” and “platform integration” are not the concept we are after in DMAC.

June '07 OCO Annual Review 13 An analogy: the electric power grid Energy goes in. Energy comes out. Providers do not target specific consumers. They just adhere to standards (60Hz). Consumers are not aware of specific providers. Analogy appears simplistic until you refine your concept of data. Data must always be tightly bound to its metadata.  DMAC integration is a “data grid” The concept of “integration” in DMAC Analogy is simplistic?

June '07 OCO Annual Review 14 The DMAC Plan (2004) is built around a “data grid” concept (a.k.a. “data commons”) Uniform services (standards) –to interconnect existing systems “Do no Harm” Existing standards are inadequate  An implementation plan, not a specification 240 pages How far have we progressed?

June '07 OCO Annual Review 15 Honest answer: barely at all. Why? 1.Formulation choices in the DMAC Plan 2.Political chaos 3.Community social structure How do we overcome each of these obstacles? How far has DMAC progressed since 2004?

June '07 OCO Annual Review 16 DMAC Plan has detailed milestones But they are not sufficiently tangible – e.g. “publish a community standard for [xxx]”. Solution: Reformulate the Plan as a sequence of tasks that each provide tangible benefits. Obstacle 1: Formulation choices in the plan

June '07 OCO Annual Review 17 Dumb, bad luck timing (post 9/11) & Interagency coordination failures lead to Negligible direct funding (just enough for “volunteer” meetings) (Note: millions have been made available that generated additional demand for DMAC guidance) (Note: millions have been made available that generated additional demand for DMAC guidance) Solution: Better marketing. Map out a Plan that can be marketed to Gov’t managers Obstacle 2: Political chaos

June '07 OCO Annual Review 18 Obstacle 3: Community social structure The diminutive nation of Science Data Management lies nestled among three neighbors: 1. IT Infrastructure 2. Computer Science 3. Science Research Each is larger and more powerful and imposes its viewpoint on our small nation. Science Research Computer Science IT Infrastructure Data Mgmt

June '07 OCO Annual Review 19 Obstacle 3: Community social structure 1. IT Infrastructure (CIO) viewpoint: “Solutions can be purchased if systems engineering discipline is followed.” But integration is not a system you can purchase. It is a change in how we work together. It must be built in partnership w/ data providers and users. Note: The DMAC Plan lays out a strong support role for systems engineering. (Useful reading: “The Innovator’s Dilemma”, by Clayton Christensen)

June '07 OCO Annual Review 20 Obstacle 3: Community social structure 2. Computer Science viewpoint: “The latest developing technology will solve the problems.” You can only standardize stable technology. Setting too-high requirements for technological innovation limits access to funding for IT projects that could yield great practical benefits to science. (The root of the “cyberinfrastructure” problem.) (The root of the “cyberinfrastructure” problem.)

June '07 OCO Annual Review 21 Obstacle 3: Community social structure 3. Science/Research viewpoint: “Reduce complexity by limiting the number of variables to be considered initially.” But data management challenges are largely independent of data content. Analogy: would it reduce complexity in designing an ocean glider if it only had to measure temperature? Data management simplifies by reducing the number of data structures (a.k.a. “data models”).

June '07 OCO Annual Review 22 Recap: Reformulate the DMAC Plan  tasks w/ tangible benefits … so we can  tasks w/ tangible benefits … so we can Market the Plan to Gov’t managers Independence of action from neighbors: 1.Partner with the IOOS community 2.Use available technology (wisely) 3.Reduce the initial problems by addressing data structures one by one.

June '07 OCO Annual Review 23 Proposal: Build the DMAC integration framework as a collection of Virtual Data Assembly Centers (“V-DACs”) by data structure. To be developed one-by-one: 1.Grids (models, satellites, climatologies) 2.Time series 3.Surface Tracks 4.Vertical Profiles and Sections 5.…, Scatters, Swaths, Radials, Polygons, …

June '07 OCO Annual Review 24 time series protocol Time series V-DAC Meta- data TAO BATS OceanSites U. Hawaii Sea Level Center NDBC NODC Imagine the V-DAC for time series data …

June '07 OCO Annual Review 25 time series protocol Time series V-DAC Meta- data TAOBATS OceanSites U. Hawaii Sea Level Center NDBC NODC bricks-and-mortar time series “curator” (funded) standard protocol(s) (“web services”) one access point multiple variables Imagine the V-DAC for time series data

June '07 OCO Annual Review 26 also fund a metadata development activity: –Data discovery –Controlled vocabularies –Data lineage –Geo-referencing –Instrument characterizations –Quality control

How do we build an ocean temperature V-DAC? Time series V-DAC Meta- data Profiles V-DAC Meta- data Grids V-DAC Meta- data Temperature V-DAC Meta- data A single place to access all ocean temperature data

June '07 OCO Annual Review 28 The virtues of this approach: Reductionism: One protocol at a time A concrete deliverable at every step Unites communities of interest (integration) But can we market the idea to management? (Who has the ability to carry the message to management?) The science community has a strong voice. (Much stronger than DM.)

June '07 OCO Annual Review 29 Discussion (Thank you)