Data Management in the U.S. GLOBEC Program SCOR/IGBP Meeting on Data Management for Marine Research Projects Robert C. Groman Woods Hole Oceanographic.

Slides:



Advertisements
Similar presentations
This file includes speaker notes that are in the Notes module of PPT (go to View--->Notes Page)
Advertisements

DIGIDOC A web based tool to Manage Documents. System Overview DigiDoc is a web-based customizable, integrated solution for Business Process Management.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
National facility concerned with looking after and distributing marine data Part of NOC National Marine Facilities Serve science, education and industry,
Visualizing Fitness for Purpose Bob Groman and Dicky Allison Biological and Chemical Oceanography Data Management Office Woods Hole Oceanographic Institution.
Microsoft Excel 2003 Illustrated Complete Excel Files and Incorporating Web Information Sharing.
Environmental Variability, Bowhead Whale Distributions and Iñupiat Subsistence Whaling Carin Ashjian (Woods Hole Oceanographic Institution)
Dale Haidvogel Institute of Marine and Coastal Sciences Putting the “Globe” into U.S. GLOBEC New Models and Methods in Support of Integrated Climate Research.
1 NODC, Russia GISC & DCPC developers meeting Langen, 29 – 31 March E2EDM technology implementation for WIS GISC development S. Sukhonosov, S. Belov.
Exploring large marine datasets using an interactive website and Google Earth Jon Blower, Dan Bretherton, Keith Haines, Chunlei Liu, Adit Santokhee Reading.
OGC Technical Committee Meeting National Resources and Environment Working Group 1 The JGOFS/GLOBEC Data Management System for Serving Physical and Biological.
Data Management in the US GLOBEC Program The 2003 National NVODS Workshop Robert C. Groman Woods Hole Oceanographic Institution September 10 – 12, 2003.
SiS Technical Training Development Track Technical Training(s) Day 1 – Day 2.
Biological and Chemical Oceanography Data Management Office 1 of 12 An Introduction to the Biological and Chemical Oceanography Data Management Office.
Comprehensive Large Array-data Stewardship System (CLASS) Web Site Tutorial Visit CLASS Site at
Web Programming Language Dr. Ken Cosh Week 1 (Introduction)
TPAC Digital Library Talk Overview Presenter:Glenn Hyland Tasmanian Partnership for Advanced Computing & Australian Antarctic Division Outline: TPAC Overview.
Overview of the ODP Data Provider Sergey Sukhonosov National Oceanographic Data Centre, Russia Expert training on the Ocean Data Portal technology, Buenos.
1 Web Database Processing. Web Database Applications Static Report Publishing a report is prepared from a database application and exported to HTML DB.
Tools for accessing distributed in-situ data collections Donald W. Denbo, NOAA/PMEL-JISAO Jason E. Fabritz, NOAA/PMEL-JISAO Bernard J. Kilonsky, Sea Level.
II Course on GBIF Node Management Arusha, Tanzania 31 st October and 1 st November 2008 Tim ROBERTSON Systems Architect GBIF Secretariat Data Publishing.
MEDIN Data Guidelines. Data Guidelines Documents with tables and Excel versions of tables which are organised on a thematic basis which consider the actual.
Still serving data with an old DODS server from the early 90's Jim Manning NOAA's Northeast Fisheries Science Center NERACOOS/NECOSP Data Management Workshop,
Data Management Practices: BCO-DMO’s Successes and Challenges Bob Groman BCO-DMO Woods Hole Oceanographic Institution NERACOOS/NeCODP Data Management Workshop.
Tutorial 10 Adding Spry Elements and Database Functionality Dreamweaver CS3 Tutorial 101.
Controlled Vocabularies (Term Lists). Controlled Vocabs Literally - A list of terms to choose from Aim is to promote the use of common vocabularies so.
AERONET Web Data Access and Relational Database David Giles Science Systems and Applications, Inc. NASA Goddard Space Flight Center.
ISpheres Project. Project Overview iSpheresCore iSpheresImage Demonstration References.
NEPTUNE Canada Workshop Oceans 2.0 Project Environment NEPTUNE Canada DMAS Team Victoria, BC February 16, 2009.
HTML, XHTML, and CSS Sixth Edition Chapter 1 Introduction to HTML, XHTML, and CSS.
Peter H. Wiebe and Nancy Copley Woods Hole Oceanographic Institution How does CMarZ Work? CMarZ Information System / Database /OBIS/ Species Pages.
Online Data Flanders Marine Data & Information Centre InnovOcean site SeadataNet Annual Meeting, Madrid 2009.
Planning for Arctic GIS and Geographic Information Infrastructure Sponsored by the Arctic Research Support and Logistics Program 30 October 2003 Seattle,
U.S. Department of the Interior U.S. Geological Survey Management of Oceanographic time-series data at the Woods Hole Coastal and Marine Science Center.
U.S. GLOBEC Pan-Regional Synthesis Workshop 1 Presentation to the U.S. GLOBEC Pan-Regional Workshop 29 November 2006 Bob Groman Data Access and Associated.
GCE Data Toolbox -- metadata-based tools for automated data processing and analysis Wade Sheldon University of Georgia GCE-LTER.
NcBrowse A Graphical netCDF/OPeNDAP Browser Donald Denbo 1 & John Osborne 2 1 UW/JISAO-NOAA/PMEL, 2 OceanAtlas Software
Integrated Model Data Management S.Hankin ESMF July ‘04 Integrated data management in the ESMF (ESME) Steve Hankin (NOAA/PMEL & IOOS/DMAC) ESMF Team meeting.
INTEGRATED OCEAN DRILLING PROGRAM MANAGEMENT INTERNATIONAL International Data Exchange Workshop – Kiel, Germany – May 9-11, 2007 SEDIS Scientific Earth.
1 Welcome to CSC 301 Web Programming Charles Frank.
NOVA Networked Object-based EnVironment for Analysis P. Nevski, A. Vaniachine, T. Wenaus NOVA is a project to develop distributed object oriented physics.
A/WWW Enterprises 28 Sept 1995 AstroBrowse: Survey of Current Technology A. Warnock A/WWW Enterprises
The Metadata Tool Custom Metadata Tool Who this tool is for: This tool designed to be used a data management system. This tool is geared more for the.
IODE Ocean Data Portal - ODP  The objective of the IODE Ocean Data Portal (ODP) is to facilitate and promote the exchange and dissemination of marine.
Managed by UT-Battelle for the Department of Energy Mercury – Distributed Metadata Tool for Finding and Retrieving CDIAC Data CDIAC UWG Meeting September.
Database Concepts Track 3: Managing Information using Database.
GBIF Data Access and Database Interoperability 2003 Work Programme Overview Donald Hobern, GBIF Programme Officer for Data Access and Database Interoperability.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C.
U.S. GLOBEC Georges Bank 2007 Phase 4B SI Meeting April 23, 2007 GoMODP, Data Interoperability and the MapServer Interface to U.S. GLOBEC Data Presented.
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
D. Heynderickx DH Consultancy, Leuven, Belgium 22 April 2010EuroPlanet, London, UK.
NOVA A Networked Object-Based EnVironment for Analysis “Framework Components for Distributed Computing” Pavel Nevski, Sasha Vanyashin, Torre Wenaus US.
TSS Database Inventory. CIRA has… Received and imported the 2002 and 2018 modeling data Decided to initially store only IMPROVE site-specific data Decided.
Hellenic Centre for Marine Research (HCMR) MedOBIS - Ocean Biogeographic Information System for the Eastern Mediterranean and Black Sea.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Don’t Duck Metadata March 2005 Introducing Setting Up a Clearinghouse Node Topic: Introduction to Setting Up a Clearinghouse Node Objective: By.
Chapter 1 Introduction to HTML, XHTML, and CSS HTML5 & CSS 7 th Edition.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
Distributed Data Servers and Web Interface in the Climate Data Portal Willa H. Zhu Joint Institute for the Study of Ocean and Atmosphere University of.
US GLOBEC Georges Bank Phase 4B Scientific Investigators’ Meeting 1 Presentation to the US GLOBEC Georges Bank Phase 4B Scientific Investigators October.
1 Data Management Office and NEP Presented at the U.S. GLOBEC Northeast Pacific Scientific Investigator Meeting Robert C. Groman January 2006.
CONFIDENTIAL Overview NTP Software Object Store and Cloud Connector™ (OSCC™) has a carefully structured architecture that includes a number of collaborative.
Biological and Chemical Oceanography Data Management Office slide 1 of 10 U.S. GEOTRACES Data Management Cyndy Chandler BCO-DMO ~ WHOI 23 September 2008.
Data Browsing/Mining/Metadata
Flanders Marine Institute (VLIZ)
WHAT DOES THE FUTURE HOLD? Ann Ellis Dec. 18, 2000
Data and Data Management: Introduction to the BCO-DMO
Overview EMODnet Biology Portal Standards used Web services available
ORNL is Operated by UT-Battelle for DOE
Presentation transcript:

Data Management in the U.S. GLOBEC Program SCOR/IGBP Meeting on Data Management for Marine Research Projects Robert C. Groman Woods Hole Oceanographic Institution 8 – 10 December 2003 Click here for PowerPoint versionClick here for PowerPoint version.

U.S. GLOBECU.S. GLOBEC: Goal To understand the population dynamics. Ultimately want to be able to predict changes in distribution and abundance of key species as a result of changes in the physical and biotic environment, such as from climate change.

Three U.S. Programs Georges Bank – field program started in 1995 with some cruises earlier; field program ended in 1999.Georges Bank Northeast Pacific – field program started in Gulf of Alaska too.Northeast Pacific Southern Ocean – field program started in 2001 and ended in 2003.Southern Ocean

Georges Bank Study Area

Northeast Pacific Study Area

Southern Ocean Study Area

Project Components Field program: Georges Bank project completed 120 cruises with 360 days at sea. Laboratory experiments Retrospective studies Analysis and synthesis

Georges Bank Data 120 Cruises in inventory 118 Cruise reports printed 56 Cruise reports on-line Data objects on-line: See web siteSee web site Missing data: Zooplankton counts VPR data Acoustics data Mooring data

Northeast Pacific/CGOA Coastal Gulf of Alaska (CGOA) 35 Cruise reports on-line; 16 will be on-line soon; 36 to come later 35 Event logs on-line 11 from LTOP cruises [29 missing] 9 from Haldorson Trawling cruises [8 missing] 10 from Process/Survey cruises [5 missing] 5 from SECM cruises (via on-line report) [10 missing]

Northeast Pacific/CCS California Current System (CCS) 46 Cruise reports on-line; 2 more in the works 47 Event logs on-line 36 from CCS - LTOP cruises (via on-line report) 11 from CCS – Process/Other cruises

Northeast Pacific Summary Available on-line data include CTD, SST, alongtrack, bottle, SeaSoar, nutrients, pigments, and event logs.on-line data Data soon to be added include CTD, nutrients, and zooplankton.

Southern Ocean 11 Cruises in inventory 11 Cruise reports on-line 11 Event logs on-line 110 data objects on-line, locally or linked remotely, including:data objects Ice core and water column bacteria studies Bird studies BIOMAPERII Chlorophyll, irradiance and productivity studies MOCNESS CTD data Nutrient data Sea ice data Whale sonobuoy data 120 kHz acoustic backscattering data ADCP data Alongtrack data Seal tracking and biology Bathymetry

Southern Ocean Data in the Works IWC Whale data CTD rosette data MOC1/MOC10/net collection data XCTD/XBT data Penguin studies (exists on SO GLOBEC website) ROV data Mooring data

Data Policy Dissemination of data to scientific investigators and others on a timely basis Make available when useful (not necessarily only when finalized) Serve data and information, such as reports, papers, and other program documentation

Data Characteristics and Distribution Approach Data from many, distributed, researchers (greater than 100 contributors) Open access – read only by everyone Restricted access supported, but rarely used Quality control is contributors’ responsibility and on-going Emphasis on access to data and information as early as possible Data sets most useful when used with other data

Data Acknowledgement Policy Any person making substantial use of a data set must communicate with the investigator(s) who acquired the data prior to publication and anticipate that the data collector(s) will be co-author(s) of published results. This extends to model results and to data organized for retrospective studies. See on-line policy statement

Data are accessed from the U.S. GLOBEC Data Server,

Data can be viewed

Or plotted

Track plot of NBP0202

Or downloaded

Data Sources Broad-scale cruises Process cruises Moorings Drifters Satellites Modeling

Instruments CTDs Rosette MOCNESS (3 flavors) Bongo tows Acoustic biomass measurements Video Plankton Recorder Drifters, MET packages,...

SensorsSensors and Computed Parameters Conductivity, temperature, pressure, fluorescence, transmittance, acoustics, light (PAR), video, wind speed/direction, AVHRR,... Biomass, taxonomic composition/size distribution, species (counts, size, stage, status, rates, behavior), density, currents, stratification, heat flux, nutrients, turbulence, chlorophyll,...

Data Access Using the JGOFS Data Management System developed by G. Flierl, J. Bishop, D. Glover, and S. Paranjpe Distributed access via standard web browsersDistributed access via standard web browsers

Web Access Hierarchical list of data objectslistobjects On-line list of dataOn-line list Downloads as ASCII, Matlab files, or reorganized into single or multiple filesDownloads Simple X-Y plotsX-Y plots Created EasyKrig (kriging) and 3-D visualization applicationsEasyKrig3-D visualization

Distributed Data Ten distributed data servers use the US JGOFS software Uses the Web httpd protocol - integrates very well with standard web pages Handles tabular data in ASCII, Matlab format, and user-supplied formats using methods. It is object oriented and data driven.object oriented

Nostalgia In the Olden Days …. Reformatting and processing data was a common activity Merging navigation with measured and computed results also took time First data management system used 9 track tapes for data storage, run in batch Second system used data on disk with techniques to located data within degree squares to improve performance

Meta-data Data about data.data Document information about data elements or attributes (name, size, data type, etc), about records or data structures (length, fields, columns, etc), and about data (where it is located, ownership, etc.). Meta-data may include descriptive information about the context, quality and condition, or characteristics of the data.data elements attributes recordsdata structures

Detailed Meta-data Pros – required for full understanding of data within a database management system. Required if others want to use the data Cons – pain in the neck to prepare, maintain, and enter (Best to take advantage of tools) Currently completing Global Change Master Directory’s DIF records

What’s Happening Organizations creating systems to access their own meta-data and/or data. Umbrella databases linking to other peoples meta-data and/or data. (OBIS, GMBIS, …)OBISGMBIS Linking to meta-data is more manageable than is linking to other people’s data.

Other Efforts LabNet – consortium of marine organizations to make their data available (uses 4D Geobrowser “index cards”)LabNet Ocean Data View - access WOCE, NGDC, and other data sets. CTD, bottle, XBT …Ocean Data View OBIS – “portal” (aggregation server) for biological data (using Darwin Core 2 – OBIS)OBIS

Other Efforts, continued ZOPE – object oriented application serverZOPE LAS – web-based, active-image based data interface for registered data. Used by US JGOFS ProgramUS JGOFS Program uBio – (Universal Biological Indexer and Organizer) a networked information service for biological information resources based on the Taxonomic Name Server (TNS), a thesaurus; an index.

Other Efforts, continued Hexacoral – biggest user in OBIS; uses DiGIR (D.G. Fautin, et al.)Hexacoral DiGIR – Distributed Generic Information Retrieval. Uses XML protocol to get the data. Extends XML to do queries. Uses php software package to execute the code. Supports 14 or 15 databases, e.g SQL based. Three options for JGOFS: export to flat file, export to MySQL, or write own perl script to interface directly to DiGIR (ZooGene -> OBIS)DiGIR

Other Efforts, continued Oregon State University, Randy Keller and Paul Johnson, mapping specialist at HMRG Steve Hankin, “An Implementation Plan for the Data and Communication Subsystem of the U.S. Integrated Ocean Observing System” Margo Edwards at HIG and Dawn Wright at OSU

Other Efforts, continued RIDGE, petrological data. Endeavor Observatory website, Lamont’s PetDBPetDB SIO Ocean Exploration data portal, University of Washington’s Endeavor GIS and Portal to Endeavor Data (PED)GISPED

Educational “Tools” Virtual Research Vessel, University of Oregon and Oregon State UniversityVirtual Research Vessel REVEL, University of WashingtonREVEL Dive and Discover, WHOIDive and Discover

Protocols OpenDAP (was DODS)  http DiGIR  uses XML; but too verbose for physical data. OBIS may use OpenDAP for physical data. JGOFS  http

Other Projects and Protocols Apologies for references I’ve missed. There are many other efforts underway in all these areas.

In the Trenches What temperature: Sea surface, air, at depth? Units? How collected? How calibrated? Data quality control still labor intensive even though we can collect and store gigabytes of data daily

Future Data Management and Display Efforts Enhance data search capabilities Add additional graphical display (visualization) options Improve interface between data system and visualization/analysis tool Consider other protocols, such as OpenDAP

End