Download presentation
Presentation is loading. Please wait.
Published byMatilda Greer Modified over 9 years ago
1
Data Resources US Perspective Kerstin Lehnert Suzanne Carbotte Lamont-Doherty Earth Observatory of Columbia University
2
Scientific Data in the Digital Age “It is exceedingly rare that fundamentally new approaches to research and education arise. Information technology has ushered in such a fundamental change. Digital data collections are at the heart of this change.” US National Science Board, Report to the US National Science Foundation,, 2005
3
Access to Data “Effective access to research data, in a responsible and efficient manner, is required to take advantage of the new opportunities and benefits offered by new information and communication technologies.” Organization for Economic Co-operation & Development: “Principles and Guidelines for Access to Research Data from Public Funding” May 2007
4
Open Access to Data: Benefits Democratize access to research resources Ensure broad dissemination of results Facilitate new cross-disciplinary approaches - access for non- specialist users Enable verification of research results Provide new research opportunities Provide access to data from variety of sources and enable integration across fields Provide foundation for use of automated tools Facilitate more efficient use of resources Data are often expensive to collect (especially marine!) often/usually unique, repeat collection/analysis rare
5
Data Synthesis ‘the Old Way’ Months to Years
6
Data Synthesis Today 2 Minutes
7
Data Visualization: 2 Minutes GeoMapApp software: www.geomapapp.org
8
Sharing Research Data: USA “GAO recommends the agencies explore opportunities in the grants process to better ensure the availability of data to other researchers and determine if additional archiving strategies are warranted.” GAO Report #07-1172 September 28, 2007
9
Existing US Data Resources relevant for MARGINS Science Marine Geoscience Data System: hosts the MARGINS Data Portal Geoinformatics for Geochemistry: hosts PetDB, SedDB, SESAR, EarthChem (links to GEOROC & NAVDAT) NGDC: Marine geoscience data - mostly legacy programs IRIS: Seismic network data and earthquake catalogs UNAVCO: GPS data GEON: Lidar data SIO-GDC: hosts marine geoscience data from Scripps expeditions WHOI: hosts data from vehicles of the NDSF
10
www.marine-geo.org www.geoinfogeochem.org
11
PetDB SESAR Sample Registry EarthChem SedDB Antarctic Multibeam Seismic Reflection Field Data Center MARGINS Ridge2K Legacy GfG & MGDS Collaborations & Partnerships Boston Univ Oregon State Boise State University of Kansas WHOI Scripps Texas A&M UTIG NGDC University of NH Data & IT GEON UNAVCO USGS IODP ICDP Pangaea CoreWall PaleoStrat MetPetDB LEPR Data & IT GEON UNAVCO USGS IODP ICDP Pangaea CoreWall PaleoStrat MetPetDB LEPR Science
12
DevelopmentOperation - Data modeling - Metadata standards - QC & ingestion procedures - Data submission tools & procedures - Solicitation & Compilation - Ingestion - Quality Control - Documentation - Curation - User support - Archiving - Web applications - Query tools - Download options - Web services, XML - Visualization & data analysis tools - System operation - Maintenance - User support - Education modules - Presentations - Publications - Exhibits & demos - Workshops & short courses - Web sites (News etc.) DataServicesAccess Education & Outreach
13
Scope of the MGDS Metadata catalog: Central cruise catalog and data repository for all MARGINS programs- important goal is to preserve full data collection context for each expedition Sensor Database: data documentation and access for multibeam and geophysical data from Palmer & Gould and MCS reflection data from Ewing & Langseth Global DEM: Synthesis of multibeam bathymetry into the Global Multi Resolution Topography - GMRT MG&G Legacy data and derived data Tools for data access: lower barrier to data access with tools tailored to science needs October 23-24, 2007
14
MARGINS Database Provides access to expedition information & data for all MARGINS funded marine and some terrestrial programs Diverse data collected during these programs hosted within MARGINS database: swath bathymetry gravity and magnetics MCS reflection water column data (BLISP, CTD) side-scan sonar mapping data rock and fluid sampling information Database includes links to WHOI (near bottom camera), UTIG (processed MCS), IRIS (seismometer), UNAVCO (GPS)
15
MGDS Data Holdings
16
MGDS Access Interfaces Data Link (server side) GeoMapApp (client side) Web services Access data hosted at distributed data repositories
17
Access to data at distributed data repositories Alvin and Jason2 near bottom photos
18
With bathymetry tiles exposed through a programmatic interface - can make use of GoogleEarth
19
GfG Program: Scope PetDB, SedDB, EarthChem data sets Build and provide access to integrated compilations of large volumes of geochemical data desktop access to the entire published geochemical literature within minutes EarthChem Portal: Central access point to the broadest range of geochemical data in federated databases SESAR Sample Registry: Provide global unique identifiers for samples; build global sample catalog
20
Database Features Archive & serve integrated data sets of geochemical data (each individual value searchable) Include complete metadata of samples and analytical procedures for searching and data evaluation Offer interactive, dynamic user interfaces that allow extraction of any customized subset of the data Support data analysis Tools for data quality assessment & control. Tools for visualization (map interfaces, plotting tools). Integration with broader Geoscience data via interoperability & partnerships.
21
EarthChem Data
23
EarthChem Portal
24
Access via GeoMapApp
25
Ambiguous Sample Naming Examples from the PetDB Database Sample names are duplicated. Sample names are modified or changed.
26
International Geo Sample Number IGSN SESAR serves as registry that provides & manages unique identifiers for samples IGSN - International Geo Sample Number Obtained upon submission of sample metadata (registration) Implementation in sample collection & curation ongoing (IODP, core repositories) Ca. 4 Mio. samples registered System still under development
27
Challenges for Open Data Access
28
Improving Global Data Access Agreed on statements of principle and recommendations to address technical, procedural, and organizational issues of open global data sharing. “Building a Global Data Network for Studies of Earth Processes at the World’s Plate Boundaries” International Workshop, Kiel (Germany), May 2007. Attended by 71 people from 14 countries. Sponsored by the MARGINS, Ridge2000, InterMARGINS, InterRIDGE programs.
29
Workshop Recommendations Science User Needs Access to all data needed to reproduce scientific results Access to multidisciplinary & integrated marine & terrestrial data Data Documentation & Publication Uniform best practices & standards for data acquisition, data submission to data centers & data publication Easy procedures for metadata creation & data submission Data & Metadata Interoperability Minimize proliferation of metadata standards Development of a data discovery service across distributed data resources Opportunities & Obstacles for International Data Sharing Leverage international bodies & programs (e.g. GEOSS, eGY, ICSU, IPY) Establish dedicated task group & special interest groups to advance implementation of a global data network
31
Cyberinfrastructure Geoinformatics = Cyberinfrastructure for the Geosciences Goal : A genuine infrastructure of highly reliable, widely accessible capabilities and services to support the entire range of scientific work.
32
Infrastructure Components Technological Infrastructure Institutional & Management Models Legal & Policy Framework Financial Support Cultural & Behavioral Changes
33
MARGINS TAMU* LEGACY NGDC/UNH Ridge2000 WHOI* AntarcticMBS Seismic Reflection DMS UTIG (Lead) MGDS
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.