Standardization Promotes Biogeochemical Data Management and Use in Multidisciplinary Environmental Research Yaxing Wei, Suresh Vannan, Robert B. Cook,

Slides:



Advertisements
Similar presentations
Geographic Digital Content Components André Santanchè Advisor: Dr. Claudia Bauzer Medeiros Database Group Unicamp - Brazil.
Advertisements

V Alyssa Rosemartin 1, Lee Marsh 1, Ellen Denny 1, Bruce Wilson USA National Phenology Network, Tucson, AZ; 2 - Oak Ridge National Laboratory, Oak.
DataModel When data and model are in isolation We are getting …
ORNL DAAC Experience With Digital Object Identifiers (DOIs) Bruce Wilson, ORNL DAAC Manager for NASA Data Center Managers telecon 22 Feb 2010.
1 ORNL DAAC: Data and Services Robert Cook and Suresh SanthanaVannan Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN Presentation.
The International Surface Pressure Databank (ISPD) and Twentieth Century Reanalysis at NCAR Thomas Cram - NCAR, Boulder, CO Gilbert Compo & Chesley McColl.
Spatial Data Access Tool: Yaxing Wei, Suresh-Kumar Santhana-Vannan, Robert B. Cook, Bruce E. Wilson, and Tammy W. Beaty Oak Ridge National.
Fundamental Practices for Preparing Data Sets Bob Cook Environmental Sciences Division Oak Ridge National Laboratory 5 th NACP Principal Investigator’s.
Elements of a Data Management Plan Alison Boyer Environmental Sciences Division Oak Ridge National Laboratory.
Elements of a Data Management Plan
SAFARI 2000 Data Activities at the ORNL DAAC Bob Cook, Les Hook, Stan Attenberger, Dick Olson, and Tim Rhyne Oak Ridge National Laboratory.
Coordinated Energy and water-cycle Observations Peroject A Well Organized Data Archive System Data Integrating/Archiving Center at University of Tokyo.
Fundamental Practices for Preparing Data Sets Robert Cook ORNL Distributed Active Archive Center Environmental Sciences Division Oak Ridge National Laboratory.
U.S. Department of the Interior U.S. Geological Survey Best Practices for Preparing Science Data to Share.
MODIS Subsetting and Visualization Tool: Bringing time-series satellite-based land data to the field scientist National Aeronautics and Space Administration.
Preserving the Scientific Record: Establishing Relationships with Archives Matthew Mayernik National Center for Atmospheric Research Version 1.0 Review.
Soil characteristics, an important terrestrial ecosystem modeling input, affects the photosynthesis, respiration, evapotranspiration, or other biosphere.
Updates from EOSDIS -- as they relate to LANCE Kevin Murphy LANCE UWG, 23rd September
Citing Data Sets in the Literature: ORNL DAAC Practices Robert Cook, Suresh SanthanaVannan, and Daine Wright Environmental Sciences Division Oak Ridge.
An Introduction to Metadata Tammy Walker Beaty Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN Data Management.
ATMOSPHERIC SCIENCE DATA CENTER ‘Best’ Practices for Aggregating Subset Results from Archived Datasets Walter E. Baskin 1, Jennifer Perez 2 (1) Science.
Getting Ready for the Future Woody Turner Earth Science Division NASA Headquarters May 7, 2014 Biodiversity and Ecological Forecasting Team Meeting Sheraton.
ORNL DAAC Spatial Data Access Tool (SDAT): Internet tools to access and visualize land-based data National Aeronautics and Space Administration
CC&E Best Data Management Practices, April 19, 2015 Please take the Workshop Survey 1.
1 ORNL DAAC Data Products and Tools Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN NSIDC User Working Group Meeting.
Enhancing Linkages Between Projects and Datasets: Examples from LBA-ECO for NACP Lisa Wilcox, Amy L. Morrell,
Data Citation and Data Attribution A View from the Data Center Perspective Bruce E. Wilson Group Lead, Client & Collaboration Technologies Oak Ridge National.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
U.S. Department of the Interior U.S. Geological Survey Access to MODIS Land Data Products Through the Land Processes DAAC John Dwyer and Carolyn Gacke,
1 Global Systems Division (GSD) Earth System Research Laboratory (ESRL) NextGen Weather Data Cube Chris MacDermaid October, 2010.
Global map layers Additional global data sets such as Hydrology data (Hydrosheds), new and updated Landcover data (Globcover), demographic data and others.
Center Pixel Value Mean Value of All Pixels Percent of Pixels that meet QC Criteria Web and Web Services based tool that provides subsets and visualization.
MODIS Land Product Subsets Suresh K. Santhana Vannan, Robert B. Cook, Bruce E. Wilson, Lisa M. Olsen HDF and HDF-EOS Workshop XII October 15 – October.
1 ORNL DAAC WebGIS Demonstration Suresh Santhana Vannan, Robert Cook, Tammy W. Beaty and Yaxing Wei ORNL DAAC Oak Ridge National Laboratory Distributed.
WK 13 - How to Prepare Ecological Data Sets for Effective Analysis and Sharing 2:00 PM-5:00 PM August 1 st, 2010.
ESIP Federation 2004 : L.B.Pham S. Berrick, L. Pham, G. Leptoukh, Z. Liu, H. Rui, S. Shen, W. Teng, T. Zhu NASA Goddard Earth Sciences (GES) Data & Information.
North American Carbon Program Sub-pixel Analysis of a 1-km Resolution Land-Water Mask Source of Data: The North American sub-pixel water mask product is.
Using the Global Change Master Directory (GCMD) to Promote and Discover ESIP Data, Services, and Climate Visualizations Presented by GCMD Staff January.
Managing Your Data: Assign Descriptive File Names Robert Cook Oak Ridge National Laboratory Section: Local Data Management Version 1.0 October 2012.
DataONE: Preserving Data and Enabling Data-Intensive Biological and Environmental Research Bob Cook Environmental Sciences Division Oak Ridge National.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
NACP A High-Resolution Daily Surface Weather Database for NACP Investigations Peter E. Thornton 1, Robert B. Cook 2, W. Mac Post 2, Bruce E. Wilson 2,
Cyberinfrastructure to promote Model - Data Integration Robert Cook, Yaxing Wei, and Suresh S. Vannan Oak Ridge National Laboratory Presented at the Model-Data.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
November 16, 2009 Page 1 of 28 Data and Data Management: Introduction to the BCO-DMO Presented to Professor Keiichi Uchida November 16, 2009 Robert C.
ORNL DAAC: Introduction Bob Cook ORNL DAAC Environmental Sciences Division Oak Ridge National Laboratory.
1 U.S. Department of the Interior U.S. Geological Survey LP DAAC Dave Meyer, LP DAAC Project Scientist Stacie Doman Bennett, LP DAAC Scientist.
Vegetation Index Visualization of individual composite period. The tool provides a color coded grid display of the subset region. The tool provides time.
Terra MODIS Collection 4 / 4.5 and Aqua MODIS Collection 4; Sinusoidal Projection Data from 2000 to present; 8-day, 16-day, or annual composites Sites.
Satellite & Model Product Evaluation Center (SPEC): A Software System Providing Ready Access To Co-located Data Subsets From Satellite, In-situ, and Model.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Ecosystem carbon storage capacity as affected by disturbance regimes: a general theoretical model Introduction Disturbances can profoundly affect ecosystem.
Data Systems Integration Committee of the Earth Science Data System Working Group (ESDSWG) on Data Quality Robert R. Downs 1 Yaxing Wei 2, and David F.
Global Change Master Directory (GCMD) Mission “To assist the scientific community in the discovery of Earth science data, related services, and ancillary.
ORNL DAAC SPATIAL DATA ACCESS TOOL Open Geospatial Consortium (OGC) Services Bruce E. Wilson Suresh K. Santhana Vannan Yaxing Wei Tammy W. Beaty National.
Managing Your Data: Assign Descriptive File Names Robert Cook Oak Ridge National Laboratory Version 1.0 Review Date.
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
Data Management Practices for Early Career Scientists: Closing Robert Cook Environmental Sciences Division Oak Ridge National Laboratory Oak Ridge, TN.
1 2.5 DISTRIBUTED DATA INTEGRATION WTF-CEOP (WGISS Test Facility for CEOP) May 2007 Yonsook Enloe (NASA/SGT) Chris Lynnes (NASA)
ORNL DAAC MODIS Land Product Subsets 1 Suresh K. Santhana Vannan, Robert B. Cook, Bruce E. Wilson, Lisa M. Olsen Environmental Sciences Division, Oak Ridge.
A Data Access and Comparison Tool for Biogeochemical Data from Diverse Sources Jerry Pan 1, Robert B. Cook 1, Suresh K. Santhana Vannan 1, Bruce E. Wilson.
NASA Tools for Remote-Sensing in Ecology Research Workshop 2: NASA Tools for Remote-Sensing in Ecology Research 95 th Annual ESA Meeting, Workshop 2, July.
Data Management for ACT-America Bob Cook 1, Gao Chen 2, Yaxing Wei 1, and Thomas Lauvaux 3 1 Oak Ridge National Laboratory 2 NASA Langley 3 Penn State.
Data Browsing/Mining/Metadata
Global Precipitation Data Access, Value-added Services and Scientific Exploration Tools at NASA GES DISC Zhong Liu1,4, D. Ostrenga1,2, G. Leptoukh4, S.
Flanders Marine Institute (VLIZ)
Common Framework for Earth Observation Data
Improving Data Access, Discovery, and Usability
Data and Data Management: Introduction to the BCO-DMO
Potential Landsat Contributions
Presentation transcript:

Standardization Promotes Biogeochemical Data Management and Use in Multidisciplinary Environmental Research Yaxing Wei, Suresh Vannan, Robert B. Cook, Tammy Beaty, Alison G. Boyer, Makhan L. Virdi, Leslie A. Hook ORNL Distributed Active Archive Center (ORNL DAAC) Climate Change Science Institute Oak Ridge National Laboratory Oak Ridge, TN weiy@ornl.gov

Outline Challenges Introduction to ORNL DAAC ORNL DAAC Data Management Workflow Practices & Benefits for Standardization Data QA/QC & Reformat Metadata DOI & Citation Data Discovery, Visualization, and Distribution Conclusion

Proper Data Management Challenges Proper Data Management Source: DataONE

ORNL DAAC The Oak Ridge National Laboratory Distributed Active Archive Center (ORNL DAAC) archives data produced by NASA’s Terrestrial Ecology Program in support of NASA’s Carbon Cycle and Ecosystems Focus Area. http://daac.ornl.gov ORNL DAAC Field Campaign Model Code Land Validation Regional and Global

ORNL DAAC Data Management Workflow

Various Historical Data Formats at the ORNL DAAC Data QA/QC & Reformat Various Historical Data Formats at the ORNL DAAC Vannan et al., 2016

Benefits: Basis for Everything Data QA/QC & Reformat Push toward standard data formats Open and self-descriptive formats Tabular: CSV ASCII Feature: Shapefile, KML, etc. Raster: GeoTIFF, CF-netCDF Enforce QA/QC Spatial & temporal information Data summary & statistics Benefits: Basis for Everything

Wrong coordinates for soil respiration measurement QA/QC Examples Wrong coordinates for soil respiration measurement Incorrect Time Zone

Metadata Fine granularity of metadata Descriptive metadata data set  data file  data parameter Descriptive metadata Embedded inside data files Consistent tabular data headers Climate & Forecast (CF) convention Discovery metadata Mapping to common metadata models Unified Metadata Model (UMM) ISO 19115 FGDC CSDGM

Benefits: Promote Discovery Standard metadata and discovery service interface enables the integration of ORNL DAAC data products into larger systems.

DOI & Citation Digital Object Identifier (DOI) for each data set Combined with UUID to provide identifiability for files/variables Formal citations http://dx.doi.org/10.3334/ORNLDAAC/1225 http://dx.doi.org/10.3334/ORNLDAAC/1225?urlappend=%3Fgid%3Dd44a4e3d-953f-4279-98ca-8975bc93733b Thornton, P.E., M.M. Thornton, B.W. Mayer, Y. Wei, R. Devarakonda, R.S. Vose, and R.B. Cook. 2016. Daymet: Daily Surface Weather Data on a 1-km Grid for North America, Version 3. ORNL DAAC, Oak Ridge, Tennessee, USA. Accessed Month DD, YYYY. Time period: YYYY-MM-DD to YYYY-MM-DD. Spatial Range: N=DD.DD, S=DD.DD, E=DDD.DD, W=DDD.DD. http://dx.doi.org/10.3334/ORNLDAAC/1328

Benefits: Ensure Authors Receive Credit

Benefits: Promote Data Use

Standards-based Data Visualization & Access Open Geospatial Consortium (OGC) Web services Open-source Project for a Network Data Access Protocol (OPeNDAP) ORNL DAAC Spatial Data Access Tool (SDAT)

Benefits: Enable Data Exploration Interactively visualize data before downloading

Subset 2004 Forest Disturbance Data in San Diego Benefits: Enable On-demand Data Access Original projection is Albers Original extent is CONUS Original resolution is 30 meters Subset 2004 Forest Disturbance Data in San Diego You want data in Geographic Lat/Lon You want a resolution of 0.001 degree You want data in geotiff format This is the spatial extent you want to subset Click “Download Data” button to get data

Benefits: Promotes Data Integration Standard Web services promote distributed data products to be dynamically integrated Dynamic integration of ORNL DAAC Daymet data in USGS Geo Data Portal through OPeNDAP service

Conclusion Data centers, like ORNL DAAC, are the fundamental component to ensure smoother scientific data lifecycle Well-curated and quality scientific data are basis of future reuse Standardization ensures the connection of ORNL DAAC to broader data systems Standardization makes data management more effective and eases the usability of the data

ORNL Distributed Active Archive Center (ORNL DAAC) Questions? Yaxing Wei ORNL Distributed Active Archive Center (ORNL DAAC) weiy@ornl.gov