© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE NEON APPROACH TO DATA INGEST, CURATION, AND SHARING Christine Laney (Data.

Slides:



Advertisements
Similar presentations
Grand Challenges Hydrologic Sciences: Closing the water balance Social Sciences: People, institutions, and their water decisions Engineering: Integration.
Advertisements

Data and Information Framework: Principles Sue Barrell Bureau of Meteorology, Australia CBS-Ext.(14), Asuncion, September 2014.
The North American Carbon Program Google Earth Collection Peter C. Griffith, NACP Coordinator; Lisa E. Wilcox; Amy L. Morrell, NACP Web Group Organization:
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Information Types and Registries Giridhar Manepalli Corporation for National Research Initiatives Strategies for Discovering Online Data BRDI Symposium.
Transformations at GPO: An Update on the Government Printing Office's Future Digital System George Barnum Coalition for Networked Information December.
Jennifer A. Dunne Santa Fe Institute Pacific Ecoinformatics & Computational Ecology Lab Rich William, Neo Martinez, et al. Challenges.
Development of a Community Hydrologic Information System Jeffery S. Horsburgh Utah State University David G. Tarboton Utah State University.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Data-PASS Shared Catalog Micah Altman & Jonathan Crabtree 1 Micah Altman Harvard University Archival Director, Henry A. Murray Research Archive Associate.
Upcoming Enhancements to the HST Archive Mark Kyprianou Operations and Engineering Division Data System Branch.
Scientific Data Infrastructure in CAS Dr. Jianhui Scientific Data Center Computer Network Information Center Chinese Academy of Sciences.
About CUAHSI The Consortium of Universities for the Advancement of Hydrologic Science, Inc. (CUAHSI) is an organization representing 120+ universities.
Information Requirements for Integrating Spatially Discrete, Feature- Based Earth Observations Jeffery S. Horsburgh Anthony Aufdenkampe, Kerstin Lehnert,
Data Integration, Analysis, and Synthesis Matthew B. Jones National Center for Ecological Analysis and Synthesis University of California Santa Barbara.
1 Benjamin Perry, Venkata Kambhampaty, Kyle Brumsted, Lars Vilhuber, William Block Crowdsourcing DDI Development: New Features from the CED 2 AR Project.
Long Term Ecological Research Network Information System LTER Grid Pilot Study LTER Information Manager’s Meeting Montreal, Canada 4-7 August 2005 Mark.
Using the Open Metadata Registry (openMDR) to create Data Sharing Interfaces October 14 th, 2010 David Ervin & Rakesh Dhaval, Center for IT Innovations.
Cyberinfrastructure Overview Core Cyberinfrastructure Team Matthew B. Jones National Center for Ecological Analysis and Synthesis (NCEAS) University of.
Introduction to Apache OODT Yang Li Mar 9, What is OODT Object Oriented Data Technology Science data management Archiving Systems that span scientific.
Creating and Operating a Digital Library for Information and Learning– the GROW Project Muniram Budhu Department of Civil Engineering & Engineering Mechanics.
BioData a new bioassessment database for the USGS Briefing for the CDI
OOI CI LCA REVIEW August 2010 Ocean Observatories Initiative OOI Cyberinfrastructure Architecture Overview Michael Meisinger Life Cycle Architecture Review.
The Future of the iPlant Cyberinfrastructure: Coming Attractions.
Introduction to the ESA Planetary Science Archive  Jose Luis Vázquez (ESAC/ESA)  Dave Heather (ESTEC/ESA)  Joe Zender (ESTEC/ESA)
SEEK EcoGrid l Integrate diverse data networks from ecology, biodiversity, and environmental sciences l Metacat, DiGIR, SRB, Xanthoria,... l EML is the.
ESIP & Geospatial One-Stop (GOS) Registering ESIP Products and Services with Geospatial One-Stop.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The iPlant Collaborative Community Cyberinfrastructure for Life Science Tools and Services Workshop Discovery Environment Overview.
Building the LTER Network Information System. NIS History, Then and Now YearMilestone 1993 – 1996NIS vision formed by Information Managers (IMs) and LTER.
Clinical Collaboration Platform Overview ST Electronics (Training & Simulation Systems) 8 September 2009 Research Enablers  Consulting  Open Standards.
GEON2 and OpenEarth Framework (OEF) Bradley Wallet School of Geology and Geophysics, University of Oklahoma
The US Long Term Ecological Research (LTER) Network: Site and Network Level Information Management Kristin Vanderbilt Department of Biology University.
Seeking SC Feedback on Draft Technology Strategy and Roadmap for EarthCube Draft of 3 November 2015 The Technology and Architecture Committee (TAC) Chairs:
Children’s Health Exposure Analysis Resource (CHEAR) CHEAR Center for Data Science Susan Teitelbaum, PhD November 4, 2015.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
XMC Cat: An Adaptive Catalog for Scientific Metadata Scott Jensen and Beth Plale School of Informatics and Computing Indiana University-Bloomington Current.
NSDL STEM Exchange: Technical Overview and Implications for Active Dissemination of Federally Funded Resources Across Implementation Systems.
NeOn Components for Ontology Sharing and Reuse Mathieu d’Aquin (and the NeOn Consortium) KMi, the Open Univeristy, UK
The Earth Information Exchange. Portal Structure Portal Functions/Capabilities Portal Content ESIP Portal and Geospatial One-Stop ESIP Portal and NOAA.
ISWG / SIF / GEOSS OOSSIW - November, 2008 GEOSS “Interoperability” Steven F. Browdy (ISWG, SIF, SCC)
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
NOAA EDMC Ocean Observatories Initiative Cyberinfrastructure Karen Stocks OOI CI Data Curator University of California, San Diego Ocean Observatories.
The Virtual Observatory and Ecological Informatics System (VOEIS): Using RESTful architecture and an extensible data model to provide a unique data management.
ISWG / SIF / GEOSS OOS - August, 2008 GEOSS Interoperability Steven F. Browdy (ISWG, SIF, SCC)
EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No EUDAT Aalto Data.
Data Coordinating Center University of Washington Department of Biostatistics Elizabeth Brown, ScD Siiri Bennett, MD.
Introduction to BODC and GEOTRACES data office Edward Mawji British Oceanographic Data Centre
International Planetary Data Alliance Registry Project Update September 16, 2011.
The National Ecological Observatory (NEON) Brian Wee, Ph.D. Chief of External Affairs, NEON, Inc. 1.
R2R ↔ NODC Steve Rutz NODC Observing Systems Team Leader May 12, 2011 Presented by L. Pikula, IODE OceanTeacher Course Data Management for Information.
GCI Architecture GEOSS Information System Meeting 20 September 2013, ESA/ESRIN (Frascati, Italy) M.Albani (ESA), D.Nebert (USGS/FGDC), S.Nativi (CNR)
Enhancements to Galaxy for delivering on NIH Commons
Strategies for NIS Development
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
EOSC MODEL Pasquale Pagano CNR - ISTI
Joslynn Lee – Data Science Educator
About Client Client is a pioneer in industry that provides catastrophe risk modeling, real-time risk exposure and risk management through available live.
INTAROS WP5 Data integration and management
Network Information System Advisory Committee (NISAC)
DataNet Collaboration
An Overview of Data-PASS Shared Catalog
Persistent Identifiers Implementation in EOSDIS
improve the efficiency, collaborative potential, and
EUDAT B2FIND A Cross-Discipline Metadata Service and Discovery Portal
E-Invoicing for Network Access Customers
Brian Matthews STFC EOSCpilot Brian Matthews STFC
Bird of Feather Session
Presentation transcript:

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. THE NEON APPROACH TO DATA INGEST, CURATION, AND SHARING Christine Laney (Data Products) - Mark Brundege (Cyberinfrastructure) National Ecological Observatory Network 1

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. A continental-scale ecological observatory solely funded by the NSF that: Collects and provides data on the drivers/responses of ecological change across the continent over 30 years Supports standardized methods of data collection and high investment in QA/QC Serves as an infrastructure/backbone for other experiments Develops and provides educational resources to engage communities in working with scientific open data Intro to NEON Project Timeline 2

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. 20 Ecoclimatic domains 20 Core sites: Located in unmanaged wildland conditions 40 Relocatable sites: Representative of human land management effects on ecosystems 36 Aquatic sites &10 colocated STREON sites: Measure changes in aquatic systems over time 3 Airborne Platforms: LiDAR, hyperspectral observations, imagery Intro to NEON: A Continental-Scale Design 3

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. 4 Generalized Terrestrial Sampling Scheme 4

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Data Heterogeneity 5 Current deployment: 17 core, 17 relocatable terrestrial, and 6 aquatic sites Recent rapid addition of data products: 41 publicly available to date

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Data product workflow 6 Processing: basic calibrated data will be processed using algorithms and models to produce synthetic data products that both specialist and non-specialist scientists can use to rapidly and effectively address ecological problems Supporting trust: assignment of meaningful metadata & uncertainty measures Enabling discovery: data portal, semantics

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Standardized Uncertainty Quantification for each subproduct: working with internal scientists and external collaborators. Documentation: Configuration-controlled and linked science designs, engineering documents, as-built documents, protocols, algorithms, etc. that are openly available via the data portal or by request Traceability: – Science challenges  designs  implementation  data  data products – Sample management and sharing via an asset tracking system – Linkages between data products via a data product catalog Standardized Nomenclature & Metadata: – Internal: unique ID, measurement, time, location, etc. – External: Interoperability (LTER, DataONE, CZO, etc.) Supporting trust in the data 7

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Data Ingest and Processing – Data Flow © Rich Niewiroski Jr. DPMS (data transitions) Raw (L0) data QA/QC (L1) data Location controller DRR (unpack messages) Queue Data Portal Database Golden Gate cdsExternalAPI PDR (Oracle) CDS server WebUI or PDA WebUI External Labs Lab ingest router Queue Validator 8

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Data Product Availability Information

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. Oracle database NEON data should be accessible by anyone. Our audience should be able to learn about NEON’s Mission Science designs Data collection protocols Processing practices Data! To do this, we need: A user-friendly interface Credible, traceable data Robust data processing, storage, and querying systems. NEON Data Portal – (2.0 Launched May 2015) 10

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. NEON Data Portal – 11 Find a dataset by Date range Location (site by state or domain) Data product by theme Icons and graphics aid with identification of pertinent data Custom configure the download

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. NEON Data Portal – 12 DATA PACKAGE: Unique citation code for query Data files for chosen sites and time range Variable definition file Readme/manifest Requested documentation Data policy & citation info Learn more about the data product Assess a data product by availability of data for: Months Sites Parameters Available documentation (protocols, algorithms) Format Estimated download size Store a citation code to retrieve the query at a later date

© 2013 National Ecological Observatory Network, Inc. ALL RIGHTS RESERVED. A Few Future Plans Adding more data products as sites continue to be built and commissioned. Scoping for next iteration of data portal: Will be seeking community feedback via meetings and form at data.neoninc.org Preparing for development of external API Assess usability of data packages Assess options for structured metadata Report and track metrics 13 THANKS!