Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant #0955649 PIs: Peter Fox.

Slides:



Advertisements
Similar presentations
FitNesse in Fifty Minutes Chris Harbert Resonate 1.
Advertisements

The BADC-CSV Format Meeting user and metadata requirements Graham A Parton*, Sam J Pepler British Atmospheric Data Centre, Rutherford Appleton Laboratory,
Towards a Common Provenance Model for Research Publications Linyun Fu Xiaogang Ma Patrick West Stace Beaulieu.
Ontology Classifications Acknowledgement Abstract Content from simulation systems is useful in defining domain ontologies. We describe a digital library.
Virtual Geophysics Laboratory (VGL) VGL v1.2 NeCTAR Project Close R.Fraser, T.Rankine, J.Vote, L.Wyborn, B.Evans, R.Woodcock, C.Kemp July 2013 CSIRO |
Scientific Knowledge Discovery in Complex Semantic Networks of Geophysical Systems (no pressure…) EGU2012, NP2.6 April 25, 2012, Vienna, Austria Peter.
Introduction to DPSIR Framework Han Wang March 9 th, 2012.
ToolMatch: Discovering What Tools can be used to Access, Manipulate, Transform, and Visualize Data Patrick West 1 Nancy Hoebelheinrich.
Still serving data with an old DODS server from the early 90's Jim Manning NOAA's Northeast Fisheries Science Center NERACOOS/NECOSP Data Management Workshop,
HTML & CSS Extended Homework Learn how to create websites by structuring and styling your pages with HTML and CSS. Form: Name: ICT Group: ICT Teacher:
XML Overview. Chapter 8 © 2011 Pearson Education 2 Extensible Markup Language (XML) A text-based markup language (like HTML) A text-based markup language.
Facilitating Next Generation Science Collaboration: Respecting and Mediating Vocabularies with Semantics in Ecosystems Assessments. December 7, 2011, AGU11.
Provenance Capture in Data Access And Data Manipulation Software Patrick West 1 Peter Fox
References: [1] [2] [3] Acknowledgments:
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Dynamic Virtual Observatories James Myers, Luigi Marini, Rob.
© Copyright 2008 STI INNSBRUCK NLP Interchange Format José M. García.
What has been lacking, until recently, is a successful method to develop, implement and sustain informatics solutions to modern application problems, such.
Persistent Identification of Agents and Objects of Global Change: Progress in the Global Change Information System Peter Fox, RPI Curt Tilmes, NASA Xiaogang.
Discovering accessibility, display, and manipulation of data in a data portal Nancy Hoebelheinrich Patrick West 2
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
DOAP – Description of a Project Ontology DOAP provides us with the ability to represent software, software projects, releases of software, licensing information,
TWC Deep Earth Computer: A Platform for Linked Science of the Deep Carbon Observatory Community Xiaogang (Marshall) Ma, Yu Chen, Han Wang, Patrick West,
Access and Query Task Force Status at F2F1 Simon Miles.
Opportunities and constraints for development and translation of digital learning resources How difficult is it to translate or adjust existing digital.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
Beginning with an NSF INTEROP project whose goal is to facilitate the deployment of an Integrated Ecosystem Approach (IEA) to management in the Northeast.
TWC Ontology Development for Provenance Tracing in National Climate Assessment of the US Global Change Research Program Xiaogang Ma a, Jin Guang Zheng.
Coding Provenance in Software and Matching Tools to Data OPeNDAP Provenance Project And ESIP ToolMatch Project Patrick West, Tetherless World Constellation.
WHAT ARE WE GOING TO DO WITH DATA? Rob L Davidson #WCSJ2015 This presentation DOI: /m9.figshare
References: [1] Lebo, T., Sahoo, S., McGuinness, D. L. (eds.), PROV-O: The PROV Ontology. Available via: [2]
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Virtual Geophysics Laboratory (VGL) VGL v1.2 NeCTAR Project Close Ryan Fraser, Terry Rankine, Joshua Vote, Lesley Wyborn, Ben Evans, Robert Woodcock July.
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
1 Advanced Semantic Technologies Deborah McGuinness CSCI , 97543, CSCI , 97014, ITWS , 98113, ITWS , TA: Abigail.
Facilitating Next Generation Science Collaboration: Marine Ecosystems Status Reports and Assessments June 24, 2014 IMBER – D2 Peter Fox (RPI/ Tetherless.
Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation.
Peter Batchelor & Liddy Nevile - OZeWAI HiSoftware Accessibility Solutions Peter Batchelor & Liddy Nevile
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Mike Hildreth DASPOS Update Mike Hildreth representing the DASPOS project 1.
How Environmental Informatics is Preparing Us for the Era of Big Data AGU FM 2013 GC11F-01 December 09, 2013, MW 3001 Peter
Provenance Research BIBI RAJU, TODD ELSETHAGEN, ERIC STEPHAN 1 Pacific Northwest National Laboratory, Richland, WA.
Chapter 4 Murach's JavaScript and jQuery, C4© 2012, Mike Murach & Associates, Inc.Slide 1.
NOAA's Northeast Shelf Ecosystem Status Report: collaborating with IPython Notebooks for reproducibility July 2013 ECO-OP is supported by NSF Grant #
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
How Data Management Supports Ecosystem Science PIFSC Troy Kanemura Coral Reef Ecosystem Program Ecosystem Sciences Division April 4, 2016.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
MSG-085 2RS Common Interest Group SINEX OVERVIEW
® Hosted and Sponsored by W3C Provenance Working Group Update 80th OGC Technical Committee Austin, Texas (USA) Carl Reed March 20, 2012 Copyright © 2012.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
The Reproducible Research Advantage Why + how to make your research more reproducible Presentation for the Center for Open Science June 17, 2015 April.
Pasquale Pagano (CNR-ISTI) Project technical director
Incorporating W3C’s DQV and PROV in CISER’s Data Quality Review and
MIRACLE Cloud-based reproducible data analysis and visualization for outputs of agent-based models Xiongbing Jin, Kirsten Robinson, Allen Lee, Gary Polhill,
Implementing the Data Management Principles Opportunities and Advantages Robert R. Downs, PhD Sr. Digital Archivist, CIESIN, Columbia University.
Scientific Reproducibility using the Provenance for Healthcare and Clinical Research Framework Satya S. Sahoo Collaborators/Co-Authors: Joshua Valdez,
Provenance Capture in Data Access And Data Manipulation Software
Final review 24th Nov 2014 Brussels
IVOA Provenance METAdata
Data management for reproducible research
Application Programming Interfaces for FIA
CMSP / OCM Vocabulary Services rpi
Introduction to MATLAB Programming
CMIP6 use case and adoption of RDA outputs
Ecosystem Status Report: collaborating with IPython Notebooks
Measurement Semantics: “MEASEM”
Towards Executable Provenance Graphs for Reported Results in Research Publications Linyun Fu Xiaogang Ma Patrick West
Getting Started with GridLAB-D on the Cloud
Data and Information Provenance in NCA4
Presentation transcript:

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant # PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano Informatics and metadata: Stace Beaulieu

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: Adopting a provenance model for a collaborative report July 2013 ECO-OP is supported by NSF Grant # PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano Informatics and metadata: Stace Beaulieu

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: Adopting a provenance model for a collaborative report July 2013

Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: Adopting a provenance model for a collaborative report July 2013 Metadata for data and workflow provenance (i.e., the marine ecosystem indicators and the collaborative report)

Use Case: Northeast Shelf Large Marine Ecosystem Ecosystem Status Report “traceability, repeatability, explanation, verification, and validation” for ecosystem data and information products in the NEFSC Ecosystem Status Report (ESR) Goal:

Page from 2009 ESR Section on Climate Forcing Figures available for download as PDF or image files – but without access to data or metadata

Page from 2009 ESR Section on Climate Forcing Figures available for download as PDF or image files – but without access to data or metadata Note: NOAA directive for ISO metadata, but these are not sufficient to describe time-series indicators

Software design to track provenance M. Di Stefano

Software design to track provenance M. Di Stefano

PROV Data Model W3C Recommendation 30 April 2013 Core Structures (types and relations)

PROV Data Model W3C Recommendation 30 April 2013 Core Structures (types and relations) Entity may be a single data product, or a chapter containing several data products

PROV-O: The PROV Ontology (expresses PROV-DM using OWL2) PROV Data Model W3C Recommendation 30 April 2013 Core Structures (types and relations) Entity may be a single data product, or a chapter containing several data products

Screenshot of IPython Notebook used to track both data and workflow provenance

Screenshot of IPython Notebook used to track both data and workflow provenance Code in Python, Matlab, R, other

Screenshot of IPython Notebook used to track both data and workflow provenance Code in Python, Matlab, R, other

Screenshot of IPython Notebook used to track both data and workflow provenance Notebook can be shared, or output as script, HTML, PDF, other

PDF output of IPython Notebook with clickable links to data and code

Screenshot of csv file at GitHub

Having access not only to the data that are plotted, but also to provenance metadata increases the (re-) usability of the data