NOAA's Northeast Shelf Ecosystem Status Report: collaborating with IPython Notebooks for reproducibility July 2013 ECO-OP is supported by NSF Grant #0955649.

Slides:



Advertisements
Similar presentations
The BADC-CSV Format Meeting user and metadata requirements Graham A Parton*, Sam J Pepler British Atmospheric Data Centre, Rutherford Appleton Laboratory,
Advertisements

Towards a Common Provenance Model for Research Publications Linyun Fu Xiaogang Ma Patrick West Stace Beaulieu.
A BRIEF INTRO TO THE PROV DATA MODEL Simon Miles The entire W3C Provenance Working Group.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
Ontology Classifications Acknowledgement Abstract Content from simulation systems is useful in defining domain ontologies. We describe a digital library.
Contributing source code to CSDMS Albert Kettner.
11 WARC standard revision workshop Clément Oury IIPC General Assembly open workshops Stanford, April 28th, 2015 IIPC General Assembly – Stanford – April.
C c c Comprehensive Cardiovascular Device Simulation Data and Process Management INTRODUCTION CONTACTS Cardiovascular related simulations such as nonlinear.
Virtual Geophysics Laboratory (VGL) VGL v1.2 NeCTAR Project Close R.Fraser, T.Rankine, J.Vote, L.Wyborn, B.Evans, R.Woodcock, C.Kemp July 2013 CSIRO |
Scientific Knowledge Discovery in Complex Semantic Networks of Geophysical Systems (no pressure…) EGU2012, NP2.6 April 25, 2012, Vienna, Austria Peter.
{ Building Open Access To Our Heritage Andrew Weidner Project Coordinator, New Mexico Historical Newspapers University of North Texas Libraries: Digital.
Introduction to DPSIR Framework Han Wang March 9 th, 2012.
HTML & CSS Extended Homework Learn how to create websites by structuring and styling your pages with HTML and CSS. Form: Name: ICT Group: ICT Teacher:
Facilitating Next Generation Science Collaboration: Respecting and Mediating Vocabularies with Semantics in Ecosystems Assessments. December 7, 2011, AGU11.
Provenance Capture in Data Access And Data Manipulation Software Patrick West 1 Peter Fox
Chapter 3: Formatting MuPAD Documents MATLAB for Scientist and Engineers Using Symbolic Toolbox.
National Center for Supercomputing Applications University of Illinois at Urbana-Champaign Dynamic Virtual Observatories James Myers, Luigi Marini, Rob.
What has been lacking, until recently, is a successful method to develop, implement and sustain informatics solutions to modern application problems, such.
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant # PIs: Peter Fox.
Web software. Two types of web software Browser software – used to search for and view websites. Web development software – used to create webpages/websites.
PREMIS Rathachai Chawuthai Information Management CSIM / AIT.
Software Sustainability Institute Software Attribution can we improve the reusability and sustainability of scientific software?
Modeling and Representing National Climate Assessment Information using Linked Data Jin Guang Zheng 1 Curt Tilmes 2
DOAP – Description of a Project Ontology DOAP provides us with the ability to represent software, software projects, releases of software, licensing information,
Applying Provenance Extensions to OPeNDAP Framework Patrick West, James Michaelis, Tim Lebo, Deborah L. McGuinness Rensselaer Polytechnic Institute Tetherless.
Opportunities and constraints for development and translation of digital learning resources How difficult is it to translate or adjust existing digital.
Deepcarbon.net Xiaogang (Marshall) Ma, Yu Chen, Han Wang, John Erickson, Patrick West, Peter Fox Tetherless World Constellation Rensselaer Polytechnic.
Coding Provenance in Software and Matching Tools to Data OPeNDAP Provenance Project And ESIP ToolMatch Project Patrick West, Tetherless World Constellation.
References: [1] Lebo, T., Sahoo, S., McGuinness, D. L. (eds.), PROV-O: The PROV Ontology. Available via: [2]
Electronic labnotes Mari Wigham COMMIT/. Information WUR  Organising, sharing, finding and reusing data  Expertise in: ● Modelling data.
Information Modeling and Semantic Web Application For National Climate Assessment Jin Guang Zheng 1 Curt Tilmes 2
Deepcarbon.net Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer.
Access and Query Task Force Status at F2F1 Simon Miles.
Facilitating Next Generation Science Collaboration: Marine Ecosystems Status Reports and Assessments June 24, 2014 IMBER – D2 Peter Fox (RPI/ Tetherless.
Toward verifiable science: iPython meets PROV-O (Semantics in Ecosystems Assessments). April 16, 2014 ERRT Peter Fox (RPI/ Tetherless World Constellation.
Peter Batchelor & Liddy Nevile - OZeWAI HiSoftware Accessibility Solutions Peter Batchelor & Liddy Nevile
 Key integrating concepts  Groups  Formal Community Groups  Ad-hoc special purpose/ interest groups  Fine-grained access control and membership 
Data Organization Quality Assurance and Transformations.
Mike Hildreth DASPOS Update Mike Hildreth representing the DASPOS project 1.
How Environmental Informatics is Preparing Us for the Era of Big Data AGU FM 2013 GC11F-01 December 09, 2013, MW 3001 Peter
Provenance Research BIBI RAJU, TODD ELSETHAGEN, ERIC STEPHAN 1 Pacific Northwest National Laboratory, Richland, WA.
MATLAB ® for Engineers, Holly Moore Fourth Edition, Global Edition © Pearson Education Limited 2015 All rights reserved. Figure 7.1 The Leaning Tower of.
Chapter 4 Murach's JavaScript and jQuery, C4© 2012, Mike Murach & Associates, Inc.Slide 1.
Data Management: Data Processing Types of Data Processing at USGS There are several ways to classify Data Processing activities at USGS, and here are some.
How Data Management Supports Ecosystem Science PIFSC Troy Kanemura Coral Reef Ecosystem Program Ecosystem Sciences Division April 4, 2016.
The Semantic eScience Framework AGU FM10 IN22A-02 Deborah McGuinness and Peter Fox (RPI) Tetherless World Constellation.
Poster: EGU Glossary: USGCRP – United States Global Change Research Program NCA – National Climate Assessment GCIS – Global Change Information.
The Reproducible Research Advantage Why + how to make your research more reproducible Presentation for the Center for Open Science June 17, 2015 April.
Incorporating W3C’s DQV and PROV in CISER’s Data Quality Review and
MIRACLE Cloud-based reproducible data analysis and visualization for outputs of agent-based models Xiongbing Jin, Kirsten Robinson, Allen Lee, Gary Polhill,
Provenance Capture in Data Access And Data Manipulation Software
Installing R and R Studio
American History Chapter 7 Sections 1,2 and 3.
Ecological Information Management (EIM) 2008
This is where R scripts will load
Maze Race. Maze Race Race The first thing you need to do is change the background so click on stage. Then click on background. Now click paint Select.
CMSP / OCM Vocabulary Services rpi
Introduction to MATLAB Programming
Ecosystem Status Report: collaborating with IPython Notebooks
Following these steps, you can output your record to your wiki quickly
Python Crash Course CSC 576: Data Science.
This is where R scripts will load
Data Provenance.
Towards Executable Provenance Graphs for Reported Results in Research Publications Linyun Fu Xiaogang Ma Patrick West
Summary of WISE electronic delivery
Contributing source code to CSDMS
Computational Environment Management
Stata Conference July 12, 2019 Abigail S. Baldridge, MS
Getting Started with GridLAB-D on the Cloud
Data and Information Provenance in NCA4
Presentation transcript:

NOAA's Northeast Shelf Ecosystem Status Report: collaborating with IPython Notebooks for reproducibility July 2013 ECO-OP is supported by NSF Grant # PIs: Peter Fox (RPI) and Andrew Maffei (WHOI) NEFSC Collaborators: Jon Hare and Mike Fogarty Software programmer: Massimo Di Stefano Informatics and metadata: Stace Beaulieu

Adopting a provenance model for a collaborative report Lineage, or the history of a data or information product, including how was it processed, who processed it, and where is it stored What is provenance?

Use Case: Northeast Shelf Large Marine Ecosystem Ecosystem Status Report “traceability, repeatability, explanation, verification, and validation” for ecosystem data and information products in the NEFSC Ecosystem Status Report (ESR) Goal:

Page from 2009 ESR Section on Climate Forcing Figures available for download as PDF or image files – but without access to data or metadata Note: NOAA directive for ISO metadata, which includes lineage

Software design to track data provenance M. Di Stefano

PROV Data Model and PROV-O ontology W3C Recommendation 30 April 2013 Core Structures (types and relations) Entity may be a single data product, or a chapter containing several data products Workflow provenance (e.g., how to put together the collaborative report)

Screenshot of IPython Notebook used to track both data and workflow provenance Code in Python, Matlab, R, other

Screenshot of IPython Notebook used to track both data and workflow provenance Notebook can be shared, or output as script, HTML, PDF, other

PDF output of IPython Notebook with clickable links to data and code

Screenshot of csv file at GitHub Access not only to the data that are plotted, but also to provenance metadata for reproducibility

Data provenance: from environmental data (left) to marine ecosystem indicator (right)