Provenance Research BIBI RAJU, TODD ELSETHAGEN, ERIC STEPHAN 1 Pacific Northwest National Laboratory, Richland, WA.

Slides:



Advertisements
Similar presentations
An Integrated, Distributed Traffic Control Strategy for Future Internet An Integrated, Distributed Traffic Control Strategy for Future Internet H. Che.
Advertisements

Pat Langley Computational Learning Laboratory Center for the Study of Language and Information Stanford University, Stanford, California
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
Nmrbox.org NMRbox: TRD 3 A probabilistic core as a coherent inference engine PINE+ Core Extend functionality through the new core PINE+: Assignment, use.
The ADAMANT Project: Linking Scientific Workflows and Networks “Adaptive Data-Aware Multi-Domain Application Network Topologies” Ilia Baldine, Charles.
SDN and Openflow.
T-FLEX DOCs PLM, Document and Workflow Management.
1 Richard White Design decisions: architecture 1 July 2005 BiodiversityWorld Grid Workshop NeSC, Edinburgh, 30 June - 1 July 2005 Design decisions: architecture.
1 Using Scalable and Secure Web Technologies to Design Global Format Registry Muluwork Geremew, Sangchul Song and Joseph JaJa Institute for Advanced Computer.
Integrated Scientific Workflow Management for the Emulab Network Testbed Eric Eide, Leigh Stoller, Tim Stack, Juliana Freire, and Jay Lepreau and Jay Lepreau.
Personal Data Management Why is this such an issue? Data Provenance Representing links v Representing data Identifying resources: Life Science Identifiers.
CMPUT Software Process & QualityProcess Categories - slide# 1©P. Sorenson Engineering Process Category  Processes that specify, implement, or maintain.
NGNS Program Managers Richard Carlson Thomas Ndousse ASCAC meeting 11/21/2014 Next Generation Networking for Science Program Update.
Capturing and Presenting Relevant Information in the EDC Eric Minick, Jackson Fox Design, Learning, and Collaboration Spring 2002.
Smart Home Current Progress Summary. Main Processor – Stellaris.
Community Manager A Dynamic Collaboration Solution on Heterogeneous Environment Hyeonsook Kim  2006 CUS. All rights reserved.
IPlant Collaborative Tools and Services Workshop iPlant Collaborative Tools and Services Workshop Overview of Atmosphere.
Collaborative Systems Developing Collaborative Systems with a Reuse Driven Process.
Discussion and conclusion The OGC SOS describes a global standard for storing and recalling sensor data and the associated metadata. The standard covers.
Cracow Grid Workshop 2003 Institute of Computer Science AGH A Concept of a Monitoring Infrastructure for Workflow-Based Grid Applications Bartosz Baliś,
Customized cloud platform for computing on your terms !
DORII Joint Research Activities DORII Joint Research Activities Status and Progress 4 th All-Hands-Meeting (AHM) Alexey Cheptsov on.
Texas A&M University Page 1 9/16/ :22:47 PM Wei Zhao Texas A&M University Is Computer Stuff Science, Engineering, or Something else?
NE II NOAA Environmental Software Infrastructure and Interoperability Program Cecelia DeLuca Sylvia Murphy V. Balaji GO-ESSP August 13, 2009 Germany NE.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
Science Environment for Ecological Knowledge: EcoGrid Matthew B. Jones National Center for.
DR Software: Essential Foundational Elements and Platform Components UCLA Smart Grid Energy Research Center (SMERC) Industry Partners Program (IPP) Meeting.
Design and Implementation of an Operational Tsunami Forecast Tool Donald W. Denbo, John R. Osborne, Clinton K. Pells and Mike A. Traum Joint Institute.
Traceability, reproducibility, and scalability in Integrated Ecosystem Assessments: July 2013 ECO-OP is supported by NSF Grant # PIs: Peter Fox.
Interoperability between Scientific Workflows Ahmed Alqaoud, Ian Taylor, and Andrew Jones Cardiff University 10/09/2008.
1 - GEC8, San Diego, July 20-22, 2010 Measurement Tools in PlanetLab Europe Tanja Zseby (Fraunhofer FOKUS, Berlin, Germany) (some slides from other OneLab.
An Ontological Framework for Web Service Processes By Claus Pahl and Ronan Barrett.
Jamie Hall (ILL). SciencePAD Persistent Identifiers Workshop PANData Software Catalogue January 30th 2013 Jamie Hall Developer IT Services, Institut Laue-Langevin.
Grid Computing & Semantic Web. Grid Computing Proposed with the idea of electric power grid; Aims at integrating large-scale (global scale) computing.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Update on the CCA Groundwater Simulation Framework: the BOCCA Experience Bruce Palmer, Yilin Fang, Vidhya Gurumoorthi, James Fort, Tim Scheibe Computational.
INNOV-10 Progress® Event Engine™ Technical Overview Prashant Thumma Principal Software Engineer.
ARCH-11: Building your Presentation with Classes John Sadd Fellow and OpenEdge Evangelist Sasha Kraljevic Principal TSE.
SOA – Bird’s Eye View Arun Majumdar, CTO, VivoMind Intelligence Inc, Rockville, MD.
Technical Update 2008 Sandy Payette, Executive Director Eddie Shin, Senior Developer April 3, 2008 Open Repositories 2008, Fedora User Group.
A Practical Approach to Metadata Management Mark Jessop Prof. Jim Austin University of York.
SSE3 Hypertext concepts 1. Agenda Pioneers and evolution Hypermedia – Modern hypermedia technology – Structure domains Architectural evolution The project.
NORDUnet Nordic Infrastructure for Research & Education Workshop Introduction - Finding the Match Lars Fischer LHCONE Workshop CERN, December 2012.
Enabling e-Research in Combustion Research Community T.V Pham 1, P.M. Dew 1, L.M.S. Lau 1 and M.J. Pilling 2 1 School of Computing 2 School of Chemistry.
MyGrid/Taverna Provenance Daniele Turi University of Manchester OMII f2f Meeting, London, 19-20/4/06.
Biomedical Informatics Research Network BIRN Workflow Portal.
EDRN Biomarker Database Curation Web Interface and Model.
Jens Hartmann York Sure Raphael Volz Rudi Studer The OntoWeb Portal.
VisTrails Second Provenance Challenge Tommy Ellkvist David Koop Juliana Freire Joint work with: Erik Andersen, Steven P. Callahan, Emanuele Santos, Carlos.
TNA Mobility II By Henry N Jerez. TNA Principles Persistent Identification of all:  Network Components  Services  Users Functionality Abstraction 
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Scientific Workflows for OOI Ilkay Altintas Charles.
Gerhard Dueck -- CS3013Architecture 1 Architecture-Centric Process  There is more to software development then going blindly through the workflows driven.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Varnish Cache and its usage in the real world Ivan Chepurnyi Owner EcomDev BV.
> Blueprint Executive Summary >. Presentation Purpose; This Executive Summary captures the key details and outputs which merit stakeholder alignment to.
Biomedical Informatics Research Network BIRN Workflow Portal.
Overview on the work performed during EPIKH Training Faiza MEDJEK /INFN, CATANIA 1.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Sharing models as social objects through HydroShare
Agenda Federated Enterprise Architecture Vision
Data Ingestion in ENES and collaboration with RDA
Tools and Services Workshop Overview of Atmosphere
Peter Kacsuk MTA SZTAKI
Knowledge Based Workflow Building Architecture
NSDL Data Repository (NDR)
Presentation transcript:

Provenance Research BIBI RAJU, TODD ELSETHAGEN, ERIC STEPHAN 1 Pacific Northwest National Laboratory, Richland, WA

Provenance Goals Provenance Overarching Goals Provide thorough results explanation Reuse, repeat, or reproduce workflows Enable performance optimization Provenance Work in FY15 Developed a provenance capture ontology Provenance infrastructure build out Scalable provenance capture mechanism Developed a Client API that aids in production of provenance 2

Provenance for Reproducibility and Performance Provenance Provenance for Reproducibility Work towards achieving numerical, experimental and execution reproducibility. Performance Provenance Augment/replace existing strategies with the Open Provenance Model-based WorkFlow Performance Provenance (OPM-WFPP): Capture empirical performance information from workflows and systems Links provenance information and performance metrics 3 Input Appl. Input Appl. Output WFM Output StorageNetwork Core Inter connect Core Network Storage Protocol OSProtocol OS Access Protocol Workflow System Software Systems Access Protocol

Status Roadmap for FY16 Incorporate time-series system environment metrics store (In Progress) Add provenance capture mechanism that can handle the high-velocity provenance information – scalability (In Progress) Develop different language bindings for ProvEn Client API Design services supporting the harvesting of provenance from native source types Performance metrics reporting user interface 4

ACME – IPPD – Panorama Collaboration Joint route forward, where the Panorama Pegasus workflow is used to implement an ACME workflow and capture provenance. ACME workflow with ASCR Integrated end-to-end Performance Prediction and Diagnosis for Extreme Scientific Workflows (IPPD) provenance model adapted for ACME IPPD developed provenance store instance for ACME. 5

Thank You! 6