Scientific Workflows Lecture 15

Slides:



Advertisements
Similar presentations
1 Coven a Framework for High Performance Problem Solving Environments Nathan A. DeBardeleben Walter B. Ligon III Sourabh Pandit Dan C. Stanzione Jr. Parallel.
Advertisements

CH02: Modeling the process and life cycle Process of developing software (organization and discipline in the activities) contribute to the quality of the.
® IBM Software Group © 2006 IBM Corporation Rational Software France Object-Oriented Analysis and Design with UML2 and Rational Software Modeler 04. Other.
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
Experiences in Integration of the 'R' System into Kepler Dan Higgins – National Center for Ecological Analysis and Synthesis (NCEAS), UC Santa Barbara.
Computational Physics Kepler Dr. Guy Tel-Zur. This presentations follows “The Getting Started with Kepler” guide. A tutorial style manual for scientists.
Microsoft Visual Basic 2012 CHAPTER ONE Introduction to Visual Basic 2012 Programming.
Microsoft Visual Basic 2005 CHAPTER 1 Introduction to Visual Basic 2005 Programming.
An Introduction to Designing and Executing Workflows with Taverna Aleksandra Pawlik University of Manchester materials by Dr Katy Wolstencroft and Dr Aleksandra.
Implementation & Integration Phase Implementation, then integration: Implementation, then integration:  Each module is implemented by member of programmer.
RUP Implementation and Testing
An Introduction to Designing and Executing Workflows with Taverna Katy Wolstencroft University of Manchester.
Preserving the Scientific Record: Preserving a Record of Environmental Change Matthew Mayernik National Center for Atmospheric Research Version 1.0 [Review.
SOFTWARE DESIGN (SWD) Instructor: Dr. Hany H. Ammar
Testing Workflow In the Unified Process and Agile/Scrum processes.
Peter Bajcsy, Rob Kooper, Luigi Marini, Barbara Minsker and Jim Myers National Center for Supercomputing Applications (NCSA) University of Illinois at.
Scientific Workflow Interchanging Through Patterns: Reversals and Lessons Learned Bruno Fernandes Bastos Regina Maria Maciel Braga Antônio Tadeu Azevedo.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Systems Analysis and Design in a Changing World, 3rd Edition
Introduction to Software Engineering. Why SE? Software crisis manifested itself in several ways [1]: ◦ Project running over-time. ◦ Project running over-budget.
_______________________________________________________________CMAQ Libraries and Utilities ___________________________________________________Community.
Implementing computational analysis through Web services Arnaud Kerhornou CRG/INB Barcelona - BioMed Workshop IRB November 2007.
Moby Web Services Iván Párraga García MSc on Bioinformatics for Health Sciences May 2006.
Chapter 2 Introduction to Systems Architecture. Chapter goals Discuss the development of automated computing Describe the general capabilities of a computer.
Distributed Computing With Triana A Short Course Matthew Shields, Ian Taylor & Ian Wang.
1 Limitations of BLAST Can only search for a single query (e.g. find all genes similar to TTGGACAGGATCGA) What about more complex queries? “Find all genes.
Data Structures and Algorithms Dr. Tehseen Zia Assistant Professor Dept. Computer Science and IT University of Sargodha Lecture 1.
Performing statistical analyses using the Rshell processor Original material by Peter Li, University of Birmingham, UK Adapted by Norman.
Introduction to Taverna Online and Interaction service Aleksandra Pawlik University of Manchester.
Inria Rhône-AlpesEMGnet meeting - December 98 1 A Platform for EMG Studies Danielle Ziébelin, Martine Maume and Philippe Genoud INRIA Rhône-Alpes Projet.
Satisfying Requirements BPF for DRA shall address: –DAQ Environment (Eclipse RCP): Gumtree ISEE workbench integration; –Design Composing and Configurability,
Portals and my Grid Stefan Rennick Egglestone Mixed Reality Laboratory University of Nottingham.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
© Geodise Project, University of Southampton, Workflow Application Fenglian Xu 07/05/03.
Onlinedeeneislam.blogspot.com1 Design and Analysis of Algorithms Slide # 1 Download From
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
Microsoft Visual Basic 2015 CHAPTER ONE Introduction to Visual Basic 2015 Programming.
Of 24 lecture 11: ontology – mediation, merging & aligning.
Presenter: Bradley Green.  What is Bioinformatics?  Brief History of Bioinformatics  Development  Computer Science and Bioinformatics  Current Applications.
Aleksandra Pawlik University of Manchester. Something that can be put into a workflow Well described - what the component does Behaves “well” - conforms.
Aleksandra Pawlik Alan Williams University of Manchester.
SOFTWARE TESTING TRAINING TOOLS SUPPORT FOR SOFTWARE TESTING Chapter 6 immaculateres 1.
CIS 375 Bruce R. Maxim UM-Dearborn
OPERATING SYSTEMS CS 3502 Fall 2017
Object Oriented Programming
Working in the Forms Developer Environment
Performing statistical analyses using the Rshell processor
Introduction to Visual Basic 2008 Programming
Mobile Operating System
What contribution can automated reasoning make to e-Science?
FORMAL SYSTEM DEVELOPMENT METHODOLOGIES
CH#3 Software Designing (Object Oriented Design)
Introduction to Opnet Mobile Networks Introduction to Opnet
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 2 Database System Concepts and Architecture.
Chapter 2: Database System Concepts and Architecture
Cohesion and Coupling Chapter 5, Pfleeger 01/01/10.
Managing Business Processes
VISUAL BASIC.
Algorithm Discovery and Design
Computational Physics Kepler
Course: Module: Lesson # & Name Instructional Material 1 of 32 Lesson Delivery Mode: Lesson Duration: Document Name: 1. Professional Diploma in ERP Systems.
Faculty of Computer Science & Information System
Introduction of Week 11 Return assignment 9-1 Collect assignment 10-1
Laura Bright David Maier Portland State University
Causal Models Lecture 12.
Versatile workflow management Tool
Overview of Workflows: Why Use Them?
CHAIMS January 1999 CHAIMS.
Business Process Management and Semantic Technologies
Relating Models to Data
Presentation transcript:

Scientific Workflows Lecture 15 all images are screenshots from the respective applications add slides on e-cell, computational physics (n-body)

The Hypothetico-Deductive View of Science Recall the HD view of science with its particular emphasis on comparing predictions to observations. deduction predictions hypotheses comparison evaluation observations In practice, making predictions and analyzing observations can be far more complex than this view suggests.

Detailed Examination of the HD View On the side of data analysis, scientists may integrate data from multiple sources, process data for missing values and noise, and calculate various summary statistics. On the prediction side, a scientist’s workflow could include: simulating models to produce the predictions, integrating results from heterogeneous models, and transforming the results for comparison to observations. If you unpack the HD method for a particular application in a particular domain, it would include all these extra steps.

Scientific Workflows: Introduction Automating or aiding the scientific process requires computational tools that integrate the detailed steps. Scientific workflow systems handle this problem by organizing computational transformations of data sets. Independently, the workflows formalize and record scientific activities to assist in the reproduction of an analysis. The informatics tools that support scientific workflows help scientists build, execute, revise, and share them.

Scientific Workflows: Representation Workflows consist of nodes that indicate computational procedures and links that indicate data flow. This Kepler workflow gathers data and simulation results from the internet, integrates them, and plots the results.

Scientific Workflows: Benefits In theory, informatics tools for scientific workflows should hide execution-level details for the transformations, track input/output and parameter constraints for the transformations and ensure that they are satisfied, store provenance to support replication, and enable rapid reconfiguration for unique tasks. Workflows replace specialized programs connected through scripting languages with modules connected graphically. Workflow systems let scientists focus more on their data and less on program implementation.

Taverna Workbench component library workflow window component editor

Taverna: Workflow Example This workflow grabs remotely stored sequences of DNA from multiple species, aligns the sequences, and plots the alignment results. http://www.myexperiment.org/workflows/158 One plot from the alignment results as displayed in Taverna.

Taverna: Local Processing Some nodes refer to local operations, such as this one that has associated code and ports for input and output.

Taverna: External Processing Other nodes grab data from external sources or call executable code from external locations. Accesses data from the BioMart data repository. Sends a computation request to the SoapLab web service.

Kepler Workbench component library workflow window

Kepler: Workflow Example This workflow grabs the results of a climate change model for 2020 CE, grabs historical climate change data for 1961–1990 CE, calculates the annual average values, rescales the grid size for the simulations to merge the data, plots the difference in average annual radiation.

Kepler: Node Internals This node imports data from an outside source. The parameters for this node are similar to those for importing data into Taverna.

Kepler: Node Internals This node merges two sources of computational data with the specified merge operation. To plot the results, the scientist must write executable code.

Scientific Workflows: A Commentary Claim: Workflow systems let scientists formalize and record their activities. Superficially this is true, although all workflow systems have adopted a functional view of scientific activity. Nodes operate as functions that take data and parameters as input and produce transformed data as output. Claim: Workflow systems let scientists focus more on their data and less on program implementation. In reality, configuring nodes can be a complex task that simply replaces function calls with GUI frames. Recall that Taverna and Kepler have nodes that include program source code.