National Aeronautics and Space Administration Jet Propulsion Laboratory March 17, 2009 Workflow Orchestration: Conducting Science Efficiently on the Grid.

Slides:



Advertisements
Similar presentations
Programming Languages for End-User Personalization of Cyber-Physical Systems Presented by, Swathi Krishna Kilari.
Advertisements

Improving System Safety through Agent-Supported User/System Interfaces: Effects of Operator Behavior Model Charles SANTONI & Jean-Marc MERCANTINI (LSIS)
Software Architecture Frameworks A Family of Implementations Nikunj Mehta Computer Science Department University of Southern California Los Angeles, CA.
ARCH-05 Application Prophecy UML 101 Peter Varhol Principal Product Manager.
WebRatio BPM: a Tool for Design and Deployment of Business Processes on the Web Stefano Butti, Marco Brambilla, Piero Fraternali Web Models Srl, Italy.
Presented by: Thabet Kacem Spring Outline Contributions Introduction Proposed Approach Related Work Reconception of ADLs XTEAM Tool Chain Discussion.
Provenance in Open Distributed Information Systems Syed Imran Jami PhD Candidate FAST-NU.
Aug. 20, JPL, SoCalBSI '091 The power of bioinformatics tools in cancer research Early Detection Research Network, JPL Mentors: Dr. Chris Mattmann,
National Aeronautics and Space Administration Jet Propulsion Laboratory Supporting Science Through Workflows: Infrastructure, Architecture and Modeling.
Applied Architecture (or… Architecture In Action) David Woollard University of Southern California Software Architecture Group NASA Jet Propulsion Laboratory.
Automated Analysis and Code Generation for Domain-Specific Models George Edwards Center for Systems and Software Engineering University of Southern California.
SWSA: Domain-Specific Software Architecture for Workflow-Based Science Data Systems David Woollard § ✚ Neno Medvidovic § University of Southern California.
Grids and Grid Technologies for Wide-Area Distributed Computing Mark Baker, Rajkumar Buyya and Domenico Laforenza.
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
A Semantic Workflow Mechanism to Realise Experimental Goals and Constraints Edoardo Pignotti, Peter Edwards, Alun Preece, Nick Gotts and Gary Polhill School.
Basic Concepts The Unified Modeling Language (UML) SYSC System Analysis and Design.
Getting Smarter with Information An Information Agenda Approach
June Amsterdam A Workflow Bus for e-Science Applications Dr Zhiming Zhao Faculty of Science, University of Amsterdam VL-e SP 2.5.
International Workshop on Web Engineering ACM Hypertext 2004 Santa Cruz, August 9-13 An Engineering Perspective on Structural Computing: Developing Component-Based.
Chapter 8 Architecture Analysis. 8 – Architecture Analysis 8.1 Analysis Techniques 8.2 Quantitative Analysis  Performance Views  Performance.
San Diego Supercomputer CenterUniversity of California, San Diego Preservation Research Roadmap Reagan W. Moore San Diego Supercomputer Center
Architecture-Based Runtime Software Evolution Peyman Oreizy, Nenad Medvidovic & Richard N. Taylor.
Mihir Daptardar Software Engineering 577b Center for Systems and Software Engineering (CSSE) Viterbi School of Engineering 1.
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
Assessing the Suitability of UML for Modeling Software Architectures Nenad Medvidovic Computer Science Department University of Southern California Los.
Resource Provisioning based on Lease Preemption in InterGrid Mohsen Amini Salehi, Bahman Javadi, Rajkumar Buyya Cloud Computing and Distributed Systems.
Lecture 9: Chapter 9 Architectural Design
San Diego Supercomputer Center SDSC Storage Resource Broker Data Grid Automation Arun Jagatheesan et al., San Diego Supercomputer Center University of.
Model-Driven Approach for User Interface-Business Alignment Kênia Sousa Advisor: Jean Vanderdonckt Université catholique de Louvain (UCL) Louvain School.
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
In each iteration macro model creates several micro modules, sends data to them and waits for the results. Using Akka Actors for Managing Iterations in.
Xiao Liu CS3 -- Centre for Complex Software Systems and Services Swinburne University of Technology, Australia Key Research Issues in.
A Novel Approach to Workflow Management in Grid Environments Frank Berretz*, Sascha Skorupa*, Volker Sander*, Adam Belloum** 15/04/2010 * FH Aachen - University.
Model-Driven Analysis Frameworks for Embedded Systems George Edwards USC Center for Systems and Software Engineering
What caught your eye at DEAS 2005? 40 DEAS 2005 Participants.
Experiments in computer science Emmanuel Jeannot INRIA – LORIA Aleae Kick-off meeting April 1st 2009.
Middleware for FIs Apeego House 4B, Tardeo Rd. Mumbai Tel: Fax:
An Ontological Framework for Web Service Processes By Claus Pahl and Ronan Barrett.
SAS ‘05 Reducing Software Security Risk through an Integrated Approach David P. Gilliam, John D. Powell Jet Propulsion Laboratory, California Institute.
Streamflow - Programming Model for Data Streaming in Scientific Workflows Chathura Herath.
National Aeronautics and Space Administration Jet Propulsion Laboratory March 31, 2009 CSCI 578 Course Project: Rearchitecting Scientific Code March 31,
George Goulas, Christos Gogos, Panayiotis Alefragis, Efthymios Housos Computer Systems Laboratory, Electrical & Computer Engineering Dept., University.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Using Social Network Analysis Methods for the Prediction of Faulty Components Gholamreza Safi.
1 Advanced Collaborative Environments Kris Brown Carmel Conaty Johnny Medina.
BPEL Business Process Engineering Language A technology used to build programs in SOA architecture.
 Copyright 2005 Digital Enterprise Research Institute. All rights reserved. Enabling Components Management and Dynamic Execution Semantic.
MODEL-BASED SOFTWARE ARCHITECTURES.  Models of software are used in an increasing number of projects to handle the complexity of application domains.
Centre d’Excellence en Technologies de l’Information et de la Communication Evolution dans la gestion d’infrastructure de type Cloud (SDI)
Csci 490 / Engr 596 Special Topics / Special Projects Software Design and Scala Programming Spring Semester 2010 Lecture Notes.
Software Architectural Views By the end of this lecture, you will be able to: list and describe the views in the 4+1 view model of software architecture.
Software Development in HPC environments: A SE perspective Rakan Alseghayer.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Earth System Curator and Model Metadata Discovery and Display for CMIP5 Sylvia Murphy and Cecelia Deluca (NOAA/CIRES) Hannah Wilcox (NCAR/CISL) Metafor.
An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.
Engr 691 Special Topics in Engineering Science Software Architecture Spring Semester 2004 Lecture Notes.
1 SAS ‘04 Reducing Software Security Risk through an Integrated Approach David P. Gilliam and John D. Powell.
Rule Engine for executing and deploying the SAGE-based Guidelines Jeong Ah Kim', Sun Tae Kim 2 ' Computer Education Department, Kwandong University, KOREA.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
May 7-8, 2007ICVCI 2007 RTP Autonomic Approach to IT Infrastructure Management in a Virtual Computing Lab Environment H. Abdel SalamK. Maly R. MukkamalaM.
George Edwards Computer Science Department Center for Systems and Software Engineering University of Southern California
Comparison of The Workflow Management Systems Bizagi, ProcessMaker, and Joget Mohamed Zeinelabdeen Abdelgader [1], Omer Salih Dawood [2], Mohamed Elhafiz.
V7 Foundation Series Vignette Education Services.
1 Visual Computing Institute | Prof. Dr. Torsten W. Kuhlen Virtual Reality & Immersive Visualization Till Petersen-Krauß | GUI Testing | GUI.
Considerations for a Modern Distribution Grid
Model-Driven Analysis Frameworks for Embedded Systems
Software Connectors – A Taxonomy Approach
San Diego Supercomputer Center University of California, San Diego
Automated Analysis and Code Generation for Domain-Specific Models
Presentation transcript:

National Aeronautics and Space Administration Jet Propulsion Laboratory March 17, 2009 Workflow Orchestration: Conducting Science Efficiently on the Grid March 17, 2009 David Woollard NASA Jet Propulsion Lab 4800 Oak Grove Drive Pasadena, CA Dept. of Computer Science University of Southern California Los Angeles, CA 90089

National Aeronautics and Space Administration Jet Propulsion Laboratory Validating Computational Science Computational science, like all science, requires validation Validation comes in two forms: –Scaling (in data and computation) –Independent replication Both forms require significant computational resources –Grid is a promising resource Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Vision of the Grid Workflow Orchestration - March 17, Center for Software and Systems Engineering University of Southern California Science Data Systems Section NASA Jet Propulsion Laboratory Aerospace Corporation Northrop Grumman Boeing Corporation Lawrence Livermore National Lab Columbia Supercomputing Center NASA Ames Research Center Supercomputing Center University of California San Diego

National Aeronautics and Space Administration Jet Propulsion Laboratory Vision of the Grid Workflow Orchestration - March 17, Like the power grid, the computational Grid should scale to the demands of individual users.

National Aeronautics and Space Administration Jet Propulsion Laboratory Workflows orchestrate processes on the Grid tasks, data, and rulesWorkflows are a processing model that incorporate tasks, data, and rules. tasks data rulesWorkflow management systems execute tasks on the Grid using data once the task’s dependencies are satisfied based on rules. Workflow-Based Specification Workflow Orchestration - March 17, Tas k 1 Tas k 2 Tas k 3 Tas k 4 Tas k 5

National Aeronautics and Space Administration Jet Propulsion Laboratory Scaling the Experiment Workflow Orchestration - March 17, OtherInstitutions Task 1Task 1 Task 2Task 2 Task 3Task 3 Task 4Task 4 Task 5Task 5

National Aeronautics and Space Administration Jet Propulsion Laboratory Independent Replication Workflow Orchestration - March 17, Collaborator3 rd Party Task 1Task 1 Task 2Task 2 Task 3Task 3 Task 4Task 4 Task 5Task 5

National Aeronautics and Space Administration Jet Propulsion Laboratory Heterogeneous Environments Workflow Orchestration - March 17, LaboratoryInstitutionCo-laboratory Task 1Task 1 Task 2Task 2 Task 3Task 3 Task 4Task 4 Task 5Task 5 Task 1Task 1 Task 2Task 2 Task 3Task 3 Task 4Task 4 Task 5Task 5 Task 1Task 1 Task 2Task 2 Task 3Task 3 Task 4Task 4 Task 5Task 5 Workflow Engine 1 Grid Infrastructure 1 Workflow Engine 1 Grid Infrastructure 2 Workflow Engine 2 Grid Infrastructure 2 Collaborator3 rd Party

National Aeronautics and Space Administration Jet Propulsion Laboratory Research Challenge Scientific validation requires: –Scaling –Replication Existing technologies exhibit three challenges: –Require scientists to become engineers or vice versa –Existing workflow specifications entwine scientific and engineering concerns –Existing workflow specifications are not portable Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory A Model-Driven Approach Workflow Orchestration - March 17, Computation Independent Model Implementation Independent Model Implementation Workflow Model Domain-Specific Software Architecture Deployment

National Aeronautics and Space Administration Jet Propulsion Laboratory Agenda In the rest of this talk, we will cover: Models verses languages The role of software architecture Transforming workflows to domain-specific software architectures Performance Future Work and Conclusions Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory A Plethora of Workflow Languages Workflow Orchestration - March 17, Yu & Buyya presented a taxonomy [Yu & Buyya 05] –Based on workflow properties like model representation and scheduling policy –Illustration of divergence in the field As of last year, researchers such as Osterweil, et. al. [08] still advocated more advanced language features Considered a Grand Challenge [Gil, et al. 07]

National Aeronautics and Space Administration Jet Propulsion Laboratory Making Decisions in Design Space Existing workflow languages violate separation of concerns –Scientists should work in languages applicable to the design space, not the solution space –Engineers should not have to become scientists to be able to scale workflow-based systems If workflow languages become the realm of the scientist, how does the software engineer effect change? Workflow Orchestration - March 17, Manipulation of the system at the architectural level

National Aeronautics and Space Administration Jet Propulsion Laboratory Orchestration Through Connectors Lau, et al., have proposed exogenous connectors [Lau, et al. 06]. –encapsulate both control and data flow in a software system –can be hierarchically composed to simulate control flow Control can be managed through several constructs: –Sequence –Conditional –Branch & Bound Workflow Orchestration - March 17, AB AB C AB AB

National Aeronautics and Space Administration Jet Propulsion Laboratory Invoking Connectors Different Grid infrastructures interact with tasks in multiple ways [Woollard 08]: –Synchronous communication –Events –Web services Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Custom Handlers Workflow Orchestration - March 17, Control Flow Data Flow ExogenousConnector Invoking Connector Control Flow Data Flow Component Control Flow Data Flow Internal Logic Services Custom Handler

National Aeronautics and Space Administration Jet Propulsion Laboratory SWSA: A Domain Architecture Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Implementation Prism-MW, an architecturally-aware middleware –Components, Connectors, Topologies and Architecture are reified as first class elements Exogenous connectors, invoking connectors, and component wrappers around tasks are build with Prism Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Performance Studies Overhead induced in computation time and memory connectors [Woollard, et al. 09]. Workflow Orchestration - March 17, Impact of architectural deployment on computation [Woollard, et al. 09]. -Modified an existing time series workflow used at JPL -Deployed the system using OpenDAP and grid technology to co-locate data and computation Reduced typical analysis from 9+ hours to under 2 minutes

National Aeronautics and Space Administration Jet Propulsion Laboratory Deployment & Optimization In the future, we plan to utilize advanced architectural modeling and deployment analysis to guide software engineers in deployment strategy Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Conclusion Computational science requires validation Existing grid and workflow technologies are promising, but lack support for scaling and replication across heterogeneous Grid environments A model-driven approach allows scientists to manipulate workflow specifications, while software engineers can effect the transformed software architectures Workflow Orchestration - March 17,

National Aeronautics and Space Administration Jet Propulsion Laboratory Thank You [Yu & Buyya 05] Yu, J. and Buyya, R. A Taxonomy of Workflow Management Systems for Grid Computing. Journal of Grid Computing 3(3-4): pp [Osterweil 08] Osterweil, L., et. al. Experience in using a process language to define scientific workflow and generate dataset provenance. In Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of software engineering, Atlanta, Georgia [Gil, et. al. 07] Gil, Y., et. al. Examining the Challenges of Scientific Workflows. IEEE Computer 40(12): pp [Lau, et. al. 06] Lau, K., et. al. A Software Component Model and its Preliminary Formalisation. In F.S. de Boer et al., editors, Proceedings of Fourth International Symposium on Formal Methods for Components and Objects, Lecture Notes in Computer Science 4111(1-21) [Woollard 08] Woollard, D. Supporting the Engineering Aspects of e-Science Through Workflow Services. Proceedings of the First Brazilian e-Science Workshop, Campinas, Brazil, [Woollard, et. al. 089 Woollard, D. et. al. Injecting Software Architectural Constraints into Legacy Scientific Applications. To appear in Proceedings of the ICSE 2009 Workshop on Software Engineering for Computational Science and Engineering. Vancouver, Canada, Workflow Orchestration - March 17,