Applications and Requirements for Scientific Workflow Introduction May 1 2006 NSF Geoffrey Fox Indiana University.

Slides:



Advertisements
Similar presentations
GRADD: Scientific Workflows. Scientific Workflow E. Science laboris Workflows are the new rock and roll of eScience Machinery for coordinating the execution.
Advertisements

Crucial Patterns in Service- Oriented Architecture Jaroslav Král, Michal Žemlička Charles University, Prague.
Help communities share knowledge more effectively across the language barrier Automated Community Content Editing PorTal.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
HP Quality Center Overview.
Lecture # 2 : Process Models
Workshop on Workflows in Support of Large-Scale Science June 20, Paris, France In conjunction with HPDC 2006 HPDC Ewa Deelman,
Software Reuse SEII-Lecture 28
Transparent Robustness in Service Aggregates Onyeka Ezenwoye School of Computing and Information Sciences Florida International University May 2006.
Service Oriented Architecture Concepts March 27, 2006 Chris Armstrong
Distributed components
© 2006 IBM Corporation IBM Software Group Relevance of Service Orientated Architecture to an Academic Infrastructure Gareth Greenwood, e-learning Evangelist,
Distributed Systems Architectures
DCS Architecture Bob Krzaczek. Key Design Requirement Distilled from the DCS Mission statement and the results of the Conceptual Design Review (June 1999):
1 Dr. Frederica Darema Senior Science and Technology Advisor NSF Future Parallel Computing Systems – what to remember from the past RAMP Workshop FCRC.
Software Engineering for Safety : A Roadmap Presentation by: Manu D Vij CS 599 Software Engineering for Embedded Systems.
Connecting Diverse Web Search Facilities Udi Manber, Peter Bigot Department of Computer Science University of Arizona Aida Gikouria - M471 University of.
© 2006 IBM Corporation SOA on your terms and our expertise Discovering the Value of SOA SOA In Action SOA & End-2-End Business Driven Development using.
Course Instructor: Aisha Azeem
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Help!!! Some Future Semantic Grid Activities CrisisGrid and ServoGrid PTLIU Laboratory for Community Grids Geoffrey Fox Computer Science, Informatics,
1 Autonomic Computing An Introduction Guenter Kickinger.
1 Yolanda Gil Information Sciences InstituteJanuary 10, 2010 Requirements for caBIG Infrastructure to Support Semantic Workflows Yolanda.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Scientific Workflow reusing and long term big data preservation Salima Benbernou Université Paris Descartes Project.
Introduction and Overview Questions answered in this lecture: What is an operating system? How have operating systems evolved? Why study operating systems?
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
DISTRIBUTED COMPUTING
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
Service-enabling Legacy Applications for the GENIE Project Sofia Panagiotidi, Jeremy Cohen, John Darlington, Marko Krznarić and Eleftheria Katsiri.
Cluster Reliability Project ISIS Vanderbilt University.
material assembled from the web pages at
Architecting Web Services Unit – II – PART - III.
SAMANVITHA RAMAYANAM 18 TH FEBRUARY 2010 CPE 691 LAYERED APPLICATION.
Taverna and my Grid Open Workflow for Life Sciences Tom Oinn
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
KMS Products By Justin Saunders. Overview This presentation will discuss the following: –A list of KMS products selected for review –The typical components.
4.2.1 Programming Models Technology drivers – Node count, scale of parallelism within the node – Heterogeneity – Complex memory hierarchies – Failure rates.
1 USC Information Sciences Institute Yolanda Gil AAAI-08 Tutorial July 13, 2008 AAAI-08 Tutorial on Computational Workflows for Large-Scale.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
ArcGIS Data Reviewer: An Introduction
Illustrations and Answers for TDT4252 exam, June
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Rational Unified Process Fundamentals Module 7: Process for e-Business Development Rational Unified Process Fundamentals Module 7: Process for e-Business.
ISERVOGrid Architecture Working Group Brisbane Australia June Geoffrey Fox Community Grids Lab Indiana University
Chapter 6 – Architectural Design Lecture 1 1Chapter 6 Architectural design.
Information Integration BIRN supports integration across complex data sources – Can process wide variety of structured & semi-structured sources (DBMS,
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
Partnerships in Innovation: Serving a Networked Nation Grid Technologies: Foundations for Preservation Environments Portals for managing user interactions.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
OGCE Workflow and LEAD Overview Suresh Marru, Marlon Pierce September 2009.
Ocean Observatories Initiative OOI Cyberinfrastructure Life Cycle Objectives Review January 8-9, 2013 Scientific Workflows for OOI Ilkay Altintas Charles.
Directions in eScience Interoperability and Science Clouds June Interoperability in Action – Standards Implementation.
Technische Universität München © Prof. Dr. H. Krcmar An Ontology-based Platform to Collaboratively Manage Supply Chains Tobias Engel, Manoj Bhat, Vasudhara.
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
The Development Process of Web Applications
Architecting Web Services
Architecting Web Services
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Chapter 5 Architectural Design.
Cyberinfrastructure and PolarGrid
AIMS Equipment & Automation monitoring solution
Chapter 5 Architectural Design.
GGF10 Workflow Workshop Summary
Presentation transcript:

Applications and Requirements for Scientific Workflow Introduction May NSF Geoffrey Fox Indiana University

Major Themes What is different now and why –Scientific workflow is in realm of possibility now What are the application requirements rather than CS requirements –Prioritize, identify new issues, what old requirements have been satisfied Ground these in scenarios or in application descriptions that lead to these requirements Phrase as transformative research – does term “scientific workflow” conjure up the innovative future or perhaps a bureaucratic past?

Applications Extreme weather (LEAD) Bioinformatics (myGrid, BIRN); high throughput screening Virtual Observatory in Astronomy Particle Physics Generic Data Analysis Earthquake Science Ocean Data Assimilation Note most of following topics come from Computer Science and one needs to identify the driving higher level application requirement –Preserve mapping of application requirements to computer science topic

Topics – Application/Component Specific [Evangelinos] Support Ocean Data assimilation –Matlab, Fortran, Parallel simulations –Dataflow standards for “large I/O” –Metascheduling –Customization of execution parameters (provenance) [AGray] Need workflow components supporting powerful data analysis across fields [Gil] Support workflows needed in “open access” data accompanying scientific publication [Hendler] Support information management as well as computation

Topics - Overarching [Ellisman] What do we mean by workflow; the word means different things to different people; should we use different terms; need a better word (distributed scientific method) [JMyers, Barga] Categorize workflows and study use; evaluate and compare; identify common patterns [Discussion] What has changed? – data deluge is one critical change; is data a curse or a blessing [Ellisman] What is the “scientific method” (versus “Google method”) and its implication for workflow [Barga] What’s wrong with commercial solutions [Laszewski] Support common Grid patterns [Fox] Build benchmark set analogous to NAS in parallel computing [Fahringer] Include all costs (e.g. Web Service security, SOAP) in performance models [Deelman1] Support restructuring and planning for performance optimization [JMyers] Manage workflows like content [Ackerman, KMyers, Scacchi, Deelman2] Support full people (scientific process) workflow including social and organizational issues

Topics – Desired Qualities [Goble] Support users who are often under-resourced [Discussion] Multiple classes of users: “power” “common case” “education”; do users know what they want or not? –Note industry workflow captures WELL understood business processes [Several] Workflows will be re-used and shared [Ellisman] Enable reproducible science [Livny] Support high quality software [Laszewski] Balance between features, performance, and completeness. [Goble] Easily assemble workflows, find services and adapt previous workflows [Goble] The workflow has to reflect the science not the services invocation interface. [Goble] Automated workflow design is unlikely, unpopular, and undesirable as scientists know which services they want [Goble] Support all services that users want – whether they have a WSDL interface or not

Topics – Desired Features [Several] Workflows should be scalable, fault-tolerant, restartable, adaptive and repeatable; support multi-administration heterogeneous resources [Discussion] What do application scientists mean by above qualities? –[Livny] Why is size important? Complexity counts [Altintas] Support end (instruments) to end (interactive data analysis) science [Szalay] Interactive analysis as well as batch [Gannon] Workflows triggered by events without user interaction [Knoblock] Techniques for rapidly constructing models of new sources or services so that they can be rapidly and correctly integrated. [Knoblock] Support for dynamically integrating data across multiple data sources (i.e, databases or web services) that were not designed to work together. [Curbera] Support reasoning about correctness and composability [Livny] What is meaning of correctness and reproducibility (e.g. random numbers) [Gil] Support collections of workflows addressing common scientific questions [Discussion] Need to support workflows of heterogeneous workflows of different types; note industry worries about linking intra-enterprise systems across enterprises

Topics – Detailed Technology [Laszewski] Extend the workflow language through a set of core libraries such as fault tolerance and check pointing. [Goble] Need a higher level language than BPEL [Goble] There will be no one workflow language or workflow system, as there is no one word processor, programming language or operating system. [Ellisman, Livny] Role of portals (science gateways) as “common case” user interface versus distributed programming for “power user” [Altintas] User interface customizable for different domains [Deelman2] Virtual data to capture efficiently past and future actions [Curbera] Integrate internet-scale execution (REST) and enterprise service bus ESB; [Discussion] Web 2.0 like Google maps; Industry distinction between interoperability and implementation

Topics -- Provenance [Freire] Support computational (workflow) steering and provenance generation [Goble] Workflows must allow effective management of resultant data and provenance [Barga, Moreau] Define generally provenance of execution even though multiple paradigms [Altintas] Track provenance of workflow design, execution, and intermediate and final results [Gannon] Initialization of workflow components are dependent on each other [Seth] Design provenance supporting customization of adaptable workflows