Workshop on Workflows in Support of Large-Scale Science June 20, Paris, France In conjunction with HPDC 2006 HPDC 2006 www.isi.edu/works06 Ewa Deelman,

Slides:



Advertisements
Similar presentations
Delivering User Needs: A middleware perspective Steven Newhouse Director.
Advertisements

Joint CASC/CCI Workshop Report Strategic and Tactical Recommendations EDUCAUSE Campus Cyberinfrastructure Working Group Coalition for Academic Scientific.
Network Science and Engineering (NetSE) Research Agenda: v1.0 5 th GENI Engineering Conference Seattle, WA 21 July 2009 Ellen Zegura, Georgia Tech.
ACCI TASK FORCES Update CASC September 22, Task Force Introduction Timeline months or less from June 2009 Led by NSF Advisory Committee on.
SACNAS, Sept 29-Oct 1, 2005, Denver, CO What is Cyberinfrastructure? The Computer Science Perspective Dr. Chaitan Baru Project Director, The Geosciences.
The ADAMANT Project: Linking Scientific Workflows and Networks “Adaptive Data-Aware Multi-Domain Application Network Topologies” Ilia Baldine, Charles.
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
Pegasus: Mapping complex applications onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
EInfrastructures (Internet and Grids) - 15 April 2004 Sharing ICT Resources – “Think Globally, Act Locally” A point-of-view from the United States Mary.
Knowledge Environments for Science: Representative Projects Ian Foster Argonne National Laboratory University of Chicago
Computing in Atmospheric Sciences Workshop: 2003 Challenges of Cyberinfrastructure Alan Blatecky Executive Director San Diego Supercomputer Center.
April 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr Area Director for Science Gateways San Diego Supercomputer Center
Welcome to CW 2007!!!. The Condor Project (Established ‘85) Distributed Computing research performed by.
1 USC Information Sciences Institute Yolanda Gil AAAI-08 Tutorial July 13, 2008 Part VII: Future Challenges in Computational Workflows and.
NSF ACCI (Advisory Committee for CyberInfrastructure) Taskforce Update - CASC Meeting 23 March 2010 Craig Stewart – Executive Director,
August 2007 Advancing Scientific Discovery through TeraGrid Adapted from S. Lathrop’s talk in SC’07
SAN DIEGO SUPERCOMPUTER CENTER NUCRI Advisory Board Meeting November 9, 2006 Science Gateways on the TeraGrid Nancy Wilkins-Diehr TeraGrid Area Director.
Large-Scale Science Through Workflow Management Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
TeraGrid Overview Cyberinfrastructure Days Internet2 10/9/07 Mark Sheddon Resource Provider Principal Investigator San Diego Supercomputer Center
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Integrating Cloud & Cyberinfrastructure Manish Parashar NSF Cloud and Autonomic Computing Center Rutgers, The State University of New Jersey.
Virtual Organizations, CyberInfrastructure and Campuses.
MAEviz as a MAE/NCSA Cyberenvironment Partnership Jim Myers Associate Director NCSA Cyberenvironments.
Pegasus: Planning for Execution in Grids Ewa Deelman Information Sciences Institute University of Southern California.
1 USC Information Sciences Institute Yolanda Gil AAAI-08 Tutorial July 13, 2008 AAAI-08 Tutorial on Computational Workflows for Large-Scale.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
Pegasus: Mapping Scientific Workflows onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
DV/dt - Accelerating the Rate of Progress towards Extreme Scale Collaborative Science DOE: Scientific Collaborations at Extreme-Scales:
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
CyberInfrastructure workshop CSG May Ann Arbor, Michigan.
1 Computing Challenges for the Square Kilometre Array Mathai Joseph & Harrick Vin Tata Research Development & Design Centre Pune, India CHEP Mumbai 16.
Pegasus: Running Large-Scale Scientific Workflows on the TeraGrid Ewa Deelman USC Information Sciences Institute
National Science Foundation Revolutionizing science and engineering research though cyberinfrastructure by David G. Messerschmitt Member, NSF Blue Ribbon.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
The Minority-Serving Institutions (MSI) Cyberinfrastructure (CI) Institute [MSI C(I) 2 ] Providing a scalable mechanism for developing a CI-enabled science.
The Biomedical Informatics Research Network Carl Kesselman BIRN Principal Investigator Professor of Industrial and Systems Engineering Information Sciences.
Pegasus: Mapping complex applications onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
ICCS WSES BOF Discussion. Possible Topics Scientific workflows and Grid infrastructure Utilization of computing resources in scientific workflows; Virtual.
The Astronomy challenge: How can workflow preservation help? Susana Sánchez, Jose Enrique Ruíz, Lourdes Verdes-Montenegro, Julian Garrido, Juan de Dios.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Applications and Requirements for Scientific Workflow Introduction May NSF Geoffrey Fox Indiana University.
The Minority-Serving Institutions (MSI) Cyberinfrastructure (CI) Institute [MSI C(I) 2 ] Providing a scalable mechanism for developing a CI-enabled science.
Scientific Workflow systems: Summary and Opportunities for SEEK and e-Science.
Planning Ewa Deelman USC Information Sciences Institute GriPhyN NSF Project Review January 2003 Chicago.
Pegasus: Planning for Execution in Grids Ewa Deelman, Carl Kesselman, Gaurang Mehta, Gurmeet Singh, Karan Vahi Information Sciences Institute University.
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
2005 GRIDS Community Workshop1 Learning From Cyberinfrastructure Initiatives Grid Research Integration Development & Support
U.S. Grid Projects and Involvement in EGEE Ian Foster Argonne National Laboratory University of Chicago EGEE-LHC Town Meeting,
Applications and Requirements for Scientific Workflow May NSF Geoffrey Fox Indiana University.
2nd Texas A&M Big Data Workshop Development of “Big Data” Scientific Workflow Management Tools for the Materials Genome Initiative: “Materials Galaxy”
Management & Coordination Paul Avery, Rick Cavanaugh University of Florida Ian Foster, Mike Wilde University of Chicago, Argonne
National Science Foundation Blue Ribbon Panel on Cyberinfrastructure Summary for the OSCER Symposium 13 September 2002.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
NA-MIC National Alliance for Medical Image Computing UCSD / BIRN Coordinating Center NAMIC Group Site PI: Mark H. Ellisman Site Project.
© 2006 Open Grid Forum Geoffrey Fox OGF Workshop eScience 2006 Royal Tropical Institute Amsterdam December OGF eScience Function.
Workflow-Driven Science using Kepler Ilkay Altintas, PhD San Diego Supercomputer Center, UCSD words.sdsc.edu.
1 Artemis: Integrating Scientific Data on the Grid Rattapoom Tuchinda Snehal Thakkar Yolanda Gil Ewa Deelman.
1 USC Information Sciences InstituteYolanda Gil AAAI-08 Tutorial July 13, 2008 Part IV Workflow Mapping and Execution in Pegasus (Thanks.
All Hands Meeting 2005 BIRN-CC: Building, Maintaining and Maturing a National Information Infrastructure to Enable and Advance Biomedical Research.
1 Open Science Grid: Project Statement & Vision Transform compute and data intensive science through a cross- domain self-managed national distributed.
Human Social Dynamics: Interoperability Strategies for Scientific Cyberinfrastructure: The Comparative Interoperability Project ( ) initiates a.
RDA US Science workshop Arlington VA, Aug 2014 Cees de Laat with many slides from Ed Seidel/Rob Pennington.
Cloudy Skies: Astronomy and Utility Computing
What is the National Data Service?
Collaborations and Interactions with other Projects
Pegasus and Condor Gaurang Mehta, Ewa Deelman, Carl Kesselman, Karan Vahi Center For Grid Technologies USC/ISI.
1st International Conference on Semantics, Knowledge and Grid
Ewa Deelman University of Southern California
The Minority-Serving Institutions (MSI) Cyberinfrastructure (CI) Institute [MSI C(I)2] Providing a scalable mechanism for developing a CI-enabled science.
Presentation transcript:

Workshop on Workflows in Support of Large-Scale Science June 20, Paris, France In conjunction with HPDC 2006 HPDC Ewa Deelman, Chair

Ewa Deelman, USC Information Sciences Institute Panel on Workflow as the Methodology of Science Ewa Deelman, USC/ISI Summary of NSF Workshop Shantenu Jha, University College of London Application perspective David De Roure, University of Southampton Provenance Ian Foster, ANL & Univ. of Chicago Lessons from current Science Grids John Ibbotson, IBM Hursley Web service and business workflow

Ewa Deelman, USC Information Sciences Institute NSF Workshop on the Challenges of Scientific Workflows, 5/2006 Yolanda Gil and Ewa Deelman (co-chairs) Science is undergoing a significant paradigm change  Entire communities are collaborating and pursuing joint goals (NEES, SCEC, NVO, GriPhyN, BIRN)  Instruments, hardware, software, and other resources shared (TeraGrid, OSG, NMI)  Data shared and processed at large scales In these shared environments, workflows are emerging as a useful paradigm to:  Represent complex analyses  Manage computation  Capture data provenance Scientific collaborations using workflow paradigm for analysis (SCEC, NVO, LIGO, SEEK, myGrid, others)

Ewa Deelman, USC Information Sciences Institute Goals: Explore workflow challenges from a variety of perspectives Mark Ackerman, University of Michigan Guy Almes, NSF OCI Ilkay Altintas, SDSC Roger Barga, Microsoft Francisco Curbera, IBM Ewa Deelman, USC Information Sciences Institute Mark Ellisman, UC San Diego Constantinos Evangelinos, MIT Thomas Fahringer, University of Innsbruck Juliana Freire, University of Utah Ian Foster, University of Chicago & ANL Geoffrey Fox, Indiana University Dennis Gannon, Indiana University Yolanda Gil, USC Information Sciences Institute Carole Goble, University of Manchester Alexander Gray, Georgia Tech Jeffrey Grethe, UC San Diego Jim Hendler, University of Maryland Carl Kesselman, USC Information Sciences Institute Craig Knoblock, USC Information Sciences Institute Chuck Koelbel, Rice University Miron Livny, University of Wisconsin Luc Moreau, University of Southampton Jim Myers, NCSA Karen Myers, SRI International Walt Scacchi, University of California Irvine Ed Seidel, LSU Ashish Sharma, Ohio State University Amit Sheth, University of Georgia Alex Szalay, John Hopkins University Physics, Astronomy Gregor Von Laszewski, ANL Maria Zemankova, NSF IIS (PM for this workshop)

Ewa Deelman, USC Information Sciences Institute Four Topics of Discussion Applications and requirements  What are the requirements of future applications? What new capabilities are needed to support emerging applications? Dynamic workflows and user steering  What are the challenges in supporting dynamic workflows that need to evolve over time as execution data become available? What kinds of techniques can support incremental and dynamic workflow evolution due to user steering? System-level management  What are the challenges in supporting large-scale workflows in a scalable and robust way? What changes are needed in existing software infrastructure? What new research needs to be done to develop better workflow management systems? Data and workflow descriptions  How can workflow descriptions be improved to support usability and scalability? How to describe data produced as part of the workflows? What provenance information needs to be tracked to support scalable data and workflow discovery?

Ewa Deelman, USC Information Sciences Institute Summary of the Workshop Workflows are recipes for Cyberinfrastructure Science has exploratory and evolutionary nature  Need to support this dynamic nature  Workflows are key enabling technologies  Dynamic evolving user-steered workflows are the norm in science Reproducibility, an important requirement in scientific method  Scientific and engineering reproducibility  Increasingly challenging to capture Provenance is key to reproducibility  Provenance and process description are an end-to-end process  At all levels of abstraction -- science domain level and system level Stable and common software platforms  Infrastructure will evolve, stability is crucial for reproducibility

Ewa Deelman, USC Information Sciences Institute Workshop Recommendations Embed CS experts in the application domains Incorporate social sciences, organizational sciences and business workflow communities  Analyze and capture of scientific process Incorporate HCI research: how to present execution, options, recommendations and trails of what happened Longer term, stable (5+ year) collaborations + programs  Based on experience in current CS/Science collaboratories Capture best practices – provide coordination framework Need a framework where various workflow tools can interoperate  More work needs to be done on Workflow System: construction, planning, execution, debugging Need explicit representations for workflows ( at different levels of abstraction) Foster follow-up cross-cutting workshops and meetings Just as scientists will be in a crisis if the data they collect is lost, they will be in a crisis if the analysis---the workflows will be lost