Ian Foster Argonne National Laboratory University of Chicago Univa Corporation Service-Oriented Science Scaling Science Services APAC Conference, September.

Slides:



Advertisements
Similar presentations
Abstraction Layers Why do we need them? –Protection against change Where in the hourglass do we put them? –Computer Scientist perspective Expose low-level.
Advertisements

The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
ASCR Data Science Centers Infrastructure Demonstration S. Canon, N. Desai, M. Ernst, K. Kleese-Van Dam, G. Shipman, B. Tierney.
High Performance Computing Course Notes Grid Computing.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
Data Grids Darshan R. Kapadia Gregor von Laszewski
Ian Foster Computation Institute Argonne National Lab & University of Chicago Service-Oriented Science: Scaling eScience Impact.
Service Oriented Grid Architecture Hui Li ICT in Business Colloquium, LIACS Mar 1 st, 2006 Note: Part of this presentation is based on Dr. Ian Foster’s.
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Education in the Science 2.0 Era.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Cyberinfrastructure and the Role of Grid Computing Or, “Science 2.0”
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
The Global Storage Grid Or, Managing Data for “Science 2.0” Ian Foster Computation Institute Argonne National Lab & University of Chicago.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Seminar Grid Computing ‘06 Hui Li Sep 18, Overview Brief Introduction Presentations –Architecture –Functionality/Middleware –Applications Projects.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
4b.1 Grid Computing Software Components of Globus 4.0 ITCS 4010 Grid Computing, 2005, UNC-Charlotte, B. Wilkinson, slides 4b.
GT4 Introductory and Advanced Practicals Rachana Ananthakrishnan, Charles Bacon, Lisa Childers Argonne National Laboratory University of Chicago.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Service-Oriented Science: Scaling eScience Impact Or, “Science 2.0”
Globus Computing Infrustructure Software Globus Toolkit 11-2.
Designing and Building Grid Services GGF9 Chicago October 8, 2003 Organizers: Ian Foster, Marty Humphrey, Kate Keahey, Norman Paton, David Snelling.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
1 Globus Developments Malcolm Atkinson for OMII SC 18 th January 2005.
Globus 4 Guy Warner NeSC Training.
Kate Keahey Argonne National Laboratory University of Chicago Globus Toolkit® 4: from common Grid protocols to virtualization.
1 The Application-Infrastructure Gap Dynamic and/or Distributed Applications A 1 B Shared Distributed Infrastructure.
Core Grid Functions: A Minimal Architecture for Grids William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (www-itg.lbl.gov/~wej)
Virtual Infrastructure in the Grid Kate Keahey Argonne National Laboratory.
TeraGrid Information Services John-Paul “JP” Navarro TeraGrid Grid Infrastructure Group “GIG” Area Co-Director for Software Integration and Information.
Ian Foster Argonne National Laboratory University of Chicago Univa Corporation Grid Dynamics.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
OPEN GRID SERVICES ARCHITECTURE AND GLOBUS TOOLKIT 4
GRAM: Software Provider Forum Stuart Martin Computational Institute, University of Chicago & Argonne National Lab TeraGrid 2007 Madison, WI.
DISTRIBUTED COMPUTING
CoG Kit Overview Gregor von Laszewski Keith Jackson.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Globus and Service Oriented Architecture.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Grid Services Overview & Introduction Ian Foster Argonne National Laboratory University of Chicago Univa Corporation OOSTech, Baltimore, October 26, 2005.
Service Oriented Science Ian Foster Argonne National Laboratory University of Chicago Univa Corporation.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
The Anatomy of the Grid Mahdi Hamzeh Fall 2005 Class Presentation for the Parallel Processing Course. All figures and data are copyrights of their respective.
Major Grid Computing Initatives Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer Science The.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
OGSA Hauptseminar: Data Grid Thema 2: Open Grid Service Architecture
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Authors: Ronnie Julio Cole David
Brain Meets Brawn: Why Grid and Agents Need Each Other Ian Foster Argonne National Laboratory University of Chicago Globus Alliance In collaboration with.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Cyberinfrastructure and the Role of Grid Computing Or, “Science 2.0”
LEGS: A WSRF Service to Estimate Latency between Arbitrary Hosts on the Internet R.Vijayprasanth 1, R. Kavithaa 2,3 and Raj Kettimuthu 2,3 1 Coimbatore.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Cole David Ronnie Julio. Introduction Globus is A community of users and developers who collaborate on the use and development of open source software,
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Grid Enabling Open Science.
Ian Foster Computation Institute Argonne National Lab & University of Chicago Application Hosting Services — Enabling Science 2.0 —
Parallel Computing Globus Toolkit – Grid Ayaka Ohira.
Grid Services for Digital Archive Tao-Sheng Chen Academia Sinica Computing Centre
Globus —— Toolkits for Grid Computing
Argonne National Laboratory
Grid Services B.Ramamurthy 12/28/2018 B.Ramamurthy.
Large Scale Distributed Computing
Resource and Service Management on the Grid
Presentation transcript:

Ian Foster Argonne National Laboratory University of Chicago Univa Corporation Service-Oriented Science Scaling Science Services APAC Conference, September 28, 2005 iGrid Workshop, September 27, 2005

2 Two Questions l How do we scale the number of scientists benefiting from computational techniques? l What should be the role of infrastructure providers in enabling this scaling?

3 Computational Science  Increasingly sophisticated computational approaches DMonolithic programs, databases u Inflexible & hard to evolve u Mismatch with reality of diverse & distributed teams, resources, & approaches Computation joins theory & experiment as a third mode of scientific enquiry Genbank Program & data PC or Supercomputer

4 Decompose over the Network l Clients can then integrate dynamically u Select & compose services u Select “best of breed” providers u Publish result as a new service l Need not know implementation details l Note: complements, not replaces, HPC Service-Oriented Architecture

5 For Example: Virtual Observatories Surveys Observatories Missions Survey and Mission Archives Digital libraries Numerical Sim’s Sloan vs. 2MASS Brown dwarf candidates

6 Having Decomposed, Integrate l For example u Registries u Value-added services u Workflows l Issues u Description u Discovery u Composition u Adaptation & evolution u Qualities of service: security, performance, reliability, … Data Archives Analysis tools Discovery tools Users

7 Example Value Added Service: PUMA PUMA Knowledge Base Information about proteins analyzed against ~2 million gene sequences Analysis on Grid Involves millions of BLAST, BLOCKS, and other processes Natalia Maltsev et al.

8 SOA= Silo-Oriented Architecture? l What about dynamic behaviors? u Time-varying load u Dynamically instantiated services l What about operating costs? u Software deployment & maintenance u Security & other concerns Week Operating Services or?

9 We Need to Decompose in Two Dimensions Horizontal

10 We Need to Decompose in Two Dimensions Horizontal Vertical

11 IPC Dispatcher Globus Provision New Worker Process IPC Server 2 Decomposition Enables On-Demand Provisioning l Aggregate resources l Deliver to services l Separate production & consumption l Issues u Discovery u Composition u Qualities of service SAP GlobusWorld Demo IPC = Internet Pricing Configurator

12  Cardiff AEI/Golm Birmingham The Globus-Based LIGO Data Grid Replicating >1 Terabyte/day to 8 sites >30 million replicas so far MTBF = 1 month LIGO Gravitational Wave Observatory

13 Decomposition Enables Separation of Concerns & Roles User Service Provider “Provide access to data D at S1, S2, S3 with performance P” Resource Provider “Provide storage with performance P1, network with P2, …” D S1 S2 S3 D S1 S2 S3 Replica catalog, User-level multicast, … D S1 S2 S3

14 Scaling Up “Sometimes through heroism you can make something work. However, understanding why it worked, abstracting it, making it a primitive is the key to getting to the next order of magnitude of scale.” Robert Calderbank  We want to scale the number, robustness, & performance of services

15 Identifying Primitives: (1) Taking Services Seriously l Model the world as a collection of services u Computations, computers, instruments, storage, data, communities, agreements, … l Focus on what these things have in common l E.g., lifecycle management u Negotiation, deployment/creation, modeling, monitoring, management, termination l E.g., security u Authentication, authorization, audit, …  Web Services-based Grid infrastructure I. Foster, S. Tuecke, Describing the Elephant: The Many Faces of IT as Service, ACM Queue, 2005

16 Identifying Primitives: (2) Interface Specifications Web services (WSDL, SOAP, WS-Security, WS-ReliableMessaging, …) WS-Resource Framework & WS-Notification* (Resource identity, lifetime, inspection, subscription, …) WS-Agreement (Agreement negotiation) WS Distributed Management (Lifecycle, monitoring, …) Applications of the framework (Compute, network, storage provisioning, job reservation & submission, data management, application service QoS, …) *WS-Transfer, WS-Enumeration, WS-Eventing, WS-Management define similar functions Foster, Czajkowski, Frey, et al., From OGSI to WSRF, Proc. IEEE, 93(3)

17 Identifying Primitives: (3) Open Source Implementation Data Mgmt Security Common Runtime Execution Mgmt Info Services GridFTP Authentication Authorization Reliable File Transfer Data Access & Integration Grid Resource Allocation & Management Index Java Runtime Community Authorization Data Replication Community Scheduling Framework Delegation Replica Location Trigger C Runtime Python Runtime WebMDS Workspace Management Grid Telecontrol Protocol Credential Mgmt I. Foster, Globus Toolkit Version 4: Software for Service-Oriented Systems, LNCS 3779, 2-13, 2005

18 Jobs (2004) Open Science Grid  50 sites (15,000 CPUs) & growing  400 to >1000 concurrent jobs  Many applications + CS experiments; includes long-running production operations  Up since October 2003; few FTEs central ops

19 Virtual OSG Clusters OSG cluster Xen hypervisors TeraGrid cluster OSG

20 Dynamic Service Deployment Community A Community Z … Community scheduling logic Data distribution Community management Science services... Requirements: Community control Persistence Resource guarantees Non- interference

21 Summary l How do we scale the number of scientists benefiting from computational techniques?  Construct powerful science services  Simplify construction by decomposing roles: content, function, resource l What should be the role of infrastructure providers in enabling this scaling?  Service providers for communities wanting to deliver content  Resource providers for service providers wanting to deliver services

22 Domain-independentDomain-dependent Content Function Resources Experimental apparatus Servers, storage, networks Metadata catalog Data archive Simulation server Certificate authority Simulation code Expt design Telepresence monitor Simulation code Expt output Electronic notebook Portal server Service-Oriented Science: Scaling by Separating Concerns Hosted by Enabled by I. Foster, Service-Oriented Science, Science, 308, May 6, 2005

23 Acknowledgments l NSF, DOE, NASA, IBM for financial support l Numerous fine colleagues at Argonne, U.Chicago, USC/ISI, and elsewhere l In particular: Carl Kesselman Steve Tuecke Kate Keahey & Bill Allcock, Ann Chervenak, Ewa Deelman, Jennifer Schopf, Mike Wilde

24 For More Information l Globus Alliance: l Papers: For those at IGrid: Carl Kesselman’s Master Class (Thursday) For those at APAC: Globus Toolkit Tutorial (Thursday, Friday)