An Evolutionary Approach to Realizing the Grid Vision

Slides:



Advertisements
Similar presentations
Service Oriented Architecture Reference Model
Advertisements

Fujitsu Laboratories of Europe © 2004 What is a (Grid) Resource? Dr. David Snelling Fujitsu Laboratories of Europe W3C TAG - Edinburgh September 20, 2005.
Chapter 19 – Service-oriented Architecture
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 12 Slide 1 Distributed Systems Design 2.
This product includes material developed by the Globus Project ( Introduction to Grid Services and GT3.
Latest techniques and Applications in Interprocess Communication and Coordination Xiaoou Zhang.
Comparison of the RMI and the socket APIs
NextGRID & OGSA Data Architectures: Example Scenarios Stephen Davey, NeSC, UK ISSGC06 Summer School, Ischia, Italy 12 th July 2006.
Ch 12 Distributed Systems Architectures
The Open Grid Service Architecture (OGSA) Standard for Grid Computing Prepared by: Haoliang Robin Yu.
Secure Systems Research Group - FAU Web Services Standards Presented by Keiko Hashizume.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
Web Services Description Language (WSDL) Jason Glenn CDA 5937 Process Coordination in Service and Computational Grids September 30, 2002.
Architecting Web Services Unit – II – PART - III.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Kemal Baykal Rasim Ismayilov
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
© 2006 Open Grid Forum BES 1.1 Andrew Grimshaw. © 2006 Open Grid Forum 2 OGF IPR Policies Apply “ I acknowledge that participation in this meeting is.
Franco Travostino and Admela Jukan jukan at uiuc.edu June 30, 2005 GGF 14, Chicago Grid Network Services Architecture (GNSA) draft-ggf-ghpn-netserv-2.
Software Architecture Patterns (3) Service Oriented & Web Oriented Architecture source: microsoft.
A service Oriented Architecture & Web Service Technology.
12. DISTRIBUTED WEB-BASED SYSTEMS Nov SUSMITHA KOTA KRANTHI KOYA LIANG YI.
Service Oriented Architecture (SOA) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
EGI towards H2020 Feedback (from survey)
WS-Agreement Overview for OGSA
SuperComputing 2003 “The Great Academia / Industry Grid Debate” ?
OGSA Session #1 Execution Management Services
OGSA Data Architecture WG Data Transfer Discussion
OGSA Evolving Jeff Nick IBM Fellow, VP On Demand Architecture.
Architecting Web Services
WEB SERVICES.
Towards GLUE Schema 2.0 Sergio Andreozzi INFN-CNAF Bologna, Italy
The Open Grid Service Architecture (OGSA) Standard for Grid Computing
As the last CC-list represents Maximum Compatible Classes we conclude:
Sessions 1 & 3: Published Document Session Summary
Grid Scheduling Architecture – Research Group
Hiro Kishimoto, OGSA-WG co-chair GGF16 in Athens February 13, 2006
Architecting Web Services
Unicore and the EM Profile
Grid Resource Allocation Agreement Protocol Working Group
Distribution and components
Enterprise Computing Collaboration System Example
Distributed web based systems
OGSA Data Architecture Scenarios
HPC Profile BOF Marvin Theimer Marty Humphrey Microsoft Corporation
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Status and Future Steps
Introduction to Web Services and SOA
Software Connectors – A Taxonomy Approach
Service-centric Software Engineering
Service-centric Software Engineering 1
Grid Computing Done by: Shamsa Amur Al-Matani.
Design Model Like a Pyramid Component Level Design i n t e r f a c d s
What is OGSA? GGF17 OGSA and Alternative Grid Architectures Panel
Distributed Systems through Web Services
Large Scale Distributed Computing
Outline Chapter 2 (cont) OS Design OS structure
Experiences in Deploying Services within the Axis Container
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Introduction to OGF Standards
Introduction to Web Services and SOA
Grid Systems: What do we need from web service standards?
Distributed Systems Architectures
GIN & the Standards Activity
Presentation transcript:

An Evolutionary Approach to Realizing the Grid Vision Marvin Theimer, Savas Parastatidis, Tony Hey Microsoft Corporation Marty Humphrey University of Virginia Geoffrey Fox Indiana University

OGSA and related working groups are already doing many things Broad architectural exploration Identification and work on horizontal areas Specific specs, such as BES and JSDL

Propose that they also start doing “vertical profiles” What’s the value of these? Precise descriptions of how to interoperably implement specific use cases “Close the design loop”: identify issues and holes in existing activities Suggest they should be done for all (major) use cases This talk covers a specific instance: Basic HPC use case: Remotely executing programs on compute resources managed by a batch job scheduling service.

Approach Taken Start from “first principles” Most likely to flush out issues Not intended to ignore or denigrate existing work Mapping onto existing work represents a follow-up step Evolutionary, incremental approach Start with a minimalist design Enables broadest level of interop among existing systems Lowest level of entry cost Incrementally add functionality in an evolutionary manner based on extensible, composable services Enables specific system instances to only implement/export the functionality they need Support for evolution is required in any case Avoid “silo” effect by Publishing early and often Public debates and cross-fertilization in the context of the overall OGSA architecture and related activities Emphasis on composability

Key design principles Composable, service-oriented approach: Design infrastructure to enable selective use of composable, extensible services Narrowly scoped interaction protocols: Easier to define, implement, combine, and evolve Align with existing infrastructure and industry initiatives: Don't reinvent the wheel and don’t get run over Avoid controversial solutions when possible Acknowledge and exploit boundaries: Don’t need global standards (at least initially) for things that can be bounded to local scopes

Basic grid architecture for HPC Use case: batch job scheduling Simplest base case: Job scheduling service for compute resources within an organization Specify what kinds of resources are needed; execute a program on resources obtained Incremental extensions: Provisioning (i.e. program installation & cleanup) Suspension Migration … Needed functionality: “Standard” distributed system infrastructure that is not Grid-specific Distributed communications (WS-I) Security Systems management Job scheduling support: i.e. a standard way of interacting with fungible resources Resource reservation Provisioning Execution of programs Data transfer support May be large in volume A means to discover available services

Leveraging existing standards Lots of non-Grid-specific things are already being pursued by the wider IT industry Distributed communications Security … Should leverage these industry initiatives Don’t reinvent the wheel Don’t get run over An example of something not needed for the basic HPC use case: Identity/naming support Basic DNS-based names have been very successful and should be used Higher-level naming is still an area in which there is not a single universally accepted solution Avoid the controversy by deferring the issue Expect it to be added as the HPC design evolves and expands

Job scheduling Principle of factorization: split resource reservation, provisioning, and execution into separate composable pieces. Examples of benefits gained: QoS and SLAs via repeated use of reserved resources Support for a variety of different provisioning scenarios: pre-installed, “file-copy”, “installer/uninstaller, … Amortization of provisioning overheads Support for interactive and debugging scenarios Interop among existing schedulers: Can’t expect existing schedulers to implement the superset of all their functionalities Principle of minimal, evolutionary design: Identify common functional denominators Define simplest possible “universal” scheduling base “Extension” profiles for well-defined, composable functionalities that clients and schedulers selectively interoperate on Two examples: “migration” and “suspension” of programs Must be careful to make framework plus extensions properly composable

Data transfer Base case requires the following things: Means of transferring data between two service endpoints Means of specifying how the transferred data should be interpreted Data format (e.g. files & directories vs. row-sets and tables) Where in the receiving party’s storage name space the data should be placed and how programs can access it These should be separate, composable capabilities Example of minimal, evolutionary design principle: no need for a global file system for the base case Not strictly necessary Has significant additional complexity; hence would reduce scope for interop Could eventually become a part of an evolved, extensible HPC design

Discovering HPC services Not strictly necessary: plenty of existing job scheduling systems that rely on clients simply “knowing” about them Well-known DNS name(s) Static config files Simple directory services enable significantly more powerful discovery paradigm: rendezvous point for dynamic discovery Examples of existing infrastructures already in use: UDDI and LDAP Proposed approach: Minimal base case should NOT include any discovery services Incremental addition: add support for one – or more – of the existing directory service infrastructures Reduced cost of entry Increased scope for interop: people are already using (and have invested in) these

System management Fundamental part of any distributed system, including Grid systems Simplest HPC base case: Job scheduling may occur between organizations System management restricted to scope of resources managed by a job scheduling service  Opaque to clients Don’t need a system management standard Support for virtual organizations will require global system management standards Virtual organizations should be part of an extended HPC use case Avoids controversy by postponing system management questions until the wider industry settles on standards for this area Enables wider scope for interop

Using Web Services Protocols This is the way the industry is going; therefore Grid must go the same way Not all core Web services specs have settled down Still in the process of becoming standardized Competing sets Non-controversial specs: WS-I WS-Addressing … Controversial specs: WSRF vs WS-Transfer et al. WS-Notification vs WS-Evening WSDM vs WS-Management HPC design focus should identify the “abstract” operations needed; e.g. “QueryJob” Defer the issue of how to map such operations to competing specifications “WS-I profile” Could also choose to do two separate “sub-profiles” that map to the relevant competing protocol specs

Conclusions Propose the definition of “vertical profiles” that map protocols and services onto use cases Advocate an evolutionary approach Start with a minimal design Incrementally expand via narrowly-scoped, extensible, composable service interaction protocols Advocate using only well-established Web services protocols and deferring use of controversial ones until the wider industry sorts them out Have illustrated our design approach with the specific instance of a basic HPC use case: Use case: batch job scheduling Outlined design: Relies on existing industry standards, infrastructure, and initiatives Introduces factored job scheduling and data transfer protocols/services Bounds and defers the topic of system management