Promoting and Standardizing Grid Computing OGSA - A View From The Trenches Andrew Grimshaw GGF Architecture Area Co-Director January, 2005.

Slides:



Advertisements
Similar presentations
Genesis II Open Source, OGSA Implementation Genesis II: Mapping Grids into the Local File System: Access, RNS, and ByteIO Andrew Grimshaw Genesis II Team.
Advertisements

1 Use Cases Application provisioning (version control) Workload management/load-balancing (server consolidation) Data Federation/sharing E-utilities (provisioning.
© 2007 Open Grid Forum Data Management Challenge - The View from OGF OGF22 – February 28, 2008 Cambridge, MA, USA Erwin Laure David E. Martin Data Area.
Fujitsu Laboratories of Europe © 2004 What is a (Grid) Resource? Dr. David Snelling Fujitsu Laboratories of Europe W3C TAG - Edinburgh September 20, 2005.
Ch:8 Design Concepts S.W Design should have following quality attribute: Functionality Usability Reliability Performance Supportability (extensibility,
Agreement-based Distributed Resource Management Alain Andrieux Karl Czajkowski.
Internet Technologies (Grid Computing (OGSA, WSRF) )
Building a SOA roadmap for your enterprise Presented by Sanjeev Batta Architect, Cayzen Technologies.
1 On Death, Taxes, & the Convergence of Peer-to-Peer & Grid Computing Adriana Iamnitchi Duke University “Our Constitution is in actual operation; everything.
6th Biennial Ptolemy Miniconference Berkeley, CA May 12, 2005 Distributed Computing in Kepler Ilkay Altintas Lead, Scientific Workflow Automation Technologies.
OGF Standards Activity Steven Newhouse. © myOGF Part of the Standards Council Area Directror for Applications Working Groups Co-chair: Basic Execution.
Components and Architecture CS 543 – Data Warehousing.
ATSN 2009 Towards an Extensible Agent-based Middleware for Sensor Networks and RFID Systems Dirk Bade University of Hamburg, Germany.
Lecture Nine Database Planning, Design, and Administration
Architecture of Grid File System (GFS) - Based on the outline draft - Arun swaran Jagatheesan San Diego Supercomputer Center Global Grid Forum 11 Honolulu,
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
1 Autonomic Computing An Introduction Guenter Kickinger.
Possible Architectural Principles for OGSA-UK and other Grids UK e-Science Core Programme Town Meeting London Monday 31st January 2005 “Defining the next.
An Introduction to Software Architecture
An XMPP (Extensible Message and Presence Protocol) based implementation for NHIN Direct 1.
Grid Andrew Grimshaw September, What is a Grid System? A Grid system is a collection of distributed resources connected by a network. Examples of.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
A Summary of the Distributed System Concepts and Architectures Gayathri V.R. Kunapuli
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Distribution and components. 2 What is the problem? Enterprise computing is Large scale & complex: It supports large scale and complex organisations Spanning.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
EGA Discussion November 2004 Promoting and Standardizing Grid Computing Complexity Matters Andrew Grimshaw Virginia OGSA Architecture Area Director ICSOC.
Legion - A Grid OS. Object Model Everything is object Core objects - processing resource– host object - stable storage - vault object - definition of.
Enabling the Future Service-Oriented Internet (EFSOI 2008) Supporting end-to-end resource virtualization for Web 2.0 applications using Service Oriented.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Andrew McNabSecurity Middleware, GridPP8, 23 Sept 2003Slide 1 Security Middleware Andrew McNab High Energy Physics University of Manchester.
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
Distributed Accounting Working Group (DAWG) Distributed Accounting Models Research Group Monday, 22 July 2002 Tuesday, 23 July 2002 Edinburgh, Scotland.
7. Grid Computing Systems and Resource Management
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
© 2004 IBM Corporation ICSOC2004 Panel Discussion: Grid Systems: What is needed from web service standards? Jeffrey Frey IBM.
Tanenbaum & Van Steen, Distributed Systems: Principles and Paradigms, 2e, (c) 2007 Prentice-Hall, Inc. All rights reserved DISTRIBUTED SYSTEMS.
Jemerson Pedernal IT 2.1 FUNDAMENTALS OF DATABASE APPLICATIONS by PEDERNAL, JEMERSON G. [BS-Computer Science] Palawan State University Computer Network.
UML - Development Process 1 Software Development Process Using UML.
OOD OO Design. OOD-2 OO Development Requirements Use case analysis OO Analysis –Models from the domain and application OO Design –Mapping of model.
SharePoint Governance And the role of the Site Owner.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
Resource Selection Services for a Single Job Execution Soonwook Hwang National Institute of Informatics/NAREGI OGSA F2F RSS Session Sunnyvale, CA, US Aug.
Promoting and Standardizing Grid Computing OGSA - Service Oriented Architecture for Grids Ravi Subramaniam OGSA-WG November, 2005.
Building PetaScale Applications and Tools on the TeraGrid Workshop December 11-12, 2007 Scott Lathrop and Sergiu Sanielevici.
Leading the pervasive adoption of grid computing for research and industry © 2005 Global Grid Forum The information contained herein is subject to change.
Leading the pervasive adoption of grid computing for research and industry © 2005 Global Grid Forum The information contained herein is subject to change.
OGSA-WG Session #2 Program Execution Services GGF10 Berlin March. 10, :30-5pm Senate Hall.
Process 4 Hours.
Bob Jones EGEE Technical Director
OGSA HPC cluster usecase for reference Model v.02
Resource Management in OGSA
OGSA Session #1 Execution Management Services
JRA3 Introduction Åke Edlund EGEE Security Head
OGSA Data Architecture WG Data Transfer Discussion
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
OGSA Data Architecture Scenarios
Leigh Grundhoefer Indiana University
OGSA and Security Services GGF12 , September 20th, 2004 Hiro Kishimoto
Web Services Interoperability Organization
An Introduction to Software Architecture
OGF19 – Chapel Hill, NC, USA 30 January 2007
Design Yaodong Bi.
Grid Systems: What do we need from web service standards?
Software Development Process Using UML Recap
From Use Cases to Implementation
Presentation transcript:

Promoting and Standardizing Grid Computing OGSA - A View From The Trenches Andrew Grimshaw GGF Architecture Area Co-Director January, 2005

2 Agenda Background – quick OGSA objectives and process OGSA design teams Opportunities for collaboration

3 What is an architecture? In the computer systems world an architecture is the definition of the components, their interactions, and the design philosophy used in the development of the whole system. In a grid, high-performance secure, shared, collaboration distributed system, the architecture will define the services, their interactions, and the design philosophy. In other words, what are the pieces of the puzzle, how do they fit together, and what does the puzzle look like when complete. One of our design philosophies is that the pieces can be replaced, extended, and tailored to particular use cases. Further, a systems architecture is the architecture on which applications, application services, and specialized views or profiles of the architecture are built. OGSA is a grid system architecture.

4 Architecture Requirements Simple Secure Standards-based Multiple interoperable implementations Scalable Extensible Site Autonomy Persistence & I/O Multi-Language Legacy Support Transparency Heterogeneity Fault-tolerance & Exception Management Success Requires an integrated model at the foundation. Manage Complexity!!

5 The Importance of a Strong Foundation

6 OGSA Aims and Perspective Goals −Interoperable solutions to Grid based applications Grid definitions sidebar −Addressing loosely coupled distributed computing Philosophy −Standardization at the Architectural level Similar to profiling. Developed before and/or during standards development −Use existing standards and technology where possible −Use case driven gap analysis −Gaps are filled proactively Not exclusively within the GGF (e.g. naming).

7 OGSA Process Use Case Driven −21 Detailed Use Cases (~ 6 pages each) Tier 1 Available at: Distributed Specification and Standardization −Identify and/or develop open and accessible standard specifications −Active current work in GGF, OASIS, W3C, and DMTF. “Design Team” Working Model −Facilitate cross fertilization within and outside GGF. −Avoid redundant work applicable efforts −Focus mind share (the most valuable commodity) e.g. DAIS-WG and OGSA-Data Design Team Iterative Refinement −Abstract service evolving to concrete specifications Documents: −OGSA: Use Cases, Informal Specification, Recommendation

8 OGSA –What is it? Two streams −Profiles −Design Teams  Working Groups Process for design team, working group, profile development interaction −Draw circle

9 Profiles Define a usage pattern and include specifications developed by working groups both within and external to GGF. Issue: How mature and “widely adopted”? Three “in the pipe” −Basic −Data −Execution Management

10 Design Teams Naming – the foundation on which distributed systems are built Security – deeply dependent on WS-Security Data of all types Execution Management Services – EMS Logging – spit off into a working group

11 “A Rose by any other name would smell as sweet” Terms Resource Abstract resource name Human name (paths and attributes) Resource address Resource identity Binding scheme Bind time

12 Why names? Transparencies −Location −Migration −Failure −Replication −Scalability −and so on

13 Distributed naming is a well-understood area - properties Unique Provide identity Comparable Location portable Widely adopted Scalable – high performance Extensible Dynamic binding …. Two and three level name schemes dominate

14 Two level schemes Human name -> address −E.g., DNS, Unix file system (string->inode) abstract name -> address

15 Three level schemes Human -> abstract -> address In OGSA, −Human -> address and Human -> abstract will likely be handled by RNS – Resource Naming Service being developed by the GGF GFS-WG

16 OGSA Security Process is not moving rapidly −Partially because they are waiting on WS Security Maybe too focused on one set of use cases (big government labs working together) (my opinion)

17 OGSA Data & InfoD Use case driven Many different data “types” and use scenario’s from HEP to business intelligence Strong consensus emerging with some issues still around meta-data and information dissemination Strawman services defined for flat files, interacting with GFS. Pushing for early spec’s. Interacting with existing GGF WG’s including GFS, GSM, DIAS, Info-D Interacting begun with WSDM

18 Info Services Troubleshooting Event Management Discovery Logging – spun off

19 EMS Overview Basic problem: provision, execute and manage services (including legacy applications) in a grid −Some use cases start up a cache service on-demand, utility computing start up and manage a set of legacy applications Want to be able to “instantiate” a service and have the grid figure it out, and provide management interfaces throughout the lifetime of the service

20 EMS addresses issues such as: Where can a service execute? What are the locations it can execute because of resource restrictions such as memory, CPU and binary type, available libraries, and available licenses? Given the above, what policy restrictions are in place that may further limit the candidate set of execution locations? Where should the service execute? Once it is known where the service can execute, the question is where should it execute? This may involve different selection algorithms that optimize different objective functions or are trying to enforce different policies or service level agreements. Prepare the service to execute. Just because a service can execute somewhere does not necessarily mean it can execute there without some setup. Setup could include deployment and configuration of binaries, libraries, staging data, or other operations to prepare the local execution environment to execute the service. Get the service executing. Once everything is ready, actually start the service and register it in the appropriate places. Manage (monitor, restart, move, etc.). Once the service is started in must be managed and monitored. What if it fails? Or fails to meet its service agreements. Should it be restarted in another location? What about state? Should the state be “checkpointed” periodically to ensure restartability? Is the service participating in some sort of fault-detection and recovery scheme?

21 EMS Services fall into three sets Resources that model processing, storage, executables, resource management, and provisioning Job management and monitoring services; and Resource selection services that collectively decide where to execute a service.

22 Typical Pattern Provisioning Deployment Configuration Information Services Service Container Persistent State Handle Service Accounting Services Execution Planning Services Candidate Set Generator (Work - Resource mapping) Job Manager Reservation

23 Opportunities For Collaboration OMII and EGEE efforts intersect with OGSA design team efforts We all win if we can come to consensus EMS −The basic problem that everyone (Globus, SGE, LSF, Legion, EGEE, OMII) solves is the same. −Solutions have many similarities −EMS team spent quite a bit of time hammering those out −We’re here to make sure that OMII input is part of design Similarly for data