Congreso Cuidad, Spain May 15, 2007 GridWay 1/29 gLite Course EGEE’07 MTA SZTAKI, Budapest, Hungary September 30th, 2007 An Overview of the GridWay Metascheduler.

Presentation transcript:

1/29 An Overview of the GridWay Metascheduler
José Luis Vázquez-Poletti, Distributed Systems Architecture Group, Universidad Complutense de Madrid
gLite Course, EGEE’07, MTA SZTAKI, Budapest, Hungary, September 30th, 2007

2/29 Contents (GridWay DSA Group)
1. What is GridWay?
2. A Global Vision
3. Scheduling Policies
4. Scheduling Infrastructures
5. The GridWay Project

3/29 What is GridWay?
GridWay is a Globus Toolkit component for meta-scheduling, creating a scheduler virtualization layer on top of Globus services (GRAM, MDS and GridFTP).
- For project and infrastructure directors: GridWay is an open-source community project, adhering to the Globus philosophy and guidelines for collaborative development.
- For system integrators: GridWay is highly modular, allowing adaptation to different grid infrastructures, and supports several OGF standards.
- For system managers: GridWay provides a scheduling framework similar to that found on local LRM systems, supporting resource accounting and the definition of state-of-the-art scheduling policies.
- For application developers: GridWay implements the OGF standard DRMAA API (C and Java bindings), ensuring compatibility of applications with LRM systems that implement the standard, such as SGE, Condor, Torque, ...
- For end users: GridWay provides an LRM-like CLI for submitting, monitoring, synchronizing and controlling jobs, which can be described using the OGF standard JSDL.


5/29 A Global Vision: Global Architecture of a Computational Grid
Application-infrastructure decoupling
[Figure: layered architecture diagram, top to bottom]
- Applications: DRMAA programs (.C, .java) and the CLI ($>), returning results; Standard API (OGF DRMAA); Command Line Interface
- Grid Meta-Scheduler (GridWay): open source; job execution management; resource brokering
- Grid Middleware (Globus): Globus services; standard interfaces; end-to-end (e.g. TCP/IP)
- Infrastructure (PBS, SGE): highly dynamic and heterogeneous; high fault rate

6/29 A Global Vision: Benefits
Integration of non-interoperable computational platforms (Organization)
- Establishment of a uniform and flexible infrastructure
- Achievement of greater utilization of resources and higher application throughput
Support for the existing platforms and LRM systems (Sys. Admin.)
- Allocation of grid resources according to management-specified policies
- Analysis of trends in resource usage
- Monitoring of user behavior
Familiar CLI and standard APIs (End Users & Developers)
- High Throughput Computing applications
- Workflows

7/29 A Global Vision: Features
Workload Management
- Advanced (Grid-specific) scheduling policies
- Fault detection & recovery
- Accounting
- Array jobs and DAG workflows
User Interface
- OGF standards: JSDL & DRMAA (C and Java)
- Analysis of trends in resource usage
- Command line interface, similar to that found on local LRM systems
Integration
- Straightforward deployment, as new services are not required
- Interoperability between different infrastructures
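The "DAG workflows" feature means that a job is released only once every job it depends on has finished. The following is a minimal Python sketch of that ordering idea, not GridWay's own workflow tooling; the job names and the `submission_waves` helper are hypothetical and exist only for illustration.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical workflow: each job maps to the set of jobs it depends on.
workflow = {
    "preprocess": set(),
    "simulate_a": {"preprocess"},
    "simulate_b": {"preprocess"},
    "merge": {"simulate_a", "simulate_b"},
}


def submission_waves(dag):
    """Group jobs into waves; every job in a wave can be submitted at once."""
    sorter = TopologicalSorter(dag)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    waves = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # jobs whose dependencies are done
        waves.append(ready)
        sorter.done(*ready)  # pretend the whole wave finished successfully
    return waves


waves = submission_waves(workflow)
for number, wave in enumerate(waves, start=1):
    print(number, wave)
```

A real metascheduler would mark a job as done only after the remote execution reports success, and would hold back dependent jobs on failure; the sketch skips both.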

8/29 A Global Vision: GridWay Internals
[Figure: component diagram]
- User access: DRMAA library, CLI
- GridWay Core: Request Manager, Dispatch Manager, Scheduler, Job Pool, Host Pool
- Transfer Manager, using Grid File Transfer Services (GridFTP, RFT): job preparation, job termination, job migration
- Execution Manager, using Grid Execution Services (pre-WS GRAM, WS GRAM): job submission, job monitoring, job control, job migration
- Information Manager, using Grid Information Services (MDS2, GLUE, MDS4): resource discovery, resource monitoring


10/29 Scheduling Policies
Grid Scheduling = Job + Resource Policies
Job policies (ordering of pending jobs)
- Fixed Priority
- Urgent Jobs
- User Share
- Deadline
- Waiting Time
Resource policies (matching resources for each job (user))
- Rank Expressions
- Fixed Priority
- User Usage History
- Failure Rate
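The slide states that grid scheduling combines job policies with resource policies. The toy Python sketch below illustrates one way such a combination could look: pending jobs are ordered by a job score, and each job takes the best-scored host that still has a free slot. The weights, field names and scoring formulas are assumptions made for this example and are not GridWay's actual policy engine.

```python
from dataclasses import dataclass


@dataclass
class Job:
    name: str
    fixed_priority: int     # higher value is scheduled earlier
    waiting_time: float     # seconds spent in the pending queue
    urgent: bool = False    # urgent jobs jump ahead of everything else


@dataclass
class Host:
    name: str
    rank: float             # value of a rank expression, e.g. CPU speed
    failure_rate: float     # fraction of recent jobs that failed on this host
    free_slots: int = 1


# Hypothetical weights, chosen only for illustration.
W_PRIORITY, W_WAITING, W_FAILURE = 1.0, 0.01, 100.0


def job_score(job):
    """Job policy: urgent first, then fixed priority plus waiting time."""
    if job.urgent:
        return float("inf")
    return W_PRIORITY * job.fixed_priority + W_WAITING * job.waiting_time


def host_score(host):
    """Resource policy: rank expression penalized by observed failure rate."""
    return host.rank - W_FAILURE * host.failure_rate


def schedule(jobs, hosts):
    """Return (job, host) pairs; jobs without a free slot stay pending."""
    slots = {h.name: h.free_slots for h in hosts}
    ranked_hosts = sorted(hosts, key=host_score, reverse=True)
    plan = []
    for job in sorted(jobs, key=job_score, reverse=True):
        for host in ranked_hosts:
            if slots[host.name] > 0:
                slots[host.name] -= 1
                plan.append((job.name, host.name))
                break
    return plan


jobs = [Job("a", 5, 0.0), Job("b", 1, 1000.0), Job("c", 0, 0.0, urgent=True)]
hosts = [Host("fast", 100.0, 0.5, 1), Host("steady", 80.0, 0.05, 2)]
plan = schedule(jobs, hosts)
print(plan)
```

In this run the nominally faster host is chosen last because its failure-rate penalty outweighs its rank, which is the kind of trade-off a failure-rate resource policy is meant to express.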


12/29 Scheduling Infrastructures
[Figure: spectrum of computing platforms]
- Platforms, from High Performance Computing to High Throughput Computing: SMP (Symmetric Multi-processors), MPP (Massive Parallel Processors), Clusters, Grid Infrastructures, Network Systems (Intranet/Internet)
- Dimensions compared: Network Links, Administration, Homogeneity
- Range: from centralized and coupled to decentralized and decoupled

13/29 Scheduling Infrastructures: Enterprise Grid Infrastructures
Characteristics
- “Small” scale infrastructures (campus/enterprise) with one meta-scheduler instance providing access to resources within the same administration domain, which may run different DRMS and be geographically distributed
Goal & Benefits
- Integrate multiple systems, which may be heterogeneous, in a uniform/centralized infrastructure
- Decoupling of applications and resources
- Improve return on IT investment
- Performance/usage maximization
Scheduling
- Centralized meta-scheduler that allows the enforcement of Grid-wide policies (e.g. resource usage) and provides centralized accounting

14/29 Scheduling Infrastructures: Enterprise Grids: Deployment with GridWay
[Figure: layered deployment diagram]
- Applications (Users): DRMAA interface; portal and/or CLI access
- Middleware (GridWay on top of Globus): one scheduling instance; Grid-wide policies; services: MDS, GRAM, GridFTP
- Infrastructure (SGE Cluster, PBS Cluster, LSF Cluster): could be heterogeneous and geographically distributed

15/29 Scheduling Infrastructures: Enterprise Grids: Examples
European Space Astronomy Center
- Data analysis from space missions (DRMAA)
- Site-level meta-scheduler
- Several clusters

16/29 Scheduling Infrastructures: Enterprise Grids: Examples
UABGrid, University of Alabama at Birmingham
- Bioinformatics applications
- Campus-level meta-scheduler
- 3 resources (PBS, SGE and Condor)

17/29 Scheduling Infrastructures: Partner Grid Infrastructures
Characteristics
- “Large” scale infrastructures with one or several meta-scheduler instances providing access to resources that belong to different administrative domains (different organizations or partners)
Goal & Benefits
- Large-scale, secure and reliable sharing of resources between partners or supply-chain participants
- Support collaborative projects
- Access to higher computing power to satisfy peak demands
Scheduling
- Decentralized scheduling system that allows the enforcement of organization-wide policies

18/29 Scheduling Infrastructures: Partner Grids: Deployment with GridWay
[Figure: layered deployment diagram]
- Applications (users of each (virtual) organization): DRMAA interface; science gateways
- Middleware (GridWay instances on top of Globus): multiple scheduling instances; (V) Organization-wide policies; services: MDS, GRAM, GridFTP
- Infrastructure (SGE Cluster, PBS Cluster, LSF Cluster): multiple administrative domains; multiple organizations

19/29 Scheduling Infrastructures: Partner Grids: Examples
AstroGrid-D, German Astronomy Community Grid
- Collaborative management of supercomputing resources & astronomy-specific resources
- Grid-level meta-scheduler (GRAM interface)
- 22 5 sites, 800 CPUs

20/29 Scheduling Infrastructures: Partner Grids: Examples
[Figure: GridWay deployed on the EGEE infrastructure]
- Users and applications: Massive Ray Tracing, CD-HIT workflow; DRMAA interface
- VO schedulers: GridWay instances (Biomed, Fusion) alongside the EGEE Resource Broker (RB)
- Middleware: gLite; services: BDII, GRAM, GridFTP
- Infrastructure: SGE Cluster, PBS Cluster

21/29 Scheduling Infrastructures: A Tool for Interoperability
[Figure: a single GridWay instance serving users on top of Globus/WS and gLite middleware, over SGE and PBS clusters]
- Different middlewares (e.g. WS and pre-WS)
- Different data/execution architectures
- Different information models
- Integration through adapters
- Global DNs
- Demo in June 2007, TeraGrid07

22/29 Scheduling Infrastructures: Utility Grid Infrastructures
Characteristics
- Multiple meta-scheduler layers in a hierarchical structure
- Resource provision in a utility fashion (provider/consumer)
Goal & Benefits
- Supply resources on demand, making resource provision more adaptive
- Access to unlimited computational capacity
- Transform IT costs from fixed to variable
- Seamless integration of different Grids (The Grid)
Scheduling
- Each Grid is handled as any other resource
- Characterization of a Grid as a single resource
- Use standard interfaces to virtualize a Grid infrastructure
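The idea that "each Grid is handled as any other resource" can be pictured as a composite structure: a meta-scheduler exposes the same interface as the clusters it manages, so it can be nested under another meta-scheduler. The Python sketch below is an illustration under that assumption; the class names, the `free_slots`/`submit` interface and the "most free slots" selection rule are hypothetical and are not GridWay's gateway implementation.

```python
class Cluster:
    """A leaf resource, e.g. one cluster behind a local resource manager."""

    def __init__(self, name, slots):
        self.name = name
        self.slots = slots

    def free_slots(self):
        return self.slots

    def submit(self, job):
        if self.slots == 0:
            raise RuntimeError(f"{self.name} has no free slots for {job}")
        self.slots -= 1
        return self.name  # where the job ended up


class MetaScheduler:
    """Schedules over resources and is itself usable as a single resource."""

    def __init__(self, name, resources):
        self.name = name
        self.resources = resources

    def free_slots(self):
        # A whole Grid is characterized by one aggregate figure.
        return sum(r.free_slots() for r in self.resources)

    def submit(self, job):
        # Toy policy: pick the resource with the most free slots.
        target = max(self.resources, key=lambda r: r.free_slots())
        if target.free_slots() == 0:
            raise RuntimeError(f"{self.name} has no free slots for {job}")
        return f"{self.name}/{target.submit(job)}"


# Hypothetical provider Grid, consumed by an in-house meta-scheduler.
provider = MetaScheduler("provider-grid", [Cluster("sge", 2), Cluster("pbs", 1)])
inhouse = MetaScheduler("in-house", [Cluster("local", 1), provider])

path = inhouse.submit("job-1")
print(path, inhouse.free_slots())
```

Because the outer scheduler sees only the aggregate interface, the provider can change its internal clusters without the consumer noticing, which is the decoupling the slide describes.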

23/29 Scheduling Infrastructures: Deploying Utility Grid Infrastructures with GridWay
[Figure: two Grid infrastructures, each with a GridWay instance on top of Globus-managed SGE, PBS and LSF clusters, connected through a GRID-GATEWAY]
- Users access the Grid through GridWay or through globus-job-run, Condor/G, Nimrod/G …
- Access to outsourced resources (through Globus)
- Utility Grid
- Virtualization of a Grid

24/29 Scheduling Infrastructures: Utility Grids: Example
[Figure: layered diagram with Applications (Users), Middleware (GridWay, Globus, gLite) and Infrastructure (SGE and PBS clusters)]
- Access to different infrastructures with the same adapters
- EGEE managed as any other resource
- Delegate identity/“VO” certificates
- In-house/provider gateway
- Regional infrastructure


26/29 The GridWay Project
Some Application Porting Areas
- Life Sciences
- Aerospace
- Fusion Physics
- Computational Chemistry
Some Projects and Infrastructures
- IRISGrid
- Politecnico di Torino
- CABGrid (Centro de Astrobiología)
- C2VO (Universidad de Castilla La Mancha)
- Grid at ESAC (European Space Agency)
- CRO-GRID (Croatia)
- Sun Microsystems Solution Center World Grid
- EGEE infrastructure
- BeinGRID project
- GridX1 (Canadian Grid for HEP applications)
- Universidade do Porto
- Madras Institute of Technology
- National Center for High-Performance Computing
More at: (Success Stories)

27/29 The GridWay Project: History of the Project
- Started in 2002; the first releases were distributed only on request, in binary format
- First open source release (v4.0) in January 2005 (Apache license v2.0)
- Adheres to the Globus philosophy and guidelines for collaborative development
- In June 2007 GridWay became part of the Globus Toolkit
- Since January 2005, more than 1000 downloads from 80 different countries; 25% from private companies and 75% from universities and research centers
- Best-effort support provided (contract support is also available)
- Based on a strong open source community

28/29 The GridWay Project: Development Process
Community
- Open source project, following the Globus development philosophy
Development Infrastructure (thanks to the Globus Project!)
- Mailing lists
- Bugzilla
- CVS
You are very welcome to contribute
- Reporting bugs
- Making feature requests for the next GridWay release
- Contributing your own developments (bug fixes, new features, documentation)
Detailed Roadmap
- GridWay Campaigns at bugzilla.mcs.anl.gov/globus/query.cgi
- www-unix.mcs.anl.gov/~bacon/cgi-bin/big-roadmap.cgi#Gridway

29/29 Thank you for your attention!
An Overview of the GridWay Metascheduler