Design and Evaluation of a Resource Selection Framework for Grid Applications University of Chicago.

Slides:



Advertisements
Similar presentations
Agreement-based Distributed Resource Management Alain Andrieux Karl Czajkowski.
Advertisements

A system Performance Model Instructor: Dr. Yanqing Zhang Presented by: Rajapaksage Jayampthi S.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
Presented by: Priti Lohani
GRID workload management system and CMS fall production Massimo Sgaravatto INFN Padova.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
1 Draft of a Matchmaking Service Chuang liu. 2 Matchmaking Service Matchmaking Service is a service to help service providers to advertising their service.
Distributed Application Management Using PLuSH Jeannie Albrecht, Christopher Tuttle, Alex C. Snoeren, and Amin Vahdat UC San Diego CSE {jalbrecht, ctuttle,
Usage Policy (UPL) Research for GriPhyN & iVDGL Catalin L. Dumitrescu, Michael Wilde, Ian Foster The University of Chicago.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
High Throughput Urgent Computing Jason Cope Condor Week 2008.
Legion Worldwide virtual computer. About Legion Made in University of Virginia Object-based metasystems software project middleware that connects computer.
Resource Selector Chuang Liu. What do we want to do? A smart Resource Selector App R S Resource requirement.
Workload Management Massimo Sgaravatto INFN Padova.
First steps implementing a High Throughput workload management system Massimo Sgaravatto INFN Padova
Evaluation of the Globus GRAM Service Massimo Sgaravatto INFN Padova.
Jim Basney Computer Sciences Department University of Wisconsin-Madison Managing Network Resources in.
Resource Management Reading: “A Resource Management Architecture for Metacomputing Systems”
Miron Livny Computer Sciences Department University of Wisconsin-Madison Harnessing the Capacity of Computational.
Alain Roy Computer Sciences Department University of Wisconsin-Madison An Introduction To Condor International.
Grid Computing 7700 Fall 2005 Lecture 17: Resource Management Gabrielle Allen
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
HAMS Technologies 1
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
CSE 548 Advanced Computer Network Security Document Search in MobiCloud using Hadoop Framework Sayan Cole Jaya Chakladar Group No: 1.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
3-2.1 Topics Grid Computing Meta-schedulers –Condor-G –Gridway Distributed Resource Management Application (DRMAA) © 2010 B. Wilkinson/Clayton Ferner.
Multicriteria Driven Resource Management Strategies in GRMS Krzysztof Kurowski, Jarek Nabrzyski, Ariel Oleksiak, Juliusz Pukacki Poznan Supercomputing.
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
Grid Workload Management Massimo Sgaravatto INFN Padova.
The Owner Share scheduler for a distributed system 2009 International Conference on Parallel Processing Workshops Reporter: 李長霖.
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Condor: High-throughput Computing From Clusters to Grid Computing P. Kacsuk – M. Livny MTA SYTAKI – Univ. of Wisconsin-Madison
Software Engineering Prof. Ing. Ivo Vondrak, CSc. Dept. of Computer Science Technical University of Ostrava
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
Superscheduling and Resource Brokering Sven Groot ( )
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Review of Condor,SGE,LSF,PBS
RDFPath: Path Query Processing on Large RDF Graph with MapReduce Martin Przyjaciel-Zablocki et al. University of Freiburg ESWC May 2013 SNU IDB.
how Shibboleth can work with job schedulers to create grids to support everyone Exposing Computational Resources Across Administrative Domains H. David.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
A Constraint Language Approach to Grid Resource Selection Chuang Liu, Ian Foster Distributed System Lab University of Chicago
Nicholas Coleman Computer Sciences Department University of Wisconsin-Madison Distributed Policy Management.
Eileen Berman. Condor in the Fermilab Grid FacilitiesApril 30, 2008  Fermi National Accelerator Laboratory is a high energy physics laboratory outside.
Grid Compute Resources and Job Management. 2 Grid middleware - “glues” all pieces together Offers services that couple users with remote resources through.
Miron Livny Computer Sciences Department University of Wisconsin-Madison Condor and (the) Grid (one of.
Douglas Thain, John Bent Andrea Arpaci-Dusseau, Remzi Arpaci-Dusseau, Miron Livny Computer Sciences Department, UW-Madison Gathering at the Well: Creating.
April 25, 2006Parag Mhashilkar, Fermilab1 Resource Selection in OSG & SAM-On-The-Fly Parag Mhashilkar Fermi National Accelerator Laboratory Condor Week.
Grid Workload Management (WP 1) Massimo Sgaravatto INFN Padova.
From Use Cases to Implementation 1. Structural and Behavioral Aspects of Collaborations  Two aspects of Collaborations Structural – specifies the static.
EGEE 3 rd conference - Athens – 20/04/2005 CREAM JDL vs JSDL Massimo Sgaravatto INFN - Padova.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
HTCondor’s Grid Universe Jaime Frey Center for High Throughput Computing Department of Computer Sciences University of Wisconsin-Madison.
Architecture for Resource Allocation Services Supporting Interactive Remote Desktop Sessions in Utility Grids Vanish Talwar, HP Labs Bikash Agarwalla,
First evaluation of the Globus GRAM service Massimo Sgaravatto INFN Padova.
From Use Cases to Implementation 1. Mapping Requirements Directly to Design and Code  For many, if not most, of our requirements it is relatively easy.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Workload Management Workpackage
Chuang Liu, Lingyun Yang, Dave Angulo, Ian Foster
Abstract Machine Layer Research in VGrADS
A Distributed Policy Scenario
Condor and Multi-core Scheduling
Basic Grid Projects – Condor (Part I)
Wide Area Workload Management Work Package DATAGRID project
Experiences in Running Workloads over OSG/Grid3
Condor-G Making Condor Grid Enabled
From Use Cases to Implementation
Presentation transcript:

Design and Evaluation of a Resource Selection Framework for Grid Applications University of Chicago

Problem Co-selection of resources –Resource Selection –Resource configuration –Mapping workload to resources

Outline Problem > Related work Set Match Resource Selection Service Experiments Summary Future work

Related work Networked Batch Queue System –PBS, LSF, NQE Application-specific scheduler –AppLeS, etc. Condor

Outline Problem Related work > Set Match Resource Selection Service Experiments Summary Future work

Set Match Resource Set: –Several resources with similar characteristics that may be located in different sites. Set Match –A Set Match occurs between a resource request and a resource set. If the resource set fulfill the resource request and the resource request fulfill resource’s management policy, they match each other.

Set Match Matchmaker

Set Match ClassAds Language The classads language is a language for expressing and evaluating attributes. An attribute is a named expression. A classad is a set of named expressions. Expression similar to those found in C/C++. B=[ name=”foo”; type="machine"; cpuspeed=800M; memory=512M] A=[ Requirements = other.type = = "machine" && other.cpuspeed > 500M; Rank = other.cpuspeed ]

Set Match Set-Extended ClassAds language –Aggregation functions Max(), Min(), Sum(), SizeOfSet() –Other functions suffix(S, L), Prefix(S, L)

Set Match An Example of the Set Extended ClassAds [ Type=”Set”; Domainlist={ucsd.edu, cs.uiuc.edu}; requirements = Sum(other.memory) > 1G && other.cpuspeed > 200M && suffix(other.Hostname, Domainlist); rank = Min(other.cpuspeed)*SizeOfSet() ]

Resource Selection mechanism

Outline Problem Related work Set Match > Resource Selection Service Experiments Summary Future work

Resource Selection Service Help applications to choose a good Resource Set in Grid environment. –Synchronous and asynchronous service Mapping application workload to resources if needed.

Resource Selection Service Framework

Resource Request Owner: The sender of this request. Type of Service: Synchronous or asynchronous. Job description: The characteristics of the job to be run, for example, problem size, problem specification and the performance model. Job resource requirements: User resource requirements, for example, memory capability, type of operating system, software packages installed, etc. Mapper: The kind of mapper algorithm to be used. Rank: The criteria to rank the matched resources.

Resource Request: An example 1.[ Service = "InstantService"; 2. iter=100; alpha=100; x=100; y=100; z=100; 3. computetime = x*y*alpha/other.cpuspeed*370; 4. comtime= ( other.RLatency+ y*x*254/other.RBandwidth +other.LLatency+y*x*254/other.Lbandwidth); 5. exectime=(computetime+comtime)*iter+startup; 6. Mapper = [type ="dll"; libraryname="cactus"; function="mapper"]; 7. requirements = Sum(other.MemorySize) >= ( *z*x*y) && suffix(other.machine, domains); 8. domains={ cs.utk.edu, ucsd.edu}; 9. rank=Min(1/exectime) ] Job description Resource Requirements Type of Service Mapper Rank

Resource Selection Result

Outline Problem Related work Set Match Resource Selection Service > Experiments Summary Future work

Cactus application Application performance model –ExecTime= (Communication time + Computation time) * slowdown Application mapping algorithm –1 Dimension mapping R 1

Cactus application Application performance model –ExecTime= (Communication time + Computation time) * slowdown Application mapping algorithm –1 Dimension mapping R 1R 2

Cactus application Application performance model –ExecTime= (Communication time + Computation time) * slowdown Application mapping algorithm –1 Dimension mapping R 1R 2R 3

Cactus application Application performance model –ExecTime= (Communication time + Computation time) * slowdown Application mapping algorithm –1 Dimension mapping R 1R 2R 3

Execution Time Prediction Problem Size Time (S) torc1 Torc1 Torc5 Torc1 Torc5 Torc1 Torc5 Torc1,3, 5 Torc1,3, 5, o.ucsd.edu

Resource Selection Algorithm Test 1. o.ucsd.edu, mystere.ucsd.edu, saltimbanco.ucsd.edu 2: mystere.ucsd.edu, o.ucsd.edu 3: o.ucsd.edu, Saltimbanco.ucsd.edu 4. o.ucsd.edu 5: saltimbanco.ucsd.edu 6: mystere.ucsd.edu

Resource Selection Algorithm Test 1: torc6.cs.utk.edu 2: o.ucsd.edu 3: Saltimbanco.ucsd.edu 4. Torc6.cs.utk.edu + o.ucsd.edu 5: o.ucsd.edu + saltimbanco.ucsd.edu 6. o.ucsd.edu, mystere.ucsd.edu, torc6.cs.utk.edu

Outline Problem Related work Set Match Resource Selection Service Evaluation of the implementation > Summary Future work

Summary Extended the ClassAds language to describe the requirement for a Resource Set Implement a Set Matchmaker Create a general framework for the Resource Selection Service Validate our implementation by the Cactus application

Future Work Extend the resource selection algorithm to support other applications. –Gang match –Resource with particular topology Provide fault tolerance to incomplete or error information A general matchmaking mechanism

Welcome comments and suggestions Thank you!

Problem Co-selection of resources –Dynamic nature of Grid resources –Heterogeneity of resources –The autonomy of resources –The quality of resources depends on the application’s run characteristics