1 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Enforcing resource allocations with.

Slides:



Advertisements
Similar presentations
Gfarm v2 and CSF4 Osamu Tatebe University of Tsukuba Xiaohui Wei Jilin University SC08 PRAGMA Presentation at NCHC booth Nov 19,
Advertisements

Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
Test Case Management and Results Tracking System October 2008 D E L I V E R I N G Q U A L I T Y (Short Version)
1 NextGRID Monitoring and Fabric Management Requirements SLA Management Example: SweGrid Accounting System and Test-bed Thomas Sandholm, KTH,
Polish Infrastructure for Supporting Computational Science in the European Research Space EUROPEAN UNION Services and Operations in Polish NGI M. Radecki,
Seminar Grid Computing ‘05 Hui Li Sep 19, Overview Brief Introduction Presentations Projects Remarks.
DataGrid is a project funded by the European Union 22 September 2003 – n° 1 EDG WP4 Fabric Management: Fabric Monitoring and Fault Tolerance
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Authz work in GGF David Chadwick
1 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTER SCIENCE DEPARTMENT OF COMPUTING SCIENCE 2 nd International Conference on Service Oriented Computing,
CoreGRID Workpackage 5 Virtual Institute on Grid Information and Monitoring Services Authorizing Grid Resource Access and Consumption Erik Elmroth, Michał.
Sergey Belov, LIT JINR 15 September, NEC’2011, Varna, Bulgaria.
MCTS Guide to Microsoft Windows Server 2008 Network Infrastructure Configuration Chapter 11 Managing and Monitoring a Windows Server 2008 Network.
Sergey Belov, Tatiana Goloskokova, Vladimir Korenkov, Nikolay Kutovskiy, Danila Oleynik, Artem Petrosyan, Roman Semenov, Alexander Uzhinskiy LIT JINR The.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
● Problem statement ● Proposed solution ● Proposed product ● Product Features ● Web Service ● Delegation ● Revocation ● Report Generation ● XACML 3.0.
Cardea Requirements, Authorization Model, Standards and Approach Globus World Security Workshop January 23, 2004 Rebekah Lepro Metz
1 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Enforcing resource allocations with.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Thinking about Accounting Matteo Melani SLAC Open Science Grid.
SOS EGEE ‘06 GGF Security Auditing Service: Draft Architecture Brian Tierney Dan Gunter Lawrence Berkeley National Laboratory Marty Humphrey University.
GRAM: Software Provider Forum Stuart Martin Computational Institute, University of Chicago & Argonne National Lab TeraGrid 2007 Madison, WI.
Accounting for the Grid Usage Records and a Resource Usage Service.
U.S. Department of Agriculture eGovernment Program August 14, 2003 eAuthentication Agency Application Pre-Design Meeting eGovernment Program.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Open Science Grid The OSG Accounting System: GRATIA by Philippe Canal (FNAL) & Matteo Melani (SLAC) Mumbai, India CHEP2006.
Frascati, October 9th, Accounting in DataGrid Initial Architecture Albert Werbrouck Frascati, October 9, 2001.
Configuration Management (CM)
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Summary of Accounting Discussion at the GDB in Bologna Dave Kant CCLRC, e-Science Centre.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Scalable Systems Software Center Resource Management and Accounting Working Group Face-to-Face Meeting October 10-11, 2002.
London e-Science Centre GridSAM Job Submission and Monitoring Web Service William Lee, Stephen McGough.
Grid Workload Management Massimo Sgaravatto INFN Padova.
Cracow Grid Workshop October 2009 Dipl.-Ing. (M.Sc.) Marcus Hilbrich Center for Information Services and High Performance.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
1 Globus Toolkit Security Rachana Ananthakrishnan Frank Siebenlist Argonne National Laboratory.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp - SWITCH EGI TF Prague.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Cracow Grid Workshop ‘06 17 October 2006 Execution Management and SLA Enforcement in Akogrimo Antonios Litke Antonios Litke, Kleopatra Konstanteli, Vassiliki.
1October 9, 2001 Sun in Scientific & Engineering Computing Grid Computing with Sun Wolfgang Gentzsch Director Grid Computing Cracow Grid Workshop, November.
Grid Security: Authentication Most Grids rely on a Public Key Infrastructure system for issuing credentials. Users are issued long term public and private.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
Overview of Privilege Project at Fermilab (compilation of multiple talks and documents written by various authors) Tanya Levshina.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
WLCG-RUS An Extensible Solution to Grid Accounting & Usage Monitoring EGEE 3 rd User Forum X. Chen, A. Khan Brunel University.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
EMI INFSO-RI Argus Policies in Action Valery Tschopp (SWITCH) on behalf of the Argus PT.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
EMI INFSO-RI Argus The EMI Authorization Service Valery Tschopp (SWITCH) Argus Product Team.
The National Grid Service User Accounting System Katie Weeks Science and Technology Facilities Council.
Sep 17, 20081/16 VO Services Project – Stakeholders’ Meeting Gabriele Garzoglio VO Services Project Stakeholders’ Meeting Sep 17, 2008 Gabriele Garzoglio.
EMI is partially funded by the European Commission under Grant Agreement RI Argus Policies Tutorial Valery Tschopp (SWITCH) – Argus Product Team.
IT 5433 LM1. Learning Objectives Understand key terms in database Explain file processing systems List parts of a database environment Explain types of.
OGSA-DAI.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Allocation Management Solutions for High Performance Computing Scott M. Jackson Workshop on Scheduling and Resource Management for Parallel and Distributed.
EGEE is a project funded by the European Union under contract INFSO-RI DGAS Grid accounting L.Gaido on behalf of A.Guarise LCG Workshop November.
Argus EMI Authorization Integration
DGAS A.Guarise April 19th, Athens
OGF PGI – EDGI Security Use Case and Requirements
StoRM: a SRM solution for disk based storage systems
A gLite Authorization Framework
Ruslan Fomkin and Tore Risch Uppsala DataBase Laboratory
Wide Area Workload Management Work Package DATAGRID project
Presentation transcript:

1 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Enforcing resource allocations with the SweGrid Accounting System (SGAS) GDB meeting, Bologna October 11, 2005 Joint effort by Erik Elmroth (Umeå University) Peter Gardfjäll (Umeå University) Lennart Johnsson (KTH) Olle Mulmo (KTH) Thomas Sandholm (KTH) Presented by Tord Ekelof, Uppsala University on behalf of this group

2 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE The involved parties Resource owner User Allocation authority User  E.g. a member of a scientific project  Wants QoS guarantees  His/her ”fair share of the Grid” Allocation authority  E.g. Swedish National Allocation Committee  Wants to coordinate aggregate Grid capacity to assure efficient utilization of resources  Reserve resources to projects (e.g. subject to payment/importance) Resource owner  Resource administrator  Wants to retain control over the local resource and its utilization SGAS – ”the mechanism”  Provides functionality to manage and enforce the necessary policies SGAS

3 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Grid accounting - coordinating Grid resource usage Maintaining a (consistent) Grid-wide view of the resources utilized by VO members Measure and control users’ total resource usage on the Grid  Assuming absence of central point of control  Resource owners should retain local control

4 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Why accounting? Accounting information can be used for several purposes  Economic compensation  Tracking of resource usage  Evaluation/forecasting of resource usage  Resource brokering decisions  Assign scheduling priorities to jobs based on previous resource utilization  Pricing & creating economic markets for resource sharing  Enforcement of resource allocations  Etc…

5 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE SGAS in SweGrid SweGrid is a Swedish computational Grid  Connects six computer clusters (Umeå, Göteborg, Uppsala, Stockholm, Lund, Linköping) with a total of 600 processors Swedish National Allocation Committee  Allocates CPU time (measured in node hours) on SweGrid to research projects  Grid-wide allocations can be spent arbitrarily among Grid sites SGAS has been developed to  Enforce project allocations across all SweGrid sites Prevent project members from overspending  Store detailed information on each Grid job’s resource usage

6 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Milestones and future directions Sep 2003: SGAS project initiated Sep 2003: SweGrid site survey – needs analysis Oct 2003: SGAS white paper  Investigated existing work on Grid accounting  Accounting system architecture proposal Jan 2004: Finished proof-of-concept prototype Feb 2004: Started work on production code base Apr 2004: Version 0.1 released  OGSI/GT3-based Apr 2004: Contributed authorization framework to Globus Toolkit Nov 2004: Version 0.2 released  Additional core functionality (e.g. timestamped allocations) Oct 2004: Version 1.0 released  Stability/scalability improvements Jun 2005: Version 2.0 (alpha) released  Fully WSRF-compliant implementation (GT4-based) Aug 2005: SGAS included as ”tech preview” in Globus Toolkit Autumn 2005: Final 2.0 version  Further real-world testing  Distributed bank solution, and simplified account naming

7 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE SweGrid Accounting System (SGAS) Decentralized resource allocation enforcement system SGAS performs soft real-time enforcement of allocations  Real-time enforcement: Resources can, at the time of job submission, deny access if project quota has been used up  Soft: enforcement is subject to local resource policies (strict enforcement not always appropriate) Initially addressed allocation enforcement in SweGrid  Not restricted to SweGrid use Developed with an emphasis on easy integration into different Grid middleware  Single-point-of-integration  In SweGrid: deployed on top of NorduGrid middleware WSRF-compliant Java implementation using Globus Toolkit 4 (GT4) primitives

8 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Design goals Service-oriented architecture (Web Services-based) Based on open standards  GGF: OGSA, UR  OASIS: WSRF, XACML, WS-Security  WS-SecureConversation Light-weight/non-intrusive deployment  Single-point-of integration with underlying middleware End-to-end security  Message-level and transport-level security  Fine-grained authorization model based on XACML policies End-user transparency Flexibility and customizability  Can account for any type of resource usage  Abstract “currency” (Grid credits)  Usage transformed into Grid credits before charging an account  Policy customization on three different levels User: “only run jobs if sufficient quota is available” Resource owner: “run quota-exceeding jobs with low priority” Allocation authority: “allow 10 % account overdraft”

9 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE SGAS component overview Four main components  Bank Online service Manages project accounts (resource allocations) Provides Grid users/resources with consistent information about resources consumed by Grid projects  JARM (Job Account Reservation Manager) Intercepts job requests on resources Makes account reservation prior to job execution Charges project account after job completion Single-point-of-integration  LUTS (Logging and Usage Tracking Service) Collects and publishes usage records which can be queried by users  PAT (Policy Administration Tool) Client tool to manage Bank and LUTS policies

10 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Component interactions 1.Contact resource 2.Authenticate/authorize (delegate credentials) 3.Submit job request 4.JARM intercepts request 5.Make account reservation 6.Run job 7.Collect usage info 8.Charge project account and log usage info

11 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Policy enforcement overview PAP = Policy Administration Point - set up policies PIP = Policy Information Point - retrieve policies PDP = Policy Decision Point - make policy decisions/manage policy PEP = Policy Enforcement Point - intercept request and query PDP(s)

12 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Bank component Composed of WSRF-compliant Web services  Bank Creates and locates accounts  Account Represents a project’s resource allocation Users make reservations on account allocation. A successful reservation results in a...  Hold Time-limited reservation on the account Used to charge the account Overdraft policy can be associated with each account Batch operations for scalability/performance Each account manages a set of time-stamped allocations  Each allocation valid for a limited time period  Allows total allocation to be spread out in time  Implements a "use-it-or-lose-it" policy … Bank Account … Hold >

13 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Allocation Strategies Picture from:

14 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Allocation strategy example Picture from:

15 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Allocation Strategies Picture from:

16 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Logging and Usage Tracking Service (LUTS) Collects and publishes usage records compliant with GGF- UR specification  XML-based format for storing detailed information about the resources consumed by Grid jobs CPU time, memory, storage, network, … Authorized users are allowed to run XPath queries directly against LUTS URs can be extended to hold additional information only understood by a subset of users/resources without modifying LUTS URs can be logged in batches  Improved performance and scalability XSLT-based transformation infrastructure to allow sites to easily convert their non-XML usage data to a UR- compliant format

17 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Collecting usage data

18 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Job Account Reserv. Manager (JARM) Integration-point between SGAS and underlying Grid env.  Workload manager independent NorduGrid integration  configuration of plug-in scripts triggered on state-transitions during the NG job submission process Plugged into workload manager at each cluster  Intercepts job submissions  Makes account reservations prior to job execution Can be carried out in parallel with job preparation (less overhead)  Collects usage data from batch system when job has finished  Charges account and logs a usage record in LUTS Charging & logging of jobs usually deferred and performed in batches Local site policies can be enforced by overloading the default Site Policy Manager  Default Site Policy Manager let job through even if bank cannot be reached; log and charge later overdraft violation detected: run job with lower priority

19 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE sgas-admin A tool for:  Administering SGAS  Collecting information from SGAS Provides a command line interface including commands for:  Bank management Creating and removing accounts Managing account allocations Managing account policies  Retrieving usage information  Off-line corrections Can be run in interactive or script mode TODO: web/graphical interface

20 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Authorization framework Fine-grained authorization framework  Authorization specified on a per-operation basis Separate Globus contribution Associate authorization policy and engine with service  Service orthogonal: transparent to service implementation  Customizable: allows different backend engines/policy languages SGAS authorization engine based on XACML

21 CENTER FOR PARALLEL COMPUTERS DEPARTMENT OF COMPUTING SCIENCE DEPARTMENT OF COMPUTING SCIENCE Project information Please visit us at  SGAS download  Documentation  Publications Mailing list: Globus Toolkit contribution  Grid research at Umeå University 