EGEE is a project funded by the European Union under contract IST-2003-508833 Grid developments and middleware components Mike Mineter EGEE Training team.

Slides:



Advertisements
Similar presentations
Workload Management David Colling Imperial College London.
Advertisements

EU 2nd Year Review – Jan – Title – n° 1 WP1 Speaker name (Speaker function and WP ) Presentation address e.g.
Workload management Owen Maroney, Imperial College London (with a little help from David Colling)
INFSO-RI Enabling Grids for E-sciencE Workload Management System and Job Description Language.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
The Grid Constantinos Kourouyiannis Ξ Architecture Group.
Job Submission The European DataGrid Project Team
A Computation Management Agent for Multi-Institutional Grids
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Submission Fokke Dijkstra RuG/SARA Grid.
The EDG Workload Management System – n° 1 The EDG Workload Management System.
Basic Grid Job Submission Alessandra Forti 28 March 2006.
Grid Infrastructure.
INFSO-RI Enabling Grids for E-sciencE Grid Infrastructure & Related Projects Eddie Aronovich Tel-Aviv University, School of CS
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
NGS in the future: emerging middleware.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
Computational grids and grids projects DSS,
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Enabling Grids for E-sciencE Workload Management System on gLite middleware Matthieu Reichstadt CNRS/IN2P3 ACGRID School, Hanoi (Vietnam)
M. Sgaravatto – n° 1 The EDG Workload Management System: release 2 Massimo Sgaravatto INFN Padova - DataGrid WP1
DataGrid WP1 Massimo Sgaravatto INFN Padova. WP1 (Grid Workload Management) Objective of the first DataGrid workpackage is (according to the project "Technical.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
Grid Workload Management Massimo Sgaravatto INFN Padova.
- Distributed Analysis (07may02 - USA Grid SW BNL) Distributed Processing Craig E. Tull HCG/NERSC/LBNL (US) ATLAS Grid Software.
Training and the NGS Mike Mineter
EGEE is a project funded by the European Union under contract IST Middleware components in EGEE Mike Mineter NeSC Training team
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE middleware: gLite Data Management EGEE Tutorial 23rd APAN Meeting, Manila Jan.
Enabling Grids for E-sciencE Introduction Data Management Jan Just Keijser Nikhef Grid Tutorial, November 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Introduction to the EGEE Project Presented by Min Tsai ISGC 2007, Taipei With thanks.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Submission Fokke Dijkstra RuG/SARA Grid.
EGEE is a project funded by the European Union under contract IST Job Description Language - more control over your Job Assaf Gottlieb University.
Replica Management Services in the European DataGrid Project Work Package 2 European DataGrid.
INFSO-RI Enabling Grids for E-sciencE OSG-LCG Interoperability Activity Author: Laurence Field (CERN)
EGEE is a project funded by the European Union under contract IST EGEE Tutorial Turin, January Job Services Emidio.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
M. Sgaravatto – n° 1 Overview of release 2 of the EDG WP1 Workload Management System deployed in the INFN production Grid Massimo Sgaravatto INFN Padova.
INFSO-RI Enabling Grids for E-sciencE Αthanasia Asiki Computing Systems Laboratory, National Technical.
High-Performance Computing Lab Overview: Job Submission in EDG & Globus November 2002 Wei Xing.
EGEE-0 / LCG-2 middleware Practical.
INFSO-RI Enabling Grids for E-sciencE Introduction Data Management Ron Trompert SARA Grid Tutorial, September 2007.
EGEE is a project funded by the European Union under contract IST WS-Based Advance Reservation and Co-allocation Architecture Proposal T.Ferrari,
Workload Management System Jason Shih WLCG T2 Asia Workshop Dec 2, 2006: TIFR.
Induction: General components of Grid middleware and User Interfaces –April 26-28, General components of Grid middleware and User Interfaces Roberto.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An overview of the EGEE project and middleware Mike Mineter
Summary from WP 1 Parallel Section Massimo Sgaravatto INFN Padova.
Further aspects of EGEE middleware components INFN, Catania EGEE is funded by the European Union under contract IST
EGEE is a project funded by the European Union under contract IST Job Description Language – How to control your Job Nadav Grossaug IsraGrid.
Biomed tutorial 1 Enabling Grids for E-sciencE INFSO-RI EGEE is a project funded by the European Union under contract IST JDL Flavia.
User Interface UI TP: UI User Interface installation & configuration.
13th EELA Tutorial, La Antigua, 18-19, October E-infrastructure shared between Europe and Latin America FP6−2004−Infrastructures−6-SSA
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
GRID commands lines Original presentation from David Bouvet CC/IN2P3/CNRS.
Introduction to Computing Element HsiKai Wang Academia Sinica Grid Computing Center, Taiwan.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
Workload Management System ( WMS )
EGEE tutorial, Job Description Language - more control over your Job Assaf Gottlieb Tel-Aviv University EGEE is a project.
Job Submission in the DataGrid Workload Management System
Introduction to Grid Technology
Workload Management System
5. Job Submission Grid Computing.
Job Submission M. Jouvin (LAL-Orsay)
Presentation transcript:

EGEE is a project funded by the European Union under contract IST Grid developments and middleware components Mike Mineter EGEE Training team

Grid developments and middleware components, 19 July Acknowledgements This presentation for the GGF Summer School, 2004 was prepared by the NeSC Edinburgh training team. It includes slides and information from many sources:  Roberto Barbera (Slides on middleware are based on presentations given in Edinburgh, April 2004)  Malcolm Atkinson and Ian Bird (Sites in LCG-2/EGEE-0 at GGF-11)  Other colleagues in EGEE (project overview slides)  The European DataGrid training team  Authors of the LCG-2 User Guide v. 2.0 : Antonio Delgado Peris, Patricia Méndez Lorenzo, Flavia Donno, Andrea Sciabà, Simone Campana, Roberto Santinelli

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

Grid developments and middleware components, 19 July Towards a European e-Infrastructure To underpin European science and technology in the service of society To link with and build on  National, regional and international initiatives  Emerging technologies (e.g. fibre optic networks) To foster international cooperation  both in the creation and the use of the e-infrastructure Network infrastructure ( GÉANT ) Operations, Support and training Collaboration Pan-European Grid

Grid developments and middleware components, 19 July In 2 years EGEE will: Establish production quality sustained Grid services  3000 users from at least 5 disciplines  over 20,000 CPU's, 50 sites  over 5 Petabytes (10 15 ) storage Demonstrate a viable general process to bring other scientific communities on board Spend 32 Million Euros - started April 2004  70 institutions in 27 countries Propose a second phase in mid 2005 to take over EGEE in early 2006 InitialNew

Grid developments and middleware components, 19 July EGEE will: Establish production quality sustained Grid services  3000 users from at least 5 disciplines  over 20,000 CPU's Demonstrate a viable general process to bring other scientific communities on board Spend 32 Million Euros over 2 years starting April 2004  70 institutions in 28 countries Propose a second phase in mid 2005 to take over EGEE in early 2006 Initial domains

Grid developments and middleware components, 19 July EGEE activity groups 1: Middleware Engineering and Integration 2: Quality Assurance 3: Security 4: Network Services Development 1: Grid Operations, Support and Management 2: Network Resource Provision 1: Management 2: Dissemination and Outreach 3: User Training and Education 4: Application Identification and Support 5: Policy and International Cooperation 24% Joint Research28% Networking 48% Services Emphasis in EGEE is on operating a production grid and supporting the end- users

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

Grid developments and middleware components, 19 July EGEE view of history... LCG EGEE Used in USA EU NextGrid … GridCC Future e-Infrastructure EDG GlobusMyProxyCondor... VDT DataTAG AliEn CrossGrid... … SRM

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

Grid developments and middleware components, 19 July Service orientation: building EGEE-1 “gLite” - the new EGEE middleware (under test) Service oriented - components that are :  Loosely coupled (by messages – examples tomorrow)  Accessible across network; modular and self-contained; clean modes of failure  So can change implementation without changing interfaces  Can be developed in anticipation of new uses … and are based on standards. Opens EGEE to:  New middleware (plethora of tools now available)  Heterogeneous resources (storage, computation…)  Interact with other Grids (international, regional and national)

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

Grid developments and middleware components, 19 July LCG and EGEE LCG: Large Hadron Collider Computing Grid LCG infrastructure running LCG-2 is “EGEE-0” In parallel producing new web-service-oriented middleware (“gLite”) Will replace LCG-2 as production facility in 2005 New major releases each year Globus 2 basedWeb services based EGEE-2EGEE-1LCG-2LCG-1

Grid developments and middleware components, 19 July Sites in LCG-2/EGEE-0 : June Countries 58 Sites (45 Europe, 2 US, 5 Canada, 5 Asia, 1 HP) Coming: New Zealand, China, other HP (Brazil, Singapore) 3800 cpu

Grid developments and middleware components, 19 July Operations Infrastructure A lot more than middleware!! 40% of EGEE budget 10 ROCs: coordinate deployment & operation, tasks include:  First point of contact for all new sites, new users, and user support; Issue certificates  Negotiate policies with resource providers 5 CICs: tasks include provision of  VO services, core Grid services (RBs, UIs, database services, BDIIs)

Grid developments and middleware components, 19 July EGEE: adding a VO EGEE has a formal procedure for adding selected new user communities: Negotiation with one of the Regional Operations Centres Seek balance between the resources contributed by a VO and those that they consume. Resource allocation will be made at the VO level. Many resources need to be available to multiple VOs : shared use of resources is fundamental to a Grid

Grid developments and middleware components, 19 July Story so far: themes illustrated by EGEE e-Infrastructure  Integrating networks, grids and emerging technologies  Based on standards  Underpinning research, industry, … the “knowledge economy” International, collaborative effort Moving to a Service Orientated Architecture Focus: Production grids for multiple VOs  Demands massive effort in organisation and administration: Operations Support Training

Grid developments and middleware components, 19 July Present: Globus A software toolkit addressing certain technical problems in the development of Grid enabled tools, services, and applications  Offers a modular “bag of technologies”  Made available under liberal open source license Not turnkey solutions, but building blocks and tools for application developers and system integrators

Grid developments and middleware components, 19 July Globus: Key components Grid Security Infrastructure (GSI)  X.509 authentication with delegates and single sign-on Grid Resource Allocation Mgmt (GRAM)  Remote allocation, monitoring of job, control of compute resources GridFTP protocol (FTP extensions)  High-performance data access & transport Grid Resource Information Service (GRIS) + Monitoring and Discovery Service (MDS)  Access to structure & state information XIO library  TCP, UDP, IP multicast, and file I/O Others…

Grid developments and middleware components, 19 July Notes - VDT “The Virtual Data Toolkit (VDT) is an ensemble of grid middleware that can be easily installed and configured. In our experience, installing grid software is challenging and time consuming. The goal of the VDT is to make it as easy as possible for users to deploy, maintain and use grid middleware.”

Grid developments and middleware components, 19 July Virtual Data Toolkit Condor Group  Condor/Condor-G  DAGMan  Fault Tolerant Shell  ClassAds Globus Alliance  Job submission (GRAM)  Information service (MDS)  Data transfer (GridFTP)  Replica Location (RLS) EDG & LCG  Make Gridmap  Certificate Revocation List Updater  GLUE Schema ISI & UC  Chimera & Pegasus NCSA  MyProxy  GSI OpenSSH  UberFTP LBL  PyGlobus  Netlogger Caltech  MonaLisa VDT  VDT System Profiler  Configuration software Others  KX509 (U. Mich.)

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

Grid developments and middleware components, 19 July User-view of EGEE: a multi-VO Grid User Interface Grid services User Interface

Grid developments and middleware components, 19 July Middleware components ReplicaCatalogue Logging & Book-keeping ResourceBrokerStorageElementComputingElement InformationService Job Status DataSets info Author. &Authen. Job Submit Event Job Query Job Status Input “sandbox” Input “sandbox” + Broker Info Output “sandbox” Publish SE & CE info “User interface”

Grid developments and middleware components, 19 July Workload Management System (WMS) Distributed scheduling  multiple UI’s where you submit your job  multiple RB’s from where the job is sent to a CE  multiple CE’s where the job can be put in a queuing system Distributed resource management  multiple information systems that monitor the state of the grid  Information from SE, CE, sites

Grid developments and middleware components, 19 July Authentication, Authorisation Authentication  User obtains certificate from CA  Connects to UI by ssh  Downloads certificate  Invokes Proxy server  Single logon – to UI - then Secure Socket Layer with proxy identifies user to other nodes Authorisation - currently  User joins Virtual Organisation  VO negotiates access to Grid nodes and resources (CE, SE)  Authorisation tested by CE, SE: gridmapfile maps user to local account UI CA VO mgr Personal VO database Gridmapfiles On CE, SE nodes SSL (proxy) VO service

Grid developments and middleware components, 19 July User Interface node The user’s interface to the Grid Command-line interface to  Proxy server  Job operations To submit a job Monitor its status Retrieve output  Data operations Upload file to SE Create replica Discover replicas  Other grid services Also C++ and Java APIs To run a job user creates a JDL (Job Description Language) file UI JDL

Grid developments and middleware components, 19 July “Compute element” in LCG-2 A CE is a grid batch queue with a “grid gate” front-end: Homogeneous set of worker nodes Grid gate node Local resource management system: Condor / PBS / LSF master Globus gatekeeper Job request Info system Logging gridmapfile I.S. Logging

Grid developments and middleware components, 19 July Storage elements and files Storage elements hold files: write once, read many Replica files can be held on different SE:  “close” to CE; share load on SE Replica Catalogue - what replicas exist for a file? Replica Location Service - where are they? Local Info Event Logging gridmapfile GridFTP Disk arrays or tapes Info system Logging Globus gatekeeper File transferRequests

Grid developments and middleware components, 19 July Naming Conventions  Logical File Name (LFN) An alias created by a user to refer to some item of data e.g. “lfn:cms/ /run2/track1”  Site URL (SURL) (or Physical File Name (PFN)) The location of an actual piece of data on a storage system e.g. “srm://pcrd24.cern.ch/flatfiles/cms/output10_1”  Globally Unique Identifier (GUID) A non-human readable unique identifier for an item of data e.g. “guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6” Logical File Name 1 Logical File Name 2 Logical File Name n GUID Physical File SURL n Physical File SURL 1

Grid developments and middleware components, 19 July Storage Element Data Replication Services: Basic Functionality Replica Manager Replica Location Service Replica Metadata Catalog Storage Element Files have replicas stored at many Grid sites on Storage Elements. Each file has a unique GUID. Locations corresponding to the GUID are kept in the Replica Location Service. Users may assign aliases to the GUIDs. These are kept in the Replica Metadata Catalog. The Replica Manager provides atomicity for file operations, assuring consistency of SE and catalog contents.

Grid developments and middleware components, 19 July Resource Broker nodes Run the Workload Management System  To accept job submissions  Dispatch jobs to appropriate Compute Element (CE)  Allow users To get information about their status To retrieve their output A configuration file on each UI node determines which RB node(s) will be used When a user submits a job, JDL options are to:  Specify CE  Allow RB to choose CE (using optional tags to define requirements)  Specify SE (then RB finds “nearest” appropriate CE, after interrogating Replica Location Service)

Grid developments and middleware components, 19 July Logging and Book-keeping Who did what when?? What’s happening to my job? Usually runs on Resource Broker node See LCG-2 user guide for a bit more on this

Grid developments and middleware components, 19 July Information System Receives periodic (~5 minutes) updates from CE, SE Used by RB node to determine resources to be used by a job “Leaf/node” system: currently BDII is used CE SECESE Site Site aSite b Element

Grid developments and middleware components, 19 July Outline Grid developments from an EGEE perspective:  Creating e-Infrastructure  Building on and with other Grid projects  Towards service-orientation  Establishing a “production Grid” Overview of the middleware of the current EGEE-0 system  Major components  Lifecycle of a job Summary

UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status

UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status submitted Job Status UI: allows users to access the functionalities of the WMS (via command line, GUI, C++ and Java APIs)

UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status edg-job-submit myjob.jdl Myjob.jdl JobType = “Normal”; Executable = "$(CMS)/exe/sum.exe"; InputSandbox = {"/home/user/WP1testC","/home/file*”, "/home/user/DATA/*"}; OutputSandbox = {“sim.err”, “test.out”, “sim.log"}; Requirements = other. GlueHostOperatingSystemName == “linux" && other. GlueHostOperatingSystemRelease == "Red Hat 7.3“ && other.GlueCEPolicyMaxCPUTime > 10000; Rank = other.GlueCEStateFreeCPUs; submitted Job Statu s Job Description Language (JDL) to specify job characteristics and requirements

UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Input Sandbox files Job waiting submitted Job Status NS: network daemon responsible for accepting incoming requests

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status WM: responsible to take the appropriate actions to satisfy the request Job

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Where must this job be executed ?

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Matchmaker: responsible to find the “best” CE where to submit a job

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker Where are (which SEs) the needed data ? What is the status of the Grid ?

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Match- Maker/ Broker CE choice

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage waiting submitted Job Status Job Adapter JA: responsible for the final “touches” to the job before performing submission (e.g. creation of wrapper script, etc.)

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Job Status JC: responsible for the actual job management operations (done via CondorG) Job submitted waiting ready

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node CE characts & status SE characts & status RB storage Job Status Job Input Sandbox files submitted waiting ready scheduled

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node RB storage Job Status Input Sandbox submitted waiting ready scheduled running “Grid enabled” data transfers/ accesses Job

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox files submitted waiting ready scheduled running done

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox submitted waiting ready scheduled running done edg-job-get-output

Job submission UI Network Server Job Contr. - CondorG Workload Manager Replica Location Server Inform. Service Computing Element Storage Element RB node RB storage Job Status Output Sandbox files submitted waiting ready scheduled running done cleared

Job monitoring UI Log Monitor Logging & Bookkeeping Network Server Job Contr. - CondorG Workload Manager Computing Element RB node LM: parses CondorG log file (where CondorG logs info about jobs) and notifies LB LB: receives and stores job events; processes corresponding job status Log of job events edg-job-status edg-job-get-logging-info Job status

Grid developments and middleware components, 19 July Other UI commands edg-job-list-match  Lists resources matching a job description  Performs the matchmaking without submitting the job edg-job-cancel  Cancels a given job edg-job-status  Displays the status of the job edg-job-get-output  Returns the job-output (the OutputSandbox files) to the user edg-job-get-logging-info  Displays logging information about submitted jobs (all the events “pushed” by the various components of the WMS)  Very useful for debug purposes

Grid developments and middleware components, 19 July Storage Element Replication Services: Basic Functionality Replica Manager Replica Location Service Replica Metadata Catalog Storage Element Files have replicas stored at many Grid sites on Storage Elements. Each file has a unique GUID. Locations corresponding to the GUID are kept in the Replica Location Service. Users may assign aliases to the GUIDs. These are kept in the Replica Metadata Catalog. The Replica Manager provides atomicity for file operations, assuring consistency of SE and catalog contents.

Grid developments and middleware components, 19 July Storage Element Higher Level Replication Services Replica Manager Replica Location Service Replica Optimization Service Replica Metadata Catalog SE Monitor Network Monitor Storage Element The Replica Manager may call on the Replica Optimization service to find the best replica among many based on network and SE monitoring. Hooks for user-defined pre- and post- processing for replication operations are available.

Grid developments and middleware components, 19 July Naming Conventions  Logical File Name (LFN) An alias created by a user to refer to some item of data e.g. “lfn:cms/ /run2/track1”  Site URL (SURL) (or Physical File Name (PFN)) The location of an actual piece of data on a storage system e.g. “srm://pcrd24.cern.ch/flatfiles/cms/output10_1”  Globally Unique Identifier (GUID) A non-human readable unique identifier for an item of data e.g. “guid:f81d4fae-7dec-11d0-a765-00a0c91e6bf6” Logical File Name 1 Logical File Name 2 Logical File Name n GUID Physical File SURL n Physical File SURL 1

Grid developments and middleware components, 19 July Replica Metadata Catalog (RMC) vs. Replica Location Service (RLS) RMC:  Stores LFN-GUID mappings RLS:  Stores GUID-SURL mappings Logical File Name 1 Logical File Name 2 Logical File Name n GUID Physical File SURL n Physical File SURL 1 RMC RLS RM RLS ROS RMC

Grid developments and middleware components, 19 July Replica Location Service (RLS) The Replica Location Service is a system that maintains and provides access to information about the physical location of copies of data files. It is a distributed service that stores mappings between globally unique identifiers of datafiles and the physical identifiers of all existing replicas of these datafiles. Design was a collaboration between Globus and EDG RM RLS ROS RMC

Grid developments and middleware components, 19 July Job submission edg-job-submit [–r ] [-c ] [-vo ] [-o ]  -r the job is submitted directly to the computing element identified by  -c the configuration file is pointed by the UI instead of the standard configuration file  -vo the Virtual Organization (if user is not happy with the one specified in the UI configuration file)  -o the generated edg_jobId is written in the Useful for other commands, e.g.: edg-job-status –i (or edg_jobId) -i the status information about edg_jobId contained in the are displayed

Grid developments and middleware components, 19 July Job Definition Attributes Executable (mandatory)  The command name Arguments (optional)  Job command line arguments StdInput, StdOutput, StdErr (optional)  Standard input/output/error of the job Environment (optional)  List of environment settings InputSandbox (optional)  List of files on the UI local disk needed by the job for running  The listed files are staged from the UI to the remote CE OutputSandbox (optional)  List of files, generated by the job, which have to be retrieved

Grid developments and middleware components, 19 July Resource Attributes Requirements  Job requirements on computing resources  Specified using attributes of resources published in the Information System  If not specified, default value defined in UI configuration file is considered Default: other.GlueCEStateStatus == "Production" (the resource has to be in the Production grid) Rank  Expresses preference (how to rank resources that have already met the Requirements expression)  Specified using attributes of resources published in the Information Service  If not specified, default value defined in the UI configuration file is considered Default: - other.GlueCEStateFreeCPUs (the highest number of free CPUs)

Grid developments and middleware components, 19 July “Data” Attributes InputData (optional)  Refers to data used as input by the job: these data are published in the Replica Catalog and stored in the SEs)  PFNs and/or LFNs DataAccessProtocol (mandatory if InputData specified)  The protocol or the list of protocols which the application is able to speak with for accessing InputData on a given SE OutputSE (optional)  The hostname of the output SE  RB uses it to choose a CE that is compatible with the job and is close to SE OutputData (optional)  Output Data that will be registered at the end of the job

Grid developments and middleware components, 19 July Example JDL File Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:testbed ”; DataAccessProtocol = “gridftp”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX” && other.FreeCpus >=4; Rank = “other.GlueHostBenchmarkSF00”;

Grid developments and middleware components, 19 July Summary… 1 EGEE is creating a production-quality Grid as a step towards an emerging Europe-wide e-Infrastructure  Secure, reliable, sustainable  Wide spectrum of VOs  Integrating with national, regional, international grids and networks EGEE is reengineering middleware, with Service Orientation The LCG is providing a service now EGEE-0 components, Job submission and life-cycle have been described….

Grid developments and middleware components, 19 July Summary -2: EGEE components ReplicaCatalogue Logging & Book-keeping ResourceBrokerStorageElementComputingElement InformationService Job Status DataSets info Author. &Authen. Job Submit Event Job Query Job Status Input “sandbox” Input “sandbox” + Broker Info Output “sandbox” Publish SE & CE info “User interface”