INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org Middleware Concepts and Services in Campus, National and International Grids Mike Mineter.

Slides:



Advertisements
Similar presentations
The National Grid Service Mike Mineter.
Advertisements

The National Grid Service and OGSA-DAI Mike Mineter
Current status of grids: the need for standards Mike Mineter TOE-NeSC, Edinburgh.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
High Performance Computing Course Notes Grid Computing.
INFSO-RI Enabling Grids for E-sciencE Concepts of grid computing Guy Warner NeSC Training Team
INFSO-RI Enabling Grids for E-sciencE EGEE Middleware The Resource Broker EGEE project members.
The UK National Grid Service Using the NGS. Outline NGS Background Getting Certificates Acceptable usage policies Joining VO’s What resources will be.
OxGrid, A Campus Grid for the University of Oxford Dr. David Wallom.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
DISTRIBUTED COMPUTING
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Overview of Grid Computing Richard Hopkins Training Outreach and Education National.
Computational grids and grids projects DSS,
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
1 Overview of the Application Hosting Environment Stefan Zasada University College London.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
The National Grid Service Guy Warner.
Production Grids Mike Mineter NeSC-TOE. EU project: RIO31844-OMII-EUROPE 2 Production Grids - examples 1.EGEE: Enabling Grids for e-Science 2.National.
Neil Geddes GridPP-10, June 2004 UK e-Science Grid Dr Neil Geddes CCLRC Head of e-Science Director of the UK Grid Operations Support Centre.
The UK’s Grid Operations Support Centre and National Grid Service Core Components of the UK’s e-Infrstructure Neil Geddes Director, GOSC All Hands Meeting,
Training and the NGS Mike Mineter
NGS Innovation Forum, Manchester4 th November 2008 Condor and the NGS John Kewley NGS Support Centre Manager.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Introduction to the EGEE Project Presented by Min Tsai ISGC 2007, Taipei With thanks.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
The National Grid Service Mike Mineter
EGEE-II INFSO-RI Enabling Grids for E-sciencE Application Development on Grids Gergely Sipos MTA SZTAKI Hungarian.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
Next Steps: becoming users of the NGS Mike Mineter
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 14 February 2006.
Creating and running an application.
The National Grid Service Mike Mineter.
1 e-Science AHM st Aug – 3 rd Sept 2004 Nottingham Distributed Storage management using SRB on UK National Grid Service Manandhar A, Haines K,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
EGEE-0 / LCG-2 middleware Practical.
INFSO-RI Enabling Grids for E-sciencE GILDA and GENIUS Guy Warner NeSC Training Team An induction to EGEE for GOSC and the NGS NeSC,
Enabling, facilitating and delivering quality training in the UK and Internationally Introduction to e-science concepts Mike Mineter Training Outreach.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An overview of the EGEE project and middleware Mike Mineter
INFSO-RI Enabling Grids for E-sciencE Web Services Mike Mineter National e-Science Centre, Edinburgh.
The National Grid Service Mike Mineter.
The National Grid Service Mike Mineter
1 Porting applications to the NGS, using the P-GRADE portal and GEMLCA Peter Kacsuk MTA SZTAKI Hungarian Academy of Sciences Centre for.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
UK Grid Operations Support Centre All slides stolen by P.Clarke from a talk given by: Dr Neil Geddes CCLRC Head of e-Science Director of the UK Grid Operations.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre 22 February 2006.
Campus grids: e-Infrastructure within a University Mike Mineter National e-Science Centre
The UK National Grid Service Andrew Richards – CCLRC, RAL.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Accessing the VI-SEEM infrastructure
Workload Management System
Grid Computing.
The National Grid Service
Overview of the EGEE project and middleware
The National Grid Service Mike Mineter NeSC-TOE
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE Middleware Concepts and Services in Campus, National and International Grids Mike Mineter Training Outreach and Education, Edinburgh

Enabling Grids for E-sciencE INFSO-RI Policy for re-use This presentation can be re-used for academic purposes. However if you do so then please let training- know. We need to gather statistics of re-use: no. of events, number of people trained. Thank you!!training-

Enabling Grids for E-sciencE INFSO-RI Outline Concepts: Grids enable “Virtual computing across administrative domains” Current middleware in –International - EGEE grid –National Grid Service –Campus Grids Service orientation –Web services and Grids: “Open Grid Services Architecture”

Enabling Grids for E-sciencE INFSO-RI Grid Middleware When using a PC or workstation you –Login with a username and password (“Authentication”) –Use rights given to you (“Authorisation”) –Run jobs –Manage files: create them, read/write, list directories Components are linked by a bus Operating system One admin domain When using a Grid you –Login with digital credentials – single sign- on (“Authentication”) –Use rights given you (“Authorisation”) –Run jobs –Manage files: create them, read/write, list directories Services are linked by the Internet Middleware Many admin domains

Enabling Grids for E-sciencE INFSO-RI INTERNET Virtual organisations (collaborations) negotiate with sites to agree access to resources Grid middleware runs on each shared resource to provide –Data services –Computation services –Single sign-on Distributed services (both people and middleware) enable the grid Typical current grid

Enabling Grids for E-sciencE INFSO-RI The vital layer Where computer science meets the application communities! Developments built on higher- level tools and core services Makes Grid services useable by non-specialists Grids provide the compute and data storage resources Basic Grid services: AA, job submission, info, … Higher-level grid services (brokering,…) Application toolkits, ….. Application Production grids provide these core services.

Enabling Grids for E-sciencE INFSO-RI The many scales of grids Campus grids: PlymGrid Regional grids (e.g. White Rose Grid) National grids (National Grid Service) International grid (EGEE) Wider collaboration greater resources National datacentres, HPC, instruments Institutes’ data; Condor pools International instruments,.. Desktop

Enabling Grids for E-sciencE INFSO-RI EGEE: “Enabling Grids for e-Science” and the gLite middleware

Enabling Grids for E-sciencE INFSO-RI EGEE – international e-infrastructure Network infrastructure & Resource centres Operations, Support and training Collaboration Pan-European Grid A four year programme: Build, deploy and operate a consistent, robust a large scale production grid service that –Links with and build on national, regional and international initiatives Improve and maintain the middleware in order to deliver a reliable service to users Attract new users from research and industry and ensure training and support for them

Enabling Grids for E-sciencE INFSO-RI Access service Access serviceHow users logon to a Grid Computing Element (CE) Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE) Storage Element (SE): provides (large-scale) storage for files Resource Broker (RB) Resource Broker (RB): Service that matches the user’s requirements with the available resources on a Grid Main components Information System Information System: Characteristics and status of resources

Enabling Grids for E-sciencE INFSO-RI Current EGEE grid File Replica Catalogue Logging & Book-keeping ResourceBroker StorageResource ComputingResource = batch queue InformationService Job Status Datasets info Author. &Authen. Job Submit Event Job Query Job Status Input files Input Output Output files Publish Resource info User/Grid interface

Enabling Grids for E-sciencE INFSO-RI Example JDL file Executable = “gridTest”; StdError = “stderr.log”; StdOutput = “stdout.log”; InputSandbox = {“/home/joda/test/gridTest”}; OutputSandbox = {“stderr.log”, “stdout.log”}; InputData = “lfn:/grid/gilda/training/testbed ”; DataAccessProtocol = “gridftp”; Requirements = other.Architecture==“INTEL” && \ other.OpSys==“LINUX”; Rank = “other.GlueHostBenchmarkSF00”; Building on basic tools and Information Service Submit job to grid via the “resource broker (RB)”, glite_job_submit my.jdl Returns a “job-id” used to monitor job, retrieve output

Enabling Grids for E-sciencE INFSO-RI Who provides the resources?! ServiceProviderNote Access service User / institute/ VO / grid operations Computer with client software Resource Broker (RB) VO / grid operations(No NGS-wide RB) Information System Information System:ditto Computing Element (CE) VO / sometimes centralised provision also Scalability requires that VOs provide resources to match average need Storage Element (SE) ditto “VO”: virtual organisation “Grid operations”: funded effort

Enabling Grids for E-sciencE INFSO-RI VDT VDT: Virtual Data Toolkit –ensemble of grid middleware that can be easily installed and configured –Includes Globus Toolkit, Condor, MyProxy,… – Been used by EGEE

Enabling Grids for E-sciencE INFSO-RI Grid security and trust Providers of resources (computers, databases,..) need risks to be controlled: they are asked to trust users they do not know User’s need –single sign-on: to be able to logon to a machine that can pass the user’s identity to other resources –To trust owners of the resources they are using Build middleware on layer providing: –Authentication: know who wants to use resource –Authorisation: know what the user is allowed to do –Security: reduce vulnerability, e.g. from outside the firewall –Non-repudiation: knowing who did what The “Grid Security Infrastructure” middleware is the basis of (most) production grids

Enabling Grids for E-sciencE INFSO-RI The Role of the Virtual Organisation (VO) Compute Center VO Service slide based on presentation given by Carl Kesselman at GGF Summer School 2004

17 The National Grid Service The core UK grid, resulting from the UK's e-Science programme. Production use of computational and data grid resources. Supported by JISC

18 Leeds and Oxford (core compute nodes) –64 dual CPU intel 3.06GHz (1MB cache). Each node: 2GB memory, 2x120GB disk, Redhat ES3.0. Gigabit Myrinet connection. 2TB data server. Manchester and Rutherford Appleton Laboratory (core data nodes) –20 dual CPU (as above). 18TB SAN. Bristol –initially GHz Athlon processors in 10 dual CPU nodes. Cardiff –1000 hrs/week on a SGI Origin system comprising 4 dual CPU Origin 300 servers with a Myrinet™ interconnect. Lancaster –8 Sun Blade 1000 execution nodes, each with dual UltraSPARC IIICu processors connected via a Dell 1750 head node. Westminster –32 Sun V60 compute nodes HPCx –… NGS Facilities For more details:

19 NGS software Computation services based on Globus Toolkit –Use compute nodes for sequential or parallel jobs, primarily from batch queues –Can run multiple jobs concurrently (be reasonable!) Data services: –Later talk

20 UK National Grid Service Current use cases Data Collaborative research Project support (incl VOMS) Portal Application hosting expose data to grid use

21 NGS – The Future Middleware recently deployed –Portal v2 –GridSAM – alternative job submission and monitoring Being deployed –VOMS Under development –Shibboleth integration

Enabling Grids for E-sciencE INFSO-RI Campus “grids”

Enabling Grids for E-sciencE INFSO-RI Harvesting CPU time Teaching labs. + Researchers Often-idle processors!! Analyses constrained by CPU time!

Enabling Grids for E-sciencE INFSO-RI Harvesting CPU time - 2 Teaching lab machines lie idle for most of the time Harvest spare compute cycles to create a low-cost “high throughput computing” (HTC) platform –Goal: run many tasks in a week, month, … –Typically: many similar tasks invoked from workflow or a script  Monte-Carlo  Simulation – parameter sweeps Pool of processors are a batch processing resource Condor most common approach – Job is described in a text file “Match-maker” selects appropriate resource

Enabling Grids for E-sciencE INFSO-RI Campus grids Resources in many administrative domains Need basic services that provide: –Authentication, Authorisation mechanisms  Based on certificates  Single sign-on to access many resources  Control of who can do what –Job submission services  Submit jobs to batch queues on clusters or Condor pools –Information systems  So you know what can be used –Ability to share data

Enabling Grids for E-sciencE INFSO-RI Middleware in (some) campus grids Condor –Harvest compute cycles Globus toolkit –Tools built on Grid Security Infrastructure and include:  Job submission: run a job on a remote computer  Information services: So I know which computer to use  … Storage Resource Broker –Sharing of files held in SRB vaults in multiple locations SRB and Globus Toolkit are part of the National Grid Service stack

27 Example: OxGrid, a University Campus Grid Single entry point for Oxford users to shared and dedicated resources Seamless access to National Grid Service and Oxford Supercomputing Centre for registered users Single sign-on using PKI technology integrated with current methods NGSOSC OxGrid Central Management Broker MDS/ VOM Storage College Resources Departmental Resources Oxford Users David Wallom

Enabling Grids for E-sciencE INFSO-RI Service oriented middleware

Enabling Grids for E-sciencE INFSO-RI Service orientation – software components that are… Accessible across a network Loosely coupled, defined by the messages they receive / send Interoperable: each service has a description that is accessible and can be used to create software to invoke that service Based on standards (for which tools do / could exist) Developed in anticipation of new uses Service Registry Client

Enabling Grids for E-sciencE INFSO-RI Web Services Grid Technology Grid Services October 2001 View Commerce Standards Tools Research driven Data-intensive Compute intensive Collaboration – sharing of resources - Trust: opening resources infrastructure for the information society

Enabling Grids for E-sciencE INFSO-RI Extending Web Service standards Web Services Short-lived Interactions Call-return interaction Grids Persistence – –Infrastructure –Computation –Data –People also, Event-driven interactions Need to add to basic web services the notion of persistency STATEFUL SERVICES

Enabling Grids for E-sciencE INFSO-RI Resources and Event-driven Interaction Resource –Existence  creation  identifier  deletion  lifetime –Resource Properties – state elements  XML representation  get  put  partial get  partial put Event Driven –Subscribe to a topic  itself a resource –Notify a topic-relevant event WS Resource Framework WS Notification WS Transfer WS Eventing WS Resource Transfer WS Event Notification

Enabling Grids for E-sciencE INFSO-RI Summary Widely used middleware: –Grid Security Infrastructure:  Authorisation and authentication underpins it all –Compute services  Condor – cycle harvesting from pools of processors  Globus – job submission across administrative domains –Data services  Growth area !! –“Higher level services” built on these  Portals,… Service orientation –Basis of emerging standards built on web services

Enabling Grids for E-sciencE INFSO-RI Further reading Open Grid Forum Globus Alliance VDT Grid Today The Grid Cafe

Enabling Grids for E-sciencE INFSO-RI EGEE - Further information EGEE Conference: September conference-September-2006-geneva/ conference-September-2006-geneva/ EGEE digital library: EGEE EGEE: 1 st user Forum Forum Forum gLite

36 NGS Web Sites NGS – –To see what’s happening: –New wiki service: –Training events: HPCx –