Presentation is loading. Please wait.

Presentation is loading. Please wait.

An Introduction to Grid Computing - David Groep 2004.09.13 1 Grid Computing Introduction David Groep NIKHEF Physics Data Processing Group.

Similar presentations


Presentation on theme: "An Introduction to Grid Computing - David Groep 2004.09.13 1 Grid Computing Introduction David Groep NIKHEF Physics Data Processing Group."— Presentation transcript:

1 An Introduction to Grid Computing - David Groep 2004.09.13 1 Grid Computing Introduction David Groep NIKHEF Physics Data Processing Group

2 An Introduction to Grid Computing - David Groep 2004.09.13 2 Talk Outline The vision A problem and a computing model What makes a Grid –Authentication and Authorization –Protocols and Standards –Putting it together: Collective Services Into production: the LGC Computing Grid Building your own Grid

3 An Introduction to Grid Computing - David Groep 2004.09.13 3 Grid – a vision The GRID: networked data processing centres and ”middleware” software as the “glue” of resources. Researchers perform their activities regardless geographical location, interact with colleagues, share and access data Scientific instruments and experiments provide huge amounts of data Federico.Carminati@cern.ch

4 An Introduction to Grid Computing - David Groep 2004.09.13 4  Place event info on 3D map  Trace trajectories through hits  Assign type to each track  Find particles you want  Needle in a haystack!  This is “relatively easy” case A Glimpse of the Problem in HEP

5 An Introduction to Grid Computing - David Groep 2004.09.13 5 level 1 - special hardware 40 MHz (40 TB/sec) level 2 - embedded processors level 3 - PCs 75 KHz (75 GB/sec) 5 KHz (5 GB/sec) 100 Hz (100 MB/sec) data recording & offline analysis HEP Data Rates Reconstruct & analyze 1 event takes about 90 s Maybe only a few out of a million are interesting. But we have to check them all! Analysis program needs lots of calibration; determined from inspecting results of first pass.  Each event will be analyzed several times! Raw data rate ~ 5PByte/yr/expt. total volume: ~20 Pbyte/yr per major centre: ~2 PByte/yr The ATLAS experiment

6 An Introduction to Grid Computing - David Groep 2004.09.13 6 LHC User Distribution Putting all computers in one spot leads to traffic jams Which spot is willing to pay for & maintain 100k CPUs?

7 An Introduction to Grid Computing - David Groep 2004.09.13 7 Computing models Mini Computer Microcomputer Cluster (by Christophe Jacquet) Once upon a time…….. mainframe

8 An Introduction to Grid Computing - David Groep 2004.09.13 8 The Grid & Distributed Computing (by Christophe Jacquet) …and today

9 An Introduction to Grid Computing - David Groep 2004.09.13 9 Realizing the Grid Vision Grid was the logical next step in the end of the 1990: Harnassing desktop power became commonplace – 1988: Condor, later: SETI@Home, Entropia, Distributed.NET Peer-to-peer data access protocols emerged ~ 1999: Napster, later: Gnutella, KaZaa, BitTorrent Network access became extremely fast – 1997: wide area bandwidth starts to double every 9 months! Meta-computing experiments – 1995: I-Way, GUSTO, …

10 An Introduction to Grid Computing - David Groep 2004.09.13 10 Beyond meta-computing: the Grid A grid integrates resources that are –not owned or administered by one single organisation –speak a common, open protocol … that is generic –working as a coordinated, transparent system And … –can be used by many people from multiple organisations –that work together in one Virtual Organisation

11 An Introduction to Grid Computing - David Groep 2004.09.13 11 Virtual Organisations A set of individuals or organisations, not under single hierarchical control, temporarily joining forces to solve a particular problem at hand, bringing to the collaboration a subset of their resources, sharing those at their discretion and each under their own conditions. A VO is a temporary alliance of stakeholders – Users – Service providers – Information Providers

12 An Introduction to Grid Computing - David Groep 2004.09.13 12 Coordination and Security Parties have no a-priori trust relationship –need for trusted third parties (TTPs): PKI Certificates –should span whole Grid infrastructure Community formation is independent of resources –VOs as a whole negotiate access to resources –community management is done by the VO –VO establishment/change/liquidation can be rapid –needs Grid-wide authorization/enforcement solution Grid Security Infrastructure (GSI)

13 An Introduction to Grid Computing - David Groep 2004.09.13 13 Certification Authorities Alice (d,n)(d,n) (e,n)(e,n) CommonName=‘Alice’ Organization=‘KNMI’ Certificate Request CA key CA cert (self-signed) Alice… CA checks identifiers against identity of requestor sign request with CA key ship to Alice and publish Alice generates key pair and ships `request’ to CA

14 An Introduction to Grid Computing - David Groep 2004.09.13 14 Your certificate Your private key is valuable, keep it safe –protected with a pass phrase (conventional symmetric crypto) –store it securely –make proxies for signing on to the grid Find all your credential data in $HOME/.globus/ –Private key in “userkey.pem” –Public key certificate in “usercert.pem” –CA’s that you trust in “~/.globus/certificates/” CAUserUser proxy

15 An Introduction to Grid Computing - David Groep 2004.09.13 15 CAs everywhere Everyone (almost) in Europe has a national CA  Green: CA Accredited  Yellow: t.b. discussed (march 2004) Other Accredited CAs:  DoEGrids (US)  GridCanada  ASCCG (Taiwan)  ArmeSFO (Armenia)  CERN  Russia (HEP)  FNAL Service CA (US)  Israel  Pakistan  …

16 An Introduction to Grid Computing - David Groep 2004.09.13 16 Authorization: grid-mapfile and ACs Each resource can authorize or ban users Tedious but simple way: grid-mapfile $ cat /etc/grid-security/grid-mapfile "/O=dutchgrid/O=users/O=nikhef/CN=David Groep" davidg "/O=dutchgrid/O=users/O=nikhef/CN=Michiel Botje" h24 "/O=dutchgrid/O=users/O=sara/CN=Ron Trompert" ront "/O=dutchgrid/O=users/O=nikhef/CN=Jeffrey Templon" aliprod VO-managed membership lists: edg-mkgridmap $ cat /opt/edg/etc/edg-mkgridmap.conf group ldap://grid-vo.nikhef.nl/ou=lcg1,o=alice,dc=cern,dc=ch.alice $ cat /var/adm/grid-mapfile "/C=IT/O=INFN/L=Catania/CN=Roberto Barbera".alice dynamic communities and multi-VO membership: VOMS and/or Attribute Certificates

17 An Introduction to Grid Computing - David Groep 2004.09.13 17 ‘Common and open protocols’ Applications Grid Services GRAM Grid Security Infrastructure (GSI) Fabric FARMSMPPsDesktopsHPSSEquipment Application Toolkits EDG RBMPICH-G2Condor-G GridFTPBDII Genius RLS/RMC MySQL

18 An Introduction to Grid Computing - David Groep 2004.09.13 18 Current protocols Protocols used in production today (legacy): GridFTP – data transfer GRAM – job submission LDAP and GLUE – information system (MDS, BDII) New direction: Web Services Web Services Resource Framework WSRF syntax for service access Open Grid Services Architecture OGSA composition and behaviour

19 An Introduction to Grid Computing - David Groep 2004.09.13 19 What is a WS-Resource? Web service: Operation execution component made available at an endpoint address –Implementation often stateless, but accesses state WS-Resource: Web service + associated resource –Equivalently: A resource with an associated WS A WS-Resource has –Identity: Can be uniquely identified/referenced –Lifetime: Often created & destroyed by clients –State: Can be projected as an XML document WS-Resource type = Web service interface WS-Resources are not just for physical devices –Jobs, subscriptions, logical data sets, etc. slide by Steve Tuecke

20 An Introduction to Grid Computing - David Groep 2004.09.13 20 Interface Web Service WSDL Run-time environment Web Services Model slide by Steve Tuecke

21 An Introduction to Grid Computing - David Groep 2004.09.13 21 Interface Web Service message Invoking a Web Service address message Endpoint Reference Run-time environment Web Services Model slide by Steve Tuecke

22 An Introduction to Grid Computing - David Groep 2004.09.13 22 context Interface Web Service message id message Using a Web service to access a resource id address resource Run-time environment Endpoint Reference WS-Resource Framework Model Address Resource id slide by Steve Tuecke

23 An Introduction to Grid Computing - David Groep 2004.09.13 23 Access in a coordinated way Transparently crossing of domain boundaries satisfying constraints of – site autonomy – authenticity, integrity, confidentiality single sign-on to all services ways to address services collectively –via command-line tools or submission portals (existing apps) –via API calls to Grid middleware

24 An Introduction to Grid Computing - David Groep 2004.09.13 24 Grid Security In Action User Single sign-on via “grid-id” & generation of proxy cred. Or: retrieval of proxy cred. from online repository User Proxy Proxy credential Communication* Site C (Kerberos) Storage system GSI-enabled FTP server Authorize Map to local id Access file Remote file access request* Remote process creation requests* * With mutual authentication Site A (Kerberos) Computer GSI-enabled GRAM server Process Kerberos ticket Restricted proxy Local id Site B (Unix) Computer GSI-enabled GRAM server Process Restricted proxy Local id Authorize Map to local id Create process Generate credentials Ditto slide by Steve Tuecke

25 An Introduction to Grid Computing - David Groep 2004.09.13 25 C = DS = = Grid software service (like http server) Information System C C C C C C C Information System is Central Nervous System of Grid Info system defines grid slide by Jeff Templon

26 An Introduction to Grid Computing - David Groep 2004.09.13 26 C = DS = = Grid software service I.S. C C C C C C C DS D.M.S Data Grid slide by Jeff Templon

27 An Introduction to Grid Computing - David Groep 2004.09.13 27 C = DS = = Grid software service I.S. C C C C C C C DS D.M.S Computing Task Submission W.M.S. proxy + command; (data); Get fresh, detailed info Coarse Requirements Candidate Clusters slide by Jeff Templon

28 An Introduction to Grid Computing - David Groep 2004.09.13 28 DS List of best locations C = DS = = Grid software service I.S. C D.M.S Computing Task Execution W.M.S. proxy + command; (data); logger Where is my data? proxy Find DMS slide by Jeff Templon

29 An Introduction to Grid Computing - David Groep 2004.09.13 29 DS C = = = Grid software service I.S. C D.M.S Computing Task Execution W.M.S. logger How to contact O.D.S.? Where do I put the data? proxy + data Register output Done slide by Jeff Templon

30 An Introduction to Grid Computing - David Groep 2004.09.13 30 New Stuff Directed Acyclic Graphs Job Optimizers –move job to data –pre-stage data with job –fastest turn-around –cheapest –…

31 An Introduction to Grid Computing - David Groep 2004.09.13 31 Realising the Grid Vision Grid was the logical next step in the end of the 1990: Harnassing desktop power became commonplace – 1988: Condor, later: SETI@Home, Entropia, Distributed.NET Peer-to-peer data access protocols emerged – 1999: Napster, later: Gnutella, KaZaa, BitTorrent Network access became extremely fast – 1997: wide area bandwidth starts to double every 9 months! 1997: Globus starts developing basic middleware – 1996: middleware by Legion, 2000: Unicore Massive take-up of the Grid vision in 1999 – lead in Europe by the EU DataGrid – others include: NASA-IPG, CrossGrid, GridLab, PPDG, Alliance, Global Production Grids since 2003 – LHC Computing Grid project (LCG) – Enabling Grids for e-Science Europe (EGEE)

32 An Introduction to Grid Computing - David Groep 2004.09.13 32 Building the Grid: LCG

33 An Introduction to Grid Computing - David Groep 2004.09.13 33 Some of the Resources 1.2 PByte near-line StorageTek 36 node IA32 cluster ‘matrix’ 468 CPU IA64 + 1024 CPU MIPS multi-Gbit links to 100TByte cache 7 TByte cache 140 nodes IA32 1Gbit link SURFnet multiple links with SARA

34 An Introduction to Grid Computing - David Groep 2004.09.13 34 applet and video by Stuart Wakefield, IC London

35 An Introduction to Grid Computing - David Groep 2004.09.13 35 Building your own grid After the break …


Download ppt "An Introduction to Grid Computing - David Groep 2004.09.13 1 Grid Computing Introduction David Groep NIKHEF Physics Data Processing Group."

Similar presentations


Ads by Google