INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org Overview of e-Infrastructure Mike Mineter Training Outreach and Education National e-Science.

Slides:



Advertisements
Similar presentations
1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
Advertisements

Distributed Systems Architectures
Chapter 7 System Models.
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
Cultural Heritage in REGional NETworks REGNET T1.4: Development of the system specification.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination. Introduction to the Business.
1 Introducing the Specifications of the Metro Ethernet Forum MEF 19 Abstract Test Suite for UNI Type 1 February 2008.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
Conference xxx - August 2003 Fabrizio Gagliardi EDG Project Leader and EGEE designated Project Director Position paper Delivery of industrial-strength.
An open source approach for grids Bob Jones CERN EU DataGrid Project Deputy Project Leader EU EGEE Designated Technical Director
1 Preliminary results of the Environmental Data Exchange Network for Inland Waters (EDEN-IW) project Practical lessons. P. Haastrup.
5-Dec-02D.P.Kelsey, GridPP Security1 GridPP Security UK Security Workshop 5-6 Dec 2002, NeSC David Kelsey CLRC/RAL, UK
The National Grid Service Mike Mineter.
NGS computation services: API's,
Research Councils ICT Conference Welcome Malcolm Atkinson Director 17 th May 2004.
National e-Science Centre Glasgow e-Science Hub Opening: Remarks NeSCs Role Prof. Malcolm Atkinson Director 17 th September 2003.
Open Grid Service Architecture - Data Access & Integration (OGSA-DAI) Dr Martin Westhead Principal Consultant, EPCC Telephone: Fax:+44.
The National Grid Service and OGSA-DAI Mike Mineter
Current status of grids: the need for standards Mike Mineter TOE-NeSC, Edinburgh.
SWITCH Visit to NeSC Malcolm Atkinson Director 5 th October 2004.
02/07/03 Grid Support Centre 1 UK Grid Support Centre Alistair Mills CLRC e-Science Centre
OMII-UK Steven Newhouse, Director. © 2 OMII-UK aims to provide software and support to enable a sustained future for the UK e-Science community and its.
1 Click here to End Presentation Software: Installation and Updates Internet Download CD release NACIS Updates.
I n t e g r i t y - S e r v i c e - E x c e l l e n c e Headquarters U.S.A.F. 1 Commodity Councils 101 NAME (S) SAF/AQCDATE.
Configuration management
EIS Bridge Tool and Staging Tables September 1, 2009 Instructor: Way Poteat Slide: 1.
Copyright 2007, Information Builders. Slide 1 Introduction to Web Services Efrem Litwin Director, WebFOCUS Integration Products Information Builders.
Sample Service Screenshots Enterprise Cloud Service 11.3.
1 RA III - Regional Training Seminar on CLIMAT&CLIMAT TEMP Reporting Buenos Aires, Argentina, 25 – 27 October 2006 Status of observing programmes in RA.
31242/32549 Advanced Internet Programming Advanced Java Programming
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
Introduction Peter Dolog dolog [at] cs [dot] aau [dot] dk Intelligent Web and Information Systems September 9, 2010.
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
INFSO-RI Enabling Grids for E-sciencE Concepts of grid computing Guy Warner NeSC Training Team
GEODE Workshop 16 th January 2007 Issues in e-Science Richard Sinnott University of Glasgow Ken Turner University of Stirling.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Tutorial Welcome!!
INFSO-RI Enabling Grids for E-sciencE What is Grid Computing? Mike Mineter Training Outreach and Education National e-Science Centre.
DISTRIBUTED COMPUTING
DAME: Distributed Engine Health Monitoring on the Grid
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Overview of Grid Computing Richard Hopkins Training Outreach and Education National.
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
From GEANT to Grid empowered Research Infrastructures ANTONELLA KARLSON DG INFSO Research Infrastructures Grids Information Day 25 March 2003 From GEANT.
Jarek Nabrzyski, Ariel Oleksiak Comparison of Grid Middleware in European Grid Projects Jarek Nabrzyski, Ariel Oleksiak Poznań Supercomputing and Networking.
Training and the NGS Mike Mineter
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
SEEK Welcome Malcolm Atkinson Director 12 th May 2004.
Authors: Ronnie Julio Cole David
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
INFSO-RI Enabling Grids for E-sciencE What is Grid Computing? Richard Hopkins Training Outreach and Education National e-Science.
7. Grid Computing Systems and Resource Management
Enabling, facilitating and delivering quality training in the UK and Internationally Introduction to e-science concepts Mike Mineter Training Outreach.
INFSO-RI Enabling Grids for E-sciencE Overview of e-Infrastructure Mike Mineter Training Team National e-Science Centre
Toward a common data and command representation for quantum chemistry Malcolm Atkinson Director 5 th April 2004.
INFSO-RI Enabling Grids for E-sciencE Web Services Mike Mineter National e-Science Centre, Edinburgh.
The National Grid Service Mike Mineter.
GRIDSTART a European GRID coordination attempt Fabrizio Gagliardi CERN.
GRIDSTART Brussels 20/9/02 1www.gridstart.org GRIDSTART and European activities Dr Francis Wray EPCC The University of Edinburgh.
Welcome Grids and Applied Language Theory Dave Berry Research Manager 16 th October 2003.
INFSO-RI Enabling Grids for E-sciencE What is Grid Computing? Mike Mineter Training Outreach and Education National e-Science Centre.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Ian Bird GDB Meeting CERN 9 September 2003
Grid Application Model and Design and Implementation of Grid Services
Large Scale Distributed Computing
The National Grid Service Mike Mineter NeSC-TOE
Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE Overview of e-Infrastructure Mike Mineter Training Outreach and Education National e-Science Centre

2 You are welcome to re-use these slides. We ask only that you let us know, by to

Enabling Grids for E-sciencE INFSO-RI Contents Introduction to –e-Research and e-Science –Grids –e-Infrastructure Grid concepts Grids - Where are we now? Enabling the research of the future –and for early adopters… the present!

Enabling Grids for E-sciencE INFSO-RI ‘e-Science is about global collaboration in key areas of science, and the next generation of infrastructure that will enable it.’ John Taylor Director General of Research Councils Office of Science and Technology

5 IRAS 25  2MASS 2  DSS Optical IRAS 100  NVSS 20cm GB 6cm ROSAT ~keV WENSS 92cm Observations made across entire electromagnetic spectrum  e.g. different views of a local galaxy Need all of them to understand physics fully Databases are located throughout the world Peter Clarke Virtual Observatories

Biomedical Research Informatics Delivered by Grid Enabled Services Synteny Grid Service blast + VO Authorisation Information Integrator

Enabling Grids for E-sciencE INFSO-RI Engine flight data Airline office Maintenance Centre European data center London Airport New York Airport American data center Grid Diagnostics Centre “A Significant factor in the success of the Rolls-Royce campaign to power the Boeing 7E7 with the Trent 1000 was the emphasis on the new aftermarket support service for the engines provided via DS&S. Boeing personnel were shown DAME as an example of the new ways of gathering and processing the large amounts of data that could be retrieved from an advanced aircraft such as the 7E7, and they were very impressed”, DS&S 2004 XTO Engine Model Case Based Reasoning Signal Data Explorer Companies: Rolls-Royce DS&S Cybula DAME: Grid based tools and Infer- structure for Aero-Engine Diagnosis and Prognosis Universities: York, Leeds, Sheffield, Oxford

climateprediction.net and GENIE Largest climate model ensemble >45,000 users, >1,000,000 model years 10K 2K Response of Atlantic circulation to freshwater forcing

UK Grid for Particle Physics GridPP detectors, 2/3/06

Enabling Grids for E-sciencE INFSO-RI Connecting people: Access Grid Microphones Cameras

Enabling Grids for E-sciencE INFSO-RI What is e-Research? Collaborative research that is made possible by the sharing across the Internet of resources (data, instruments, computation, people’s expertise...) –Crosses organisational boundaries –Often very compute intensive –Often very data intensive –Sometimes large-scale collaboration Began with focus in the “big sciences” hence initiatives are often badged as “e-science” Relevance of “e-science technologies” to new user communities (social science, arts, humanities…) led to the term “e-research”

Grids: a foundation for e-Research sensor nets Shared data archives computers software colleagues instruments Grid e-Science methodologies will rapidly transform science, engineering, medicine and business driven by exponential growth (×1000/decade)  enabling a whole-system approach Diagram derived from Ian Foster’s slide

Enabling Grids for E-sciencE INFSO-RI What is Grid Computing? The grid vision is of “Virtual computing” (+ information services to locate computation, storage resources) –Compare: The web: “virtual documents” (+ search engine to locate them) MOTIVATION: collaboration through sharing resources (and expertise) to expand horizons of –Research –Commerce – engineering, … –Public service – health, environment,…

Enabling Grids for E-sciencE INFSO-RI The Grid Metaphor

Enabling Grids for E-sciencE INFSO-RI What is e-Infrastructure? – Political view A shared resource –That enables science, research, engineering, medicine, industry, … –It will improve UK / European / … productivity  Lisbon Accord 2000  E-Science Vision SR2000 – John Taylor –Commitment by UK government  Sections –Always there  c.f. telephones, transport, power, internet

Enabling Grids for E-sciencE INFSO-RI What is e-Infrastructure? Grids: permit resource sharing across administrative domains Networks: permit communication across geographical distance Supporting organisations –Operations for grids, networks Resources –Computers –Digital libraries –Research data –Instruments Middleware –Authentication, Authorisation –Registries, search engines –Toolkits, environments  E.g. for collaboration Network infrastructure & Resources Operations, Support and training Collaboration Grid

Enabling Grids for E-sciencE INFSO-RI Global Drivers of e-Research Digital technology – exponential growth - e.g. bandwidth Opportunities for e-Infrastructure to support faster, better, different research –Sharing expertise  Support for cooperation and communication –Sharing computation services  E.g. to serve occasional peaks of high demand for computation (especially trivially parallelisable ones) –Sharing data  New sensors and instruments  Databases Based on an infrastructure that requires and enables multidisciplinary research  Requires: IT + domain specialists  Enables: New interdisciplinary research

Enabling Grids for E-sciencE INFSO-RI What is Grid computing? The term “Grid” has become popular! –Sometimes in Industry : “Grids” = clusters  Motivations: better use of resources; scope for commercial services –Also used to refer to the harvesting of donated, unused compute cycles  Climateprediction.net) –These are e-Infrastructure but are not “grids” from the e- Research viewpoint!

Enabling Grids for E-sciencE INFSO-RI Grid concepts

Enabling Grids for E-sciencE INFSO-RI Virtual organisations and grids What’s a Virtual Organisation? –People in different organisations seeking to cooperate and share resources across their organisational boundaries E.g. A research collaboration Each grid is an infrastructure enabling one or more “virtual organisations” to share and access resources Key concept: The ability to negotiate resource-sharing arrangements among a set of participating parties (providers and consumers) and then to use the resulting resource pool for some purpose. (Ian Foster)

Enabling Grids for E-sciencE INFSO-RI INTERNET Virtual organisations negotiate with sites to agree access to resources Grid middleware runs on each shared resource to provide –Data services –Computation services –Single sign-on Distributed services (both people and middleware) enable the grid Typical current grid

Enabling Grids for E-sciencE INFSO-RI Empowering VO’s Where computer science meets the application communities! VO-specific developments: –Portals –Virtual Research Environments –Semantics, ontologies –Workflow –Registries of VO services Basic Grid services: AA, job submission, info, … Middleware: “collective services” Application toolkits, standards Application Production grids provide these services.

Enabling Grids for E-sciencE INFSO-RI Workflow example Taverna in MyGrid “allows the e-Scientist to describe and enact their experimental processes in a structured, repeatable and verifiable way” GUI Workflow language enactment engine

Enabling Grids for E-sciencE INFSO-RI The many scales of grids Campus grids Regional grids (e.g. White Rose Grid) National grids (e.g. National Grid Service) International grid (EGEE) Wider collaboration greater resources National datacentres, HPC, instruments Institutes’ data; Condor pools International instruments,.. Desktop

Enabling Grids for E-sciencE INFSO-RI Access service Access serviceHow users logon to a Grid Computing Element (CE) Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE) Storage Element (SE): provides (large-scale) storage for files Resource Broker (RB) Resource Broker (RB): Service that matches the user’s requirements with the available resources on a Grid Main components Information System Information System: Characteristics and status of resources

Enabling Grids for E-sciencE INFSO-RI Current EGEE grid File Replica Catalogue Logging & Book-keeping ResourceBroker StorageResource ComputingResource = batch queue InformationService Job Status Datasets info Author. &Authen. Job Submit Event Job Query Job Status Input files Input Output Output files Publish Resource info User/Grid interface

Enabling Grids for E-sciencE INFSO-RI Who provides the resources?! ServiceProviderNote Access service User / institute/ VO / grid operations Computer with client software Resource Broker (RB) VO / grid operations(No NGS-wide RB) Information System Information System:ditto Computing Element (CE) VO / sometimes centralised provision also Scalability requires that VOs provide resources to match average need Storage Element (SE) ditto “VO”: virtual organisation “Grid operations”: funded effort

Enabling Grids for E-sciencE INFSO-RI Grid security and trust -1 Providers of resources (computers, databases,..) need risks to be controlled: they are asked to trust users they do not know –They trust a VO –The VO trusts its members User’s need –single sign-on: to be able to logon to a machine that can pass the user’s identity to other resources –To trust owners of the resources they are using Build middleware on layer providing: –Authentication: know who wants to use resource –Authorisation: know what the user is allowed to do –Security: reduce vulnerability, e.g. from outside the firewall –Non-repudiation: knowing who did what The “Grid Security Infrastructure” middleware is the basis of (most) production grids

Enabling Grids for E-sciencE INFSO-RI Grid security and trust -2 Achieved by Certification: –User’s identity has to be certified by one of the national Certification Authorities (CAs)  mutually recognized – In UK go to to find CA’s local “Registration Authorities” – Resources are also certified by CAs User –User joins a VO –Digital certificate is basis of AA –Identity passed to resources you use, where it is mapped to a local account Policies express the rights for a Virtual Organization to use resources

Enabling Grids for E-sciencE INFSO-RI … then where are we now? If “The Grid” vision leads us here…

Enabling Grids for E-sciencE INFSO-RI Grid projects - ~ 2003 Many Grid development efforts — all over the world NASA Information Power Grid DOE Science Grid NSF National Virtual Observatory NSF GriPhyN DOE Particle Physics Data Grid NSF TeraGrid DOE ASCI Grid DOE Earth Systems Grid DARPA CoABS Grid NEESGrid DOH BIRN NSF iVDGL DataGrid (CERN,...) EuroGrid (Unicore) DataTag (CERN,…) Astrophysical Virtual Observatory GRIP (Globus/Unicore) GRIA (Industrial applications) GridLab (Cactus Toolkit) CrossGrid (Infrastructure Components) EGSO (Solar Physics) UK – OGSA-DAI, RealityGrid, GeoDise, Comb-e-Chem, DiscoveryNet, DAME, AstroGrid, GridPP, MyGrid, GOLD, eDiamond, Integrative Biology, … Netherlands – VLAM, PolderGrid Germany – UNICORE, Grid proposal France – Grid funding approved Italy – INFN Grid Eire – Grid proposals Switzerland - Network/Grid proposal Hungary – DemoGrid, Grid proposal Norway, Sweden - NorduGrid

Enabling Grids for E-sciencE INFSO-RI Grids: where are we now? Many key concepts identified and known Many grid projects have tested, and benefit from, these Major efforts now on establishing: –Production Grids for multiple VO’s  “Production” = Reliable, sustainable, with commitments to quality of service In Europe, EGEE In UK, National Grid Service In US, Teragrid and OSG  One stack of middleware that serves many research communities  Establishing operational procedures and organisation –Standards (a slow process) (e.g. Global Grid Forum, ) Service orientation - “the way to build grids”

Enabling Grids for E-sciencE INFSO-RI Where are we now? –user’s view ResearchPilot projects Early adopters Routine production Unimagined possibilities Grids Networks Web Sciences, engineering Arts Humanities e-Soc-Sci Early production grids: UK – National Grid Service International - EGEE

Enabling Grids for E-sciencE INFSO-RI Where are we now?! Standards are emerging… some near acceptance and some being discarded –Standards bodies:  W3C  GGF  OASIS  IETFhttp:// –For a summary see Production grids are based on de-facto standards at present –Inevitably! –GT2 especially –But locks a grid into one middleware stack unable to benefit from the diverse developments of new services Globus Toolkit 4 has been released

Enabling Grids for E-sciencE INFSO-RI National grid initiatives now include… CroGrid

Enabling Grids for E-sciencE INFSO-RI Service-Oriented Architecture Client Registry Service Discovery Invocation Registration

Enabling Grids for E-sciencE INFSO-RI Service orientation – software components that are… Accessible across a network Loosely coupled, defined by the messages they receive / send Interoperable: each service has a description that is accessible and can be used to create software to invoke that service Based on standards (for which tools do / could exist) Developed in anticipation of new uses Service Registry Client

Enabling Grids for E-sciencE INFSO-RI Web Services Grid Technology Grid Services Grid and Web services Commerce Standards Tools Research driven Data-intensive Compute intensive Collaboration intensive Open Grid Services Architecture

Enabling Grids for E-sciencE INFSO-RI A bit of history “Open grid services architecture” OGSA– proposed in 2001 Open Grid Services Infrastructure –Globus Toolkit 3 resulted Then in January 2004 –OGSI to be replaced by emerging WS-RF (Web Services Resource Framework): manage “state” without major rewrite of WS standards WS-I used meanwhile: Open standards: –SOAP: protocol for message passing –Web Service Description Language: to describe services –UDDI: Universal Description, Discovery and Integration –WS-Security: incorporates security

Enabling Grids for E-sciencE INFSO-RI WS & Grid Goals Web Services Goals –Computational presentation & access of Enterprise services –Marketing integrated large scale software and systems –Model for independent development –Model for independent operation Grids Goals –Inter-organisational collaboration –Sharing information and resources –Framework for collaborative development –Framework for collaborative operation

41 Birmingham The Globus-Based LIGO Data Grid Replicating >1 Terabyte/day to 8 sites >40 million replicas so far MTBF = 1 month LIGO Gravitational Wave Observatory  Cardiff AEI/Golm

42 Decomposition Enables Separation of Concerns & Roles User Service Provider “Provide access to data D at S1, S2, S3 with performance P” Resource Provider “Provide storage with performance P1, network with P2, …” D S1 S2 S3 D S1 S2 S3 Replica catalog, User-level multicast, … D S1 S2 S3 Ian Foster GGF16

Enabling Grids for E-sciencE INFSO-RI Contents Introduction to –e-Research and e-Science –Grids –e-Infrastructure Grid concepts Grids - Where are we now? Enabling the research of the future –Grids already empower a widening spectrum of research but. –What happens if research becomes service-oriented??

44 Provisioning Service-Oriented Systems: The Role of Grid Infrastructure l Service-oriented Grid infrastructure u Provision physical resources to support application workloads Appln Service Users Workflows Composition Invocation l Service-oriented applications u Wrap applications as services u Compose applications into workflows “The Many Faces of IT as Service”, ACM Queue, Foster, Tuecke, 2005

Enabling Grids for E-sciencE INFSO-RI Service-oriented research?? “potential to increase individual and collective scientific productivity by making powerful information tools available to all” “Ultimately, we can imagine a future in which a community's shared understanding … is documented also in the various databases and programs that represent—and automatically maintain and evolve—a collective knowledge base. ” Ian Foster, 4?ijkey=aqCCmCFix8Ll.&keytype=ref&siteid=sci 4?ijkey=aqCCmCFix8Ll.&keytype=ref&siteid=sci Science 6 May 2005

Enabling Grids for E-sciencE INFSO-RI Expanding horizons for research Early grids –Resource utilisation –A few big-science VOs  Trivial parallelism – many concurrent independent jobs  Data management – files only Grid-enabling databases –Pre-existing databases accessible from grids –Data integration Service-oriented grid: possibilities for –any collaborative research –International / national / university resources become accessible  With control and AA (authorisation and authentication)

Enabling Grids for E-sciencE INFSO-RI Summary -1: enabling collaboration Grids: collaboration across administrative domains Networks: collaboration across geographical distance Semantics, ontologies: collaboration across disciplines / groups Storage, (“curation”): collaboration across time Network infrastructure & Resource centres Operations, Support and training Collaboration Grid

Enabling Grids for E-sciencE INFSO-RI Summary -2 Ask not what “the Grid” can do for you BUT With whom do you collaborate? What resources / services can you provide? What resources would empower your research? People Data Computation

Enabling Grids for E-sciencE INFSO-RI Further reading The Grid Core Technologies, Maozhen Li and Mark Baker, Wiley, 2005 The Globus Toolkit 4 Programmer's Tutorial Borja Sotomayor, Globus Alliance, The Web Services Grid Architecture (WSGA) Web Services Grid Architecture (WSGA) Global Grid Forum (see GGF16)