Overview of the EGEE project and the gLite middleware

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
Advertisements

FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
ROSCOE Status IT-GS Group Meeting Focus on POW November 2009.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
The LHC Computing Grid – February 2008 The Worldwide LHC Computing Grid Dr Ian Bird LCG Project Leader 15 th April 2009 Visit of Spanish Royal Academy.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of the EGEE project and the gLite middleware Gergely Sipos MTA SZTAKI
1 Introduction to EGEE-II Antonio Fuentes Tutorial Grid Madrid, May 2007 RedIRIS/Red.es (Slices of Bob Jone, Director of EGEE-II.
A short introduction to GRID Gabriel Amorós IFIC.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
INFSO-RI Enabling Grids for E-sciencE SA1: Cookbook (DSA1.7) Ian Bird CERN 18 January 2006.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Training and the NGS Mike Mineter
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
INFSO-RI Enabling Grids for E-sciencE Plan until the end of the project and beyond, sustainability plans Dieter Kranzlmüller Deputy.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Introduction to the EGEE Project Presented by Min Tsai ISGC 2007, Taipei With thanks.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE Gergely Sipos
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE II: an eInfrastructure for Europe and.
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
EGEE is a project funded by the European Union under contract IST Presentation of NA4 Generic Applications Roberto Barbera NA4 Generic Applications.
INFSO-RI Enabling Grids for E-sciencE GILDA and GENIUS Guy Warner NeSC Training Team An induction to EGEE for GOSC and the NGS NeSC,
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of the EGEE project and the gLite middleware Mike Mineter
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
EGEE is a project funded by the European Union under contract IST EGEE Summary NA2 Partners April
EGEE Project Review Fabrizio Gagliardi EDG-7 30 September 2003 EGEE is proposed as a project funded by the European Union under contract IST
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell Activity Manager UEDIN (NeSC)
INFSO-RI Enabling Grids for E-sciencE EGEE general project update Fotis Karayannis EGEE South East Europe Project Management Board.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Overview for ENVRI Gergely Sipos, Malgorzata Krakowian EGI.eu
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Enabling Grids for E-sciencE University of Perugia Computational Chemistry status report EGAAP Meeting – 21 rst April 2005 Athens, Greece.
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
EGEE Project and Middleware Overview
Grid Computing: Running your Jobs around the World
Regional Operations Centres Core infrastructure Centres
SA1 Execution Plan Status and Issues
Applications Using the EGEE Grid Infrastructure
Practical: The Information Systems
Ian Bird GDB Meeting CERN 9 September 2003
Long-term Grid Sustainability
Introduction to Grid Technology
EGEE support for HEP and other applications
Short update on the latest gLite status
LCG middleware and LHC experiments ARDA project
Connecting the European Grid Infrastructure to Research Communities
Cécile Germain-Renaud Grid Observatory meeting 19 October 2007 Orsay
EGEE Middleware: gLite Information Systems (IS)
The GENIUS portal and the GILDA t-Infrastructure
Future EU Grid Projects
Wide Area Workload Management Work Package DATAGRID project
How To Integrate an Application on Grid
Overview of the EGEE project and middleware
Status of Grids for HEP and HENP
gLite The EGEE Middleware Distribution
Grid Computing Software Interface
Presentation transcript:

Overview of the EGEE project and the gLite middleware

Outline What is EGEE? gLite middleware EGEE applications The project The infrastructure gLite middleware EGEE applications Sources of further information

Defining the Grid A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user. Providing a Production Grid Infrastructure for Collaborative Science 3 3

The EGEE Project Aim of EGEE: “to establish a seamless European Grid infrastructure for the support of the European Research Area (ERA)” EGEE 1 April 2004 – 31 March 2006 71 partners in 27 countries, federated in regional Grids EGEE-II 1 April 2006 – 30 April 2008 Expanded consortium EGEE-III 1 May 2008 – 30 April 2010 Transition to sustainable model Challenge: large consortium  needs clearly defined management structure and reporting lines tested in EGEE, expanded in EGEE-II maintain expertise from EGEE partners EGEE-II focus on providing a service to users and less on building up the infrastructure increased work with industry: now also direct input in technical management on same level as scientific applications Involvement of industry also in the technical work – as EGEE business associates or through collaboration with the CERN openlab project (www.cern.ch/openlab) SA = Service Activities: operation and management of the Grid as well as provision of network resources  working, sustainable infrastructure: EGEE-II as core service provider new sites/countries joining the infrastructure new activity (SA3) responsible for software integration and testing NA = Networking Activities: management and coordination of all the communication aspects of the project, application support  Growth of infrastructure user base + international collaboration increased support for applications reinforced dissemination and training: closer links to industry; extend coverage to all regions JRA = Joint Research Activities: Re-engineering and integration of Grid middleware components  Middleware becoming more mature, needs less development also includes quality assurance for the whole project and security New coordination bodies: Technical Coordination Group User Information Group EGEE-II website: www.eu-egee.org Providing a Production Grid Infrastructure for Collaborative Science 4 4

Defining the Grid A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user. Providing a Production Grid Infrastructure for Collaborative Science 5 5

EGEE working with related infrastructure projects GIN Providing a Production Grid Infrastructure for Collaborative Science 6 6

What is happening now? Real Time Monitor Java tool Displays jobs running (submitted through RBs) Shows jobs moving around world map in real time, along with changes in status http://gridportal.hep.ph.ic.ac.uk/rtm/ (snapshot 16 January 2007)

EGEE Infrastructures Production service Pre-production service Scaling up the infrastructure with resource centres around the globe Stable, well-supported infrastructure, running only well-tested and reliable middleware Pre-production service Run in parallel with the production service (restricted nr of sites) First deployment of new versions of the gLite middleware Test-bed for applications and other external functionality T-Infrastructure (Training&Education) Complete suite of Grid elements and application (Testbed, CA, VO, monitoring, support, …) Everyone can register and use GILDA for training and testing 20 sites on 3 continents Providing a Production Grid Infrastructure for Collaborative Science 8 8

NA3 activity: User training and induction Expand portfolio of training materials & courses Train a wide variety of EGEE users (internal/external) Develop effective mechanisms for training end-users of the EGEE infrastructure Collaborate in cross-activity initiatives ICEAGE Project Digital Library http://library.iceage-eu.org/ Videos, MP3 talks on grid computing http://www.egee.nesc.ac.uk/ Training events Training material repository http://egee.lib.ed.ac.uk/ EGEE Digital Library Repository of training materials 29 Active partners ~ 29 FTEs 89 Individuals 6 Federations

NA4 Activity: Application identification and support Application Identification and Support (NA4) 25 countries, 40 partners, 280+ participants, 1000s of users Support the large and diverse EGEE user community: Promote dialog: Users’ Forums & EGEE Conferences Technical Aid: Porting support, procedural issues Liaison: Software and operational requirements Main activities: 5 application clusers: HEP, Life sciences, Astronomy & astrophysics, Earth science, Computational chemistry, Fusion, Grid observatory Support: Application porting support www.lpds.sztaki.hu/gasuc VO support Direct user support www.ggus.org Regional support http://egeena4.lal.in2p3.fr

Country participating in EGEE EGEE Infrastructure Country participating in EGEE > 200 sites in 40 countries ~ 38 000 CPUs ~ 5 PB storage 98k jobs/day > 200 Virtual Organizations ⇨The world’s largest multi-disciplinary Grid Providing a Production Grid Infrastructure for Collaborative Science 11 11

VO concept gLite middleware runs on each shared resource to provide Data services Computation services Security service Resources and users form Virtual organisations: basis for collaboration Distributed services (both people and middleware) enable the grid INTERNET Single sign-on – instead of individual usernames and passwords at each resource (many being used by software clients) have a digital credential that can be securely passed to identify user… more later.

Defining the Grid A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user. Providing a Production Grid Infrastructure for Collaborative Science 14 14

Grid middleware The Grid relies on advanced software, called middleware, which interfaces between resources and the applications The Grid middleware: Basic services Secure and effective access to resources High level services Optimal use of resources Authentication to the different sites that are used Job execution & monitoring of progress Problem recovery Transfer of results back to the user

EGEE Middleware: gLite gLite 3.0, gLite 3.1 ⇨ Merger of LCG 2.7 and GLite 1.5 Exploit experience and existing components from VDT (Condor, Globus), EDG/LCG, and others Develop a lightweight stack of generic middleware useful to EGEE applications (HEP and Biomedics are pilot applications). Should eventually deploy dynamically (e.g. as a globus job) Pluggable components – cater for different implementations Focus is on providing a stable and usable infrastructure

Basic gLite use case: Job submission User Interface Information System Submit job (executable + small inputs) Resource Broker query Retrieve status & (small) output files create proxy query publish state Submit job File and Replica Catalog Retrieve output Job status Logging Computing Element Storage Element Site X Register file Input file(s) Job status process VO Management Service (DB of VO users) Output file(s) Logging and bookkeeping

Main components User Interface (UI): The place where users logon to the Grid Resource Broker (RB) (Workload Management System (WMS): Matches the user requirements with the available resources on the Grid Information System: Characteristics and status of CE and SE File and replica catalog: Location of grid files and grid file replicas Logging and Bookkeeping (LB): Log information of jobs Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE): provides (large-scale) storage for files

All built upon authorisation, Main components User Interface (UI): The place where users logon to the Grid Resource Broker (RB) (Workload Management System (WMS): Matches the user requirements with the available resources on the Grid All built upon authorisation, authentication, security Information System: Characteristics and status of CE and SE File and replica catalog: Location of grid files and grid file replicas Logging and Bookkeeping (LB): Log information of jobs Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE): provides (large-scale) storage for files

Who provides the resources?! Service Provider Note User interface User / institute / VO Computer with client SW Resource Broker (WMS) VOs - EGEE does not fund RBs Information System Grid operations - EGEE funded effort File and replica catalog VOs - EGEE does not fund catalogs Logging and Bookkeeping VOs - EGEE does not fund LB servers Computing Element (CE) VOs - EGEE does not fund CEs VOs provide resources to match average need Storage Element (SE) VOs - EGEE does not fund SEs External services To extend the capabilities of the core infrastructure Grids don’t descend from the skies ready for you to use….somebody pays for the resources.

Empowering VOs Application Application toolkits Where computer science meets the application communities! Recommended External Software Packages for Egee CommuniTies Current RESPECT tools: GridWay P-GRADE Portal GANGA GRelC I2glogin http://egeena4.lal.in2p3.fr/  “Grid software” menu Application Application toolkits Command line & APIs Higher-level gLite services (WMS,…) Production infrastructure contains these services Basic services: Must be complete and robust; Should not assume the use of Higher-Level Grid Services High level services: help the users building their computing infrastructure but should not be mandatory Basic gLite services: CE, SE, info, security

Defining the Grid A Grid is the combination of networked resources and the corresponding middleware, which provides services for the user. Providing a Production Grid Infrastructure for Collaborative Science 23 23

EGEE Applications >270 VOs from several scientific domains Astronomy & Astrophysics Civil Protection Computational Chemistry Comp. Fluid Dynamics Computer Science/Tools Condensed Matter Physics Earth Sciences Fusion High Energy Physics Life Sciences Further applications under evaluation At Feb review: 100 sites, 10K CPUs 1st gLite release foreseen for March’05 6 domains and Applications have moved from testing to routine and daily usage ~80-95% efficiency Providing a Production Grid Infrastructure for Collaborative Science 24 24

Application families Simulation Bulk Processing Workflow Large number of similar, independent jobs – parameter study Bulk Processing Widely-distributed input data, Sophisticated data management Workflow Complex dependencies between individual tasks Legacy Applications Licenses: control access to software on the grid No recompilation  no direct use of grid APIs Parallel Jobs Many CPUs needed simultaneously, Use of MPI libraries Limited support in gLite: MPI configuration is not uniform Responsive Apps. Short response time No real support in gLite  Interactive Grid FP6 project

Further information, references EGEE http://www.eu-egee.org/ gLite middleware http://www.glite.org gLite manuals, documentation http://glite.web.cern.ch/glite/documentation/ (gLite user guide) Recommended External Software Packages for Egee CommuniTies (RESPECT) http://egeena4.lal.in2p3.fr/ Description of work of EGEE-III https://edms.cern.ch/document/886385/4

Summary EGEE is running the largest multi-VO grid in the world! Creating the “grid layer” in e-Infrastructure for research, public service and industry Key concepts for EGEE Sustainability – planning for the long-term Production quality User support EGEE’s middleware: gLite. Current version 3.1 Basic middleware services High level middleware services External software to foster uptake of technology From that commitment flow standards, policies, access to resources and these are expressed in the middleware.