Overview of the EGEE project and middleware

Slides:



Advertisements
Similar presentations
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
Advertisements

FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
An overview of the EGEE project Bob Jones EGEE Technical Director DTI International Technology Service-GlobalWatch Mission CERN – June 2004.
INFSO-RI Enabling Grids for E-sciencE Concepts of grid computing Guy Warner NeSC Training Team
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Enabling, facilitating and delivering quality training in the UK and Internationally The challenge of grid training and education David Fergusson, Deputy.
Porting applications to EU-IndiaGrid: EGEE Marco Verlato EU-IndiaGrid Workshop April 2007 Bangalore, India.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of the EGEE project and the gLite middleware Gergely Sipos MTA SZTAKI
1 Introduction to EGEE-II Antonio Fuentes Tutorial Grid Madrid, May 2007 RedIRIS/Red.es (Slices of Bob Jone, Director of EGEE-II.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Overview of Grid Computing Richard Hopkins Training Outreach and Education National.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Related Projects Dieter Kranzlmüller Deputy.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Production Grids Mike Mineter NeSC-TOE. EU project: RIO31844-OMII-EUROPE 2 Production Grids - examples 1.EGEE: Enabling Grids for e-Science 2.National.
Training and the NGS Mike Mineter
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE – paving the way for a sustainable infrastructure.
INFSO-RI Enabling Grids for E-sciencE Plan until the end of the project and beyond, sustainability plans Dieter Kranzlmüller Deputy.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An Introduction to the EGEE Project Presented by Min Tsai ISGC 2007, Taipei With thanks.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE EGEE training community.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Next steps with EGEE Gergely Sipos
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE II: an eInfrastructure for Europe and.
CLRC and the European DataGrid Middleware Information and Monitoring Services The current information service is built on the hierarchical database OpenLDAP.
INFSO-RI Enabling Grids for E-sciencE External Projects Integration Summary – Trigger for Open Discussion Fotis Karayannis, Joanne.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
EGEE is a project funded by the European Union under contract IST Presentation of NA4 Generic Applications Roberto Barbera NA4 Generic Applications.
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
Status Organization Overview of Program of Work Education, Training It’s the People who make it happen & make it Work.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
Enabling, facilitating and delivering quality training in the UK and Internationally Introduction to e-science concepts Mike Mineter Training Outreach.
EGEE-II INFSO-RI Enabling Grids for E-sciencE An overview of the EGEE project and middleware Mike Mineter
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of the EGEE project and the gLite middleware Mike Mineter
INFSO-RI Enabling Grids for E-sciencE The EGEE Project Owen Appleton EGEE Dissemination Officer CERN, Switzerland Danish Grid Forum.
The National Grid Service Mike Mineter.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA3 Activity – Training and Induction Robin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Training Synergies with Collaborating Projects.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell Activity Manager UEDIN (NeSC)
INFSO-RI Enabling Grids for E-sciencE EGEE general project update Fotis Karayannis EGEE South East Europe Project Management Board.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Overview for ENVRI Gergely Sipos, Malgorzata Krakowian EGI.eu
GridMaGrid Users & Applications Conclusions 16/ Grid activities in Morocco Abderrahman El Kharrim CNRST - MaGrid Team Morocco Grid Workshop - Rabat,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI solution for high throughput data analysis Peter Solagna EGI.eu Operations.
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
EGEE Project and Middleware Overview
Grid Computing: Running your Jobs around the World
Regional Operations Centres Core infrastructure Centres
JRA3 Introduction Åke Edlund EGEE Security Head
Ian Bird GDB Meeting CERN 9 September 2003
Long-term Grid Sustainability
Introduction to Grid Technology
EGEE support for HEP and other applications
Grid Computing.
P-GRADE and GEMLCA.
NA3: User Community Support Team
Network Requirements Javier Orellana
Connecting the European Grid Infrastructure to Research Communities
Introduction to Grid Computing and EGEE
Cécile Germain-Renaud Grid Observatory meeting 19 October 2007 Orsay
Overview of the EGEE project and the gLite middleware
The GENIUS portal and the GILDA t-Infrastructure
Future EU Grid Projects
How To Integrate an Application on Grid
gLite The EGEE Middleware Distribution
Presentation transcript:

Overview of the EGEE project and middleware Presented by Gergely Sipos, sipos@sztaki.hu With thanks to EGEE colleagues for many of these slides

Contents What is EGEE? EGEE’s Grid middleware: gLite 3 Goals Status Activities EGEE’s Grid middleware: gLite 3 Sources of further information

EGEE goals – international e-infrastructure Build, deploy and operate a consistent, robust a large scale production grid service that Links with and build on national, regional and international initiatives Improve and maintain the middleware in order to deliver a reliable service to users Attract new users from research and industry and ensure training and support for them Network infrastructure & Resource centres Operations, Support and training Collaboration International Grid

What is happening now? http://gridportal.hep.ph.ic.ac.uk/rtm

EGEE-II From April 2006, natural continuation of EGEE Expanded consortium Emphasis on providing an infrastructure  increased support for applications  interoperate with other infrastructures  more involvement from Industry SA: service activities - establishing operations NA: network activities - supporting VOs JRA: “joint research activities” - e.g. hardening middleware

Collaborating e-Infrastructures Potential for linking ~80 countries by 2008

Related projects: infrastructure, engineering, education Name Description BalticGrid EGEE extension to Estonia, Latvia, Lithuania EELA EGEE extension to Brazil, Chile, Cuba, Mexico, Argentina EUChinaGRID EGEE extension to China EUMedGRID EGEE extension to Malta, Algeria, Morocco, Egypt, Syria, Tunisia, Turkey ISSeG Site security eIRGSP Policies ETICS Repository, Testing OMII-Europe to provide key software components for building e-infrastructures; BELIEF Digital Library of Grid documentation, organisation of workshops, conferences BIOINFOGRID Biomedical Health-e-Child Biomedical – Integration of heterogeneous biomedical information for improved healthcare ICEAGE International Collaboration to Extend and Advance Grid Education Included to explain the acronyms on the previous slide.

Contents What is EGEE? EGEE’s Grid middleware: gLite 3 Goals Status Activities EGEE’s Grid middleware: gLite 3 Sources of further information

Virtual organisations and grids What is a Virtual Organisation? People in different organisations seeking to cooperate and share resources across their organisational boundaries E.g. A research collaboration Each grid is an infrastructure enabling one or more “virtual organisations” to share and access resources Each resource is exposed to the grid through an abstraction that masks heterogeneity, e.g. Multiple diverse computational platforms Multiple data resources Resources are usually owned by VO members. Negotiations lead to VOs sharing resources

Typical current grid INTERNET Virtual organisations negotiate with sites to agree access to resources Grid middleware runs on each shared resource to provide Data services Computation services Single sign-on Distributed services (both people and middleware) enable the grid INTERNET Single sign-on – instead of individual usernames and passwords at each resource (many being used by software clients) have a digital credential that can be securely passed to identify user… more later.

Typical current grid INTERNET At each site that provides computation: Local resource management system (= batch queue) PBS … EGEE term: queue is a “Computing element” Grid middleware runs on each shared resource Data storage (Usually) batch queues on pools of processors Users join VO’s Virtual organisation negotiates with sites to agree access to resources Distributed services (both people and middleware) enable the grid, allow single sign-on INTERNET

Grid Middleware When using a Grid you Login with digital credentials – single sign-on (“Authentication”) Use rights given you (“Authorisation”) Run jobs Manage files: create them, read/write, list directories Services are linked by the Internet Middleware Many admin. domains When using a PC or workstation you Login with a username and password (“Authentication”) Use rights given to you (“Authorisation”) Run jobs Manage files: create them, read/write, list directories Components are linked by a bus Operating system One admin. domain “services” = data storage, CPUs In effect, restating the idea that “the network becomes the computer”.

Main components User Interface (UI): The place where users logon to the Grid Resource Broker (RB) (Workload Management System (WMS): Matches the user requirements with the available resources on the Grid Information System: Characteristics and status of CE and SE Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE): provides (large-scale) storage for files

All built upon authorisation, Main components User Interface (UI): The place where users logon to the Grid Resource Broker (RB): Matches the user requirements with the available resources on the Grid All built upon authorisation, authentication, security Information System: Characteristics and status of CE and SE (Uses “GLUE schema”) Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Storage Element (SE): provides (large-scale) storage for files Grid Security Infrastructure: Single logon with security and trust

Main components Today Today Today Today Tomorrow Today User Interface (UI): The place where users logon to the Grid Resource Broker (RB): Matches the user requirements with the available resources on the Grid Today Today Information System: Characteristics and status of CE and SE (Uses “GLUE schema”) Today Computing Element (CE): A batch queue on a site’s computers where the user’s job is executed Tomorrow Storage Element (SE): provides (large-scale) storage for files Today Grid Security Infrastructure: Single logon with security and trust

Current production middleware Replica Catalogue “User interface” Input “sandbox” DataSets info Information Service Output “sandbox” SE & CE info Resource Broker Job Submit Event Job Query Job Status Publish Author. &Authen. Input “sandbox” + Broker Info Output “sandbox” Slide inherited from EDG – European Data Grid Storage Element Logging & Book-keeping Computing Element Job Status

Who provides the resources?! Service Provider Note Access service User / institute / VO Computer with client software Resource Broker (RB) VOs - EGEE does not fund RBs Information System: Grid operations Computing Element (CE) VOs - EGEE does not fund CEs Scalability requires that VOs provide resources to match average need Storage Element (SE) VOs - EGEE does not fund SEs Grids don’t descend from the skies ready for you to use….although in the UK we are fortunate to have the NGS provided but even their provision of core resources is done as a bootstrapping . “VO”: virtual organisation “Grid operations”: funded effort

Resource management: structure Operations Coordination Centre (OCC) management, oversight of all operational and support activities Regional Operations Centres (ROC) providing the core of the support infrastructure, each supporting a number of resource centres within its region Grid Operator on Duty Resource centres providing resources (computing, storage, network, etc.); Grid User Support (GGUS) At FZK, coordination and management of user support, single point of contact for users

Middleware structure Access for applications to: Higher-level Grid Services Foundation Grid Middleware Higher-Level Grid Services are supposed to help the users building their computing infrastructure but should not be mandatory Foundation Grid Middleware will be deployed on the EGEE infrastructure Must be complete and robust Should allow interoperation with other major grid infrastructures Should not assume the use of Higher-Level Grid Services Higher-Level Grid Services Workload Management Replica Management Visualization Workflow Grid Economies ... Foundation Grid Middleware Security model and infrastructure Computing (CE) and Storage Elements (SE) Accounting Information and Monitoring Overview paper http://doc.cern.ch/archive/electronic/egee/tr/egee-tr-2006-001.pdf

Empowering VO’s Application Application toolkits Where computer science meets the application communities! High level toolkits and services: Portals – P-GRADE Job management – Ganga Alternative WMS - GridWAY Workflow Semantics, ontologies Registries of VO services Application Application toolkits Higher-level grid services (brokering,…) Basic Grid services: AA, job submission, info, … Production grids provide these services.

Who is using EGEE? Many differing application domains High-Energy Physics: LHC, Tevatron, HERA, … Biology: Medical Images, Bioinformatics, Drug Discovery Earth Science: Hydrology, Pollution, Climate, Geophysics, … Astrophysics: Planck, MAGIC Fusion Computational Chemistry Related Projects: Finance, Digital Libraries, … New areas: nanotechnology, … 98k jobs/day

Application families 1. Simulation Bulk Processing Responsive Apps. Large number of CPU-intensive, independent jobs Small input; large output Run by few (expert) users Bulk Processing Widely-distributed input data Significant amount of input and output data Meta-data services More sophisticated data management Responsive Apps. Small amounts of input and output data Not CPU-intensive Short response time (few minutes) / Rapid response: a human waiting for the result! No real support in gLite  Interactive Grid – FP6 project

Application families 2. Workflow Parallel Jobs Legacy Applications Complex set of algorithms for the analysis Complex dependencies between individual tasks No support in gLite yet Higher level tools are needed (e.g. P-GRADE, Moteur) Parallel Jobs Many interdependent, communicating tasks Many CPUs needed simultaneously Use of MPI libraries Limited support in gLite: MPI configuration is not uniform Legacy Applications Licenses: control access to software on the grid No recompilation  no direct use of grid APIs!

NA4 Activity: Application identification and support Application Identification and Support (NA4) 25 countries, 40 partners, 280+ participants, 1000s of users Support the large and diverse EGEE user community: Promote dialog: Users’ Forums & EGEE Conferences Technical Aid: Porting code, procedural issues Liaison: Software and operational requirements Need active participation: Feedback: Infrastructure, configuration, and middleware Resources: Hardware and human http://egeena4.lal.in2p3.fr

NA3: User training and induction Expand portfolio of training materials & courses Train a wide variety of EGEE users (internal/external) Develop effective mechanisms for training end-users of the EGEE infrastructure Enhance e-Learning structure and provision of t-Infrastructure Validate cohorts of trainers & compile directory Collaborate in cross-activity initiatives http://www.egee.nesc.ac.uk/

Further information EGEE digital library: http://egee.lib.ed.ac.uk/ EGEE www.eu-egee.org gLite http://www.glite.org

Summary EGEE is running the largest multi-VO grid in the world! Creating the “grid layer” in e-Infrastructure for research, public service and industry Key concepts for EGEE Sustainability – planning for the long-term Production quality User support  End users: NA3 and NA4 EGEE’s middleware: gLite From that commitment flow standards, policies, access to resources and these are expressed in the middleware.