EGEE-II INFSO-RI-031688 Enabling Grids for E-sciencE Introduction to Grid Application Development Vangelis Floros Application.

Slides:



Advertisements
Similar presentations
The National Grid Service and OGSA-DAI Mike Mineter
Advertisements

Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
A Computation Management Agent for Multi-Institutional Grids
MTA SZTAKI Hungarian Academy of Sciences Grid Computing Course Porto, January Introduction to Grid portals Gergely Sipos
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
INFSO-RI Enabling Grids for E-sciencE FloodGrid application Ladislav Hluchy, Viet D. Tran Institute of Informatics, SAS Slovakia.
Porting applications to EU-IndiaGrid: EGEE Marco Verlato EU-IndiaGrid Workshop April 2007 Bangalore, India.
SICSA student induction day, 2009Slide 1 Social Simulation Tutorial Session 6: Introduction to grids and cloud computing International Symposium on Grid.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
INFSO-RI Enabling Grids for E-sciencE The US Federation Miron Livny Computer Sciences Department University of Wisconsin – Madison.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Grid application development with gLite and P-GRADE Portal Miklos Kozlovszky MTA SZTAKI.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Security and Job Management.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Provenance Challenge gLite Job Provenance.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Using gLite API Vladimir Dimitrov IPP-BAS “gLite middleware Application Developers.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Feb. 06, Introduction to High Performance and Grid Computing Faculty of Sciences,
Holding slide prior to starting show. A Portlet Interface for Computational Electromagnetics on the Grid Maria Lin and David Walker Cardiff University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Application Porting Support in EGEE Gergely Sipos MTA SZTAKI EGEE’08.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
P-GRADE and GEMLCA.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Application Development on Grids Gergely Sipos MTA SZTAKI Hungarian.
1 P-GRADE Portal: a workflow-oriented generic application development portal Peter Kacsuk MTA SZTAKI, Hungary Univ. of Westminster, UK.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The GILDA training infrastructure.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Creating and running an application.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
EGEE-0 / LCG-2 middleware Practical.
INFSO-RI Enabling Grids for E-sciencE GILDA and GENIUS Guy Warner NeSC Training Team An induction to EGEE for GOSC and the NGS NeSC,
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Alexandre Duarte CERN IT-GD-OPS UFCG LSD 1st EELA Grid School.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
INFSO-RI Enabling Grids for E-sciencE Charon Extension Layer. Modular environment for Grid jobs and applications management Jan.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMPROXY usage Álvaro Fernández IFIC (CSIC)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid2Win : gLite for Microsoft Windows Roberto.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to P-GRADE Portal hands-on Miklos Kozlovszky MTA SZTAKI
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite – UNICORE interoperability Daniel Mallmann.
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
INFSO-RI Enabling Grids for E-sciencE Padova site report Massimo Sgaravatto On behalf of the JRA1 IT-CZ Padova group.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Work Load Management & Simple Job Submission Practical Shu-Ting Liao APROC, ASGC EGEE Tutorial.
FESR Trinacria Grid Virtual Laboratory Practical using WMProxy advanced job submission Emidio Giorgio INFN Catania.
Enabling Grids for E-sciencE University of Perugia Computational Chemistry status report EGAAP Meeting – 21 rst April 2005 Athens, Greece.
Introduction to Grid Application Development
Practical using C++ WMProxy API advanced job submission
P-GRADE and GEMLCA.
How To Integrate an Application on Grid
Presentation transcript:

EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to Grid Application Development Vangelis Floros Application Support Team NCSR “Demokritos”, Institute of Nuclear Physics

Enabling Grids for E-sciencE EGEE-II INFSO-RI Acknowledgments Portion of slides (derived from those) prepared by: –Mike Mineter, NESC –Charles Loomis, LAL-Orsay –Roberto Barbera and his GILDA team University of Catania and INFN –EGEE-II NA4 Activity Member’s

Enabling Grids for E-sciencE EGEE-II INFSO-RI Contents Basic Concepts Types of Grid Applications Challenges

Enabling Grids for E-sciencE EGEE-II INFSO-RI Definition, Ian Foster A Virtual Organisation is: People from different institutions working to solve a common goal Sharing distributed processing and data resources “Grid computing is coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations” (I.Foster) Focus: Wide area, collaboration, virtual organisations

Enabling Grids for E-sciencE EGEE-II INFSO-RI Practical definition in some areas Grid Computing == Clustering Clusters; Building a global batch submission system...

Enabling Grids for E-sciencE EGEE-II INFSO-RI Basic Concepts Types of Grid Applications Challenges

Enabling Grids for E-sciencE EGEE-II INFSO-RI The four pillars of Grid Computing Job/ App Execution Management Data Management Security Information & Monitoring Authentication Authorization Confidentiality Integrity VO management Remote execution Load balancing Interactivity Parallelism Workflows Data staging Bulk data transfers Replication Metadata Resource discovery Events & Notifications Resource status & monitoring

Enabling Grids for E-sciencE EGEE-II INFSO-RI Grid Application Development Application development in the Grid implies the exploitation of APIs, tools and environments that provide the four basic Grid capabilities order to perform complex tasks and achieve diverse goals. The extend and approach that the four basic Grid concepts are materialized depends on the specific capabilities of the Grid enabling technologies (in our case the gLite middleware suite)

Enabling Grids for E-sciencE EGEE-II INFSO-RI The vital layer Where computer science meets the application communities! VO-specific developments built on higher-level tools and core services Makes Grid services useable by non-specialists Grids provide the compute and data storage resources Basic Grid services: AA, job submission, info, … Higher-level grid services (brokering,…) Application toolkits, ….. Application Production grids provide these core services.

Enabling Grids for E-sciencE EGEE-II INFSO-RI gLite Grid Middleware Services API Access Workload Mgmt Services Computing Element Workload Management Metadata Catalog Data Management Storage Element Data Movement File & Replica Catalog Authorization Security Services Authentication Information & Monitoring Information & Monitoring Services Application Monitoring Connectivity Accounting Auditing Job Provenance Package Manager CLI

Enabling Grids for E-sciencE EGEE-II INFSO-RI The Job concept gLite follows the job submission concept for application execution and resource sharing A job is a self contained entity that packages and conveys all the required information and artifacts for the successful remote execution of an application. –Executable files –Input/Output data –Parameters –Environment –Infrastructure Requirements –Workflows Described using the Job Description Language (JDL) Executables Data Parameters Requirements JOB

Enabling Grids for E-sciencE EGEE-II INFSO-RI Complexities of grid jobs 1.Simple jobs – submitted to WMS to run in batch mode 2.Job invokes grid services To read & write files on SE Monitoring For outbound connectivity (interactive jobs) To manage metadata … 3.Complex jobs An environment controls multiple jobs on users’ behalf High-level services Portals with workflow Software written for the VO (or by the user) …

Enabling Grids for E-sciencE EGEE-II INFSO-RI Complex Workflows Direct Acyclic Graph (DAG) is a set of jobs where the input, output, or execution of one or more jobs depends on one or more other jobs A Collection is a group of jobs with no dependencies –basically a collection of JDL’s A Parametric job is a job having one or more attributes in the JDL that vary their values according to parameters Using compound jobs it is possible to have one shot submission of a (possibly very large, up to thousands) group of jobs –Submission time reduction  Single call to WMProxy server  Single Authentication and Authorization process  Sharing of files between jobs –Availability of both a single Job Id to manage the group as a whole and an Id for each single job in the group nodeE nodeC nodeA nodeD nodeB

Enabling Grids for E-sciencE EGEE-II INFSO-RI Pragmatic approach to Grid application development Benefits and Restrictions. Potential compromises Resource sharing (no dedicated resources) Explicit and implicit collaboration (working in shared environment) Security risks (yes there are!) Performance compromises (wrt system responsiveness. Some times too much middleware!) Application Models (the application may have to adapt to the grid and not vice versa)

Enabling Grids for E-sciencE EGEE-II INFSO-RI Basic Concepts Types of Grid Applications Challenges

Enabling Grids for E-sciencE EGEE-II INFSO-RI Gridification Levels No development. Wrap existing applications as jobs. No source code modification is required Minor modifications. The application exposes minimal interaction with the grid services (e.g. Data Managements) Major modifications. A wide portion of the code is re- written to adopt to the new environment (e.g. parallelization, metadata, information) Pure grid applications. Developed from scratch. Extensively exploit existing grid services to provide new capabilities customized for a specific domain (e.g. metadata, job management, credential management)

Enabling Grids for E-sciencE EGEE-II INFSO-RI Application Programming Interfaces Classic APIs –Static compilation –Shared libraries –Need access to static or dynamic libraries Web Service Interfaces –The programmer may generate Web Service stubs and develop new clients from scratch –Libraries are comprised of precompiled Service clients –Need access to Web Services WSDL Support for C, C++ and Java An Application Programming Interface (API) is the interface that a computer system, library or application provides in order to allow requests for services to be made of it by other computer programs, and/or to allow data to be exchanged between them.interface

Enabling Grids for E-sciencE EGEE-II INFSO-RI Invocation of applications From the UI –Command Line Interfaces / Scripts –APIs –Higher level tools From desktop Windows applications –Use Grids without awareness of them! –But gLite not (yet) supporting Windows From portals –For recurring tasks: “core grid services” as well as application layer –Accessible from any browser –Tailored to applications –In EGEE: P-GRADE and GENIUS

Enabling Grids for E-sciencE EGEE-II INFSO-RI Example of higher-level tools: GANGA Ganga is a lightweight user tool ganga.web.cern.ch/ But also: Ganga is a developer framework

Enabling Grids for E-sciencE EGEE-II INFSO-RI GENIUS For many application communities –Interface can be tailored for specific requirements For demonstration purposes –  Available for anyone to use –  Fuller functionality for users who have stored long-lived proxy in MyProxy server

Enabling Grids for E-sciencE EGEE-II INFSO-RI EGEE Grid e.g. VOCE UK NGS P-GRADE-Portal London Rome Athens Multi-Grid P-GRADE Portal Different jobs of a workflow can be executed in different grids The portal can be connected to multiple grids

Enabling Grids for E-sciencE EGEE-II INFSO-RI Basic Concepts Types of Grid Applications Challenges

Enabling Grids for E-sciencE EGEE-II INFSO-RI Different Goals for App. Development I need resources for my research –I need richer functionality  MPI, parametric sweeps,…  Data and compute services together… I provide an application for (y)our research –How!?  Pre-install executables ?  Hosting environment?  Share data  Use it via portal? We provide applications for (y)our research –Also need:  Coordination of development  Standards  … Engineering challenges increasing

Enabling Grids for E-sciencE EGEE-II INFSO-RI Challenges Research software is often –Created for one user: the developer –Familiarity makes it useable –Short-term goals: Used until papers are written and then discarded Grid applications are often used –by a VO –Without support from developer –In new contexts and workflows Grid application developers are –In a research environment –Yet their s/w must have:  Stability  Documentation  Useability  Extendability –i.e. Production quality Need expertise in: software engineering application domain grid computing

Enabling Grids for E-sciencE EGEE-II INFSO-RI Consequences Team work! Engaged in world-wide initiatives – reuse, don’t make your own! Cross disciplines for solutions. From research to production software: ~5 times the effort. –“80% of the time for last 10% of the functionality & reliability” Standardisation is key –For re-use, for dynamic configuration of services,.. –Both for middleware and domain specific (e.g. GEON) Need to follow a deliberate development process –Waterfall? Rapid prototyping? –Requirements engineering, design, implementation, validation, deployment –Engaged with the user community

EGEE-II INFSO-RI Enabling Grids for E-sciencE Questions?