Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Experiences with GridWay on CRO NGI infrastructure Emir Imamagic, Srce EGEE User.

Slides:



Advertisements
Similar presentations
CSF4 Meta-Scheduler Tutorial 1st PRAGMA Institute Zhaohui Ding or
Advertisements

W w w. h p c - e u r o p a. o r g HPC-Europa Portal: Uniform Access to European HPC Infrastructure Ariel Oleksiak Poznan Supercomputing.
1/22 Distributed Systems Architecture Research Group Universidad Complutense de Madrid Constantino Vázquez Eduardo Huedo Scaling DRMAA codes to the Grid:
Distributed Systems Architecture Research Group Universidad Complutense de Madrid EGEE UF4/OGF25 Catania, Italy March 2 nd, 2009 State and Future Plans.
Legacy code support for commercial production Grids G.Terstyanszky, T. Kiss, T. Delaitre, S. Winter School of Informatics, University.
TNC 2008 / Short Lived Credential Service Implementation Based on National AAI Short Lived Credential Service Implementation Based on National AAI Emir.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
A Computation Management Agent for Multi-Institutional Grids
Congreso Cuidad, Spain May 15, 2007 GridWay 1/29 gLite Course EGEE’07 MTA SZTAKI, Budapest, Hungary September 30th, 2007 An Overview of the GridWay Metascheduler.
Globus Toolkit 4 hands-on Gergely Sipos, Gábor Kecskeméti MTA SZTAKI
2 nd GADA Workshop / OTM 2005 Conferences Eduardo Huedo Rubén S. Montero Ignacio M. Llorente Advanced Computing Laboratory Center for.
Universität Dortmund Robotics Research Institute Information Technology Section Grid Metaschedulers An Overview and Up-to-date Solutions Christian.
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
Slides for Grid Computing: Techniques and Applications by Barry Wilkinson, Chapman & Hall/CRC press, © Chapter 1, pp For educational use only.
1-2.1 Grid computing infrastructure software Brief introduction to Globus © 2010 B. Wilkinson/Clayton Ferner. Spring 2010 Grid computing course. Modification.
Workload Management Massimo Sgaravatto INFN Padova.
Condor Overview Bill Hoagland. Condor Workload management system for compute-intensive jobs Harnesses collection of dedicated or non-dedicated hardware.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
Ashok Agarwal 1 BaBar MC Production on the Canadian Grid using a Web Services Approach Ashok Agarwal, Ron Desmarais, Ian Gable, Sergey Popov, Sydney Schaffer,
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
National Center for Supercomputing Applications The Computational Chemistry Grid: Production Cyberinfrastructure for Computational Chemistry PI: John Connolly.
Dynamic Firewalls and Service Deployment Models for Grid Environments Gian Luca Volpato, Christian Grimm RRZN – Leibniz Universität Hannover Cracow Grid.
NeSC Apps Workshop July 20 th, 2002 Customizable command line tools for Grids Ian Kelley + Gabrielle Allen Max Planck Institute for Gravitational Physics.
HPDC 2007 / Grid Infrastructure Monitoring System Based on Nagios Grid Infrastructure Monitoring System Based on Nagios E. Imamagic, D. Dobrenic SRCE HPDC.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
OGF 25/EGEE User Forum Catania, March 2 nd 2009 Meta Scheduling and Advanced Application Support on the Spanish NGI Enol Fernández del Castillo (IFCA-CSIC)
GRAM5 - A sustainable, scalable, reliable GRAM service Stuart Martin - UC/ANL.
CSF4 Meta-Scheduler Name: Zhaohui Ding, Xiaohui Wei
BOF: Megajobs Gracie: Grid Resource Virtualization and Customization Infrastructure How to execute hundreds of thousands tasks concurrently on distributed.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks S. Natarajan (CSU) C. Martín (UCM) J.L.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, An Overview of the GridWay Metascheduler.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios for Grid Services E. Imamagic, SRCE.
Institute For Digital Research and Education Implementation of the UCLA Grid Using the Globus Toolkit Grid Center’s 2005 Community Workshop University.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
NW-GRID Campus Grids Workshop Liverpool31 Oct 2007 NW-GRID Campus Grids Workshop Liverpool31 Oct 2007 Moving Beyond Campus Grids Steven Young Oxford NGS.
1October 9, 2001 Sun in Scientific & Engineering Computing Grid Computing with Sun Wolfgang Gentzsch Director Grid Computing Cracow Grid Workshop, November.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Site Monitoring with Nagios E. Imamagic,
GVis: Grid-enabled Interactive Visualization State Key Laboratory. of CAD&CG Zhejiang University, Hangzhou
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, Novelties and Features around the GridWay.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Application Porting Support in EGEE Gergely Sipos MTA SZTAKI EGEE’08.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Applications.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Globus: A Report. Introduction What is Globus? Need for Globus. Goal of Globus Approach used by Globus: –Develop High level tools and basic technologies.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Computational chemistry with ECCE on EGEE.
EGI Technical Forum Amsterdam, 16 September 2010 Sylvain Reynaud.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CharonGUI A Graphical Frontend on top of.
Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center Poznan, Poland EGEE’07, Budapest, Oct.
Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center Poland EGEE’08 Conference, Istanbul, 24 Sep.
CSF. © Platform Computing Inc CSF – Community Scheduler Framework Not a Platform product Contributed enhancement to The Globus Toolkit Standards.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
ACGT Architecture and Grid Infrastructure Juliusz Pukacki ‏ EGEE Conference Budapest, 4 October 2007.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Puppet and Cobbler for the configuration of multiple grid sites
GridOS: Operating System Services for Grid Architectures
Workload Management Workpackage
Use of Nagios in Central European ROC
Management of Virtual Machines in Grids Infrastructures
Management of Virtual Machines in Grids Infrastructures
Presentation transcript:

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Experiences with GridWay on CRO NGI infrastructure Emir Imamagic, Srce EGEE User Forum 2009, Catania, Italy

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Overview Introduction CRO NGI Middleware User Community GridWay Why do we like it? How could we like it even more? Conclusion

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Introduction Initial grid national initiative in Croatia – 2003 (CRO-GRID) Providing a job management interface is crucial for attracting cluster user communities Started using GridWay in 2005 The best among very few (working) solutions Truly middleware-independent We met developers on EGEE Conference, Geneva, 2006

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 CRO NGI Grid infrastructure provided for academic & research purposes Government approved, permanent grid service Community 8 partners 23 user institutions 80 users 42 active users

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Current Status

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Middleware Sites Standard cluster environment (OSCAR distribution) Torque, Maui and SGE batch systems Ganglia monitoring system Grid Globus Toolkit 2 (GRAM, GridFTP) Globus Toolkit 4 (WS-GRAM, WS-MDS, RFT) Job metascheduling and management Condor-G with GRAM GridWay with WS-GRAM

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 User Community Traditional cluster users Applications: Gaussian (computational chemistry) Gromacs (molecular dynamics) ABINIT (computational chemistry) Povray & FFmpeg (graphic rendering) Intel Math Kernel Library (mathematics) NAG Fortran Libraries (mathematics) Home-grown applications

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 GridWay Grid Metascheduler Transparent job management over heterogeneous resources Enables integration with various middleware systems (e.g. Globus Toolkit, UNICORE, gLite, NorduGrid, …) Standards compliant OGF DRMAA API OGF JSDL Aligned with Globus Toolkit

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Why do we like it? Multiple middleware systems support Convenient for heterogeneous environments Enables transparent changes of underlying infrastructure Positive feedback from users Easy to use – similar to cluster & OS-level tools Job recovery in case of service/resource failures Advanced scheduling policies Fair share – complaint with cluster policy

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Why do we like it? Advanced job options Checkpointing – automatic storing of checkpoint on remote GridFTP server Performance tracking & automatic migration to better resources Modular & extendable Modifying job wrappers to suite our environment Possibility to add custom scheduling policy Open source Modifying core code is possible

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 How could we like it even more? Periodic glitches User proxy rejected (could not register user error) Problem with staging files (seems gone in 5.4) MADs performance Heavyweight MAD processes present even if no job is running (roadmap #4484)

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 How could we like it even more? Improved integration with underlying layers Adding more resource attributes for ranking and filtering (roadmap #4492) Passing attributes to underlying middleware, e.g. WS-GRAM extensions (roadmap #4491) Robustness Too sensitive to network failures Too sensitive to middleware glitches Jobs do not recover properly after restart (bug #5308)

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Conclusion GridWay – open grid metascheduling solution Enables more than best effort grid scheduling Suitable for heterogeneous grid environments with tendency to change

Experiences with GridWay on CRO NGI infrastructure / EGEE User Forum 2009 Thank You! Questions?