EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org Application Porting Support Group Demonstration at EGEE’08 Conference Istanbul, 22-26.

Presentation transcript:

EGEE-III INFSO-RI Enabling Grids for E-sciencE Application Porting Support Group Demonstration at EGEE’08 Conference Istanbul, September 2008

The Application Porting team
- MTA SZTAKI, Budapest: Grid Application Support Center (GASuC)
- INFN, Catania: GILDA Team
- UCM, Madrid: Distributed Systems Architecture Research Group
- CSIC, Santander: Institute of Physics of Cantabria
- ASGC, Taipei: Academia Sinica Grid Computing

The Application Porting team
[Map: Budapest, Paris, Madrid, Catania, Taipei, Melbourne]

Support cycle and services
- Apply online at
- Interviews
- Problem analysis
- Porting specifications
- Personalized training
- Prototyping grid applications
- Fine tuning applications on production grids
- Writing publications and case studies

Application analysis
- Application Description Form at

Dissemination and outreach

Main tools and technologies
- gLite command line tools and scripts
  - Interfacing with the infrastructure
- P-GRADE Portal
  - Workflows and parameter studies
  - Application specific interfaces
- GEMLCA service
  - Exposing legacy components as Grid services
- GridWay
  - Metascheduling on clusters and grids
  - Programming abstractions
- GILDA services
  - gLite infrastructure for training and prototyping
  - Training modules and services

Success stories
- Solving the Schrödinger equation for triatomic systems using the time-independent method
  - Department of Chemistry, University of Perugia and MTA SZTAKI

Success stories
- E-marketplace Model Integrated with Logistics
  - International Business School and MTA SZTAKI

Success stories
- 2.5 Dimensional Frequency Domain Electromagnetic Numerical Modelling
  - University of Miskolc and MTA SZTAKI

Success stories
- Hybrid pellet code
  - KFKI Research Institute for Particle and Nuclear Physics and MTA SZTAKI

Success stories
- Application specific grid portals
  - Hiding grids and grid applications from non-technical end users

Current applications at MTA SZTAKI
- RWavePR: solving the Schrödinger equation for triatomic systems using the time-dependent method
  - Application specific grid portal for parameter study simulation
- SIMRI 3D: 3-dimensional MRI simulator
  - Application specific grid portal for MPI code
- MPI-FD-FDTD: numerical modeling of electromagnetic field distribution in human tissues
  - Simulating signals propagating inside the human body using a large number of jobs and output files
- Universality Classes Explorer: explaining classes in nonequilibrium systems
  - Describing collective behavior in statistical physics using a volunteer desktop grid

Demo applications
- "ABC" computational chemistry code, ported to EGEE
  - P-GRADE portal
  - MTA SZTAKI, Hungary
- "CD-HIT" sequence clustering code
  - GridWay
  - UCM, Spain
- Planck applications
  - Command line scripting
  - IFCA, Spain

Porting the "ABC" computational chemistry code to EGEE
Case study from MTA SZTAKI

Main facts about the ABC code
- Department of Chemistry, University of Perugia
- Solution of the Schrödinger equation for triatomic systems using the time-dependent (RWavePR) or time-independent (ABC) method
- A single execution takes between 5 and 10 hours
- Sequential Fortran 90
- Binary: 400 KB
- One input: ~1 KB
- One output: ~500 KB
- Many simulations at the same time
- User-friendly graphical interface; easy to change the scale and input of the application

ABC Grid system specification
- Customized user interface
- Business layer
  - Read user input, generate application input
  - Start and manage ABC jobs
  - Collect results from the grid
  - Visualize results for the user
- Grid layer
  - gLite command line tools
  - EGEE Grid services (gLite WMS, LFC, VOMS, …)
- EGEE Grid

ABC Grid system implementation
- ABC specific Gridsphere portlet
- P-GRADE Portal
  - ABC parameter study workflow
  - ABC parameter study job
- Fault tolerant grid execution and file transfer layer
- Compchem VO
- EGEE Grid

ABC application workflow
- Generator job: generates the input files for the ABC jobs
- ABC job: executed once for each parameter provided by the end user
- Collector job: collects and archives all output files into a single TAR file
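The three-stage workflow can be illustrated with a small local sketch. It is not the P-GRADE workflow itself: the function names, the input file format and the `energy` parameter are all assumptions made for illustration, and the real ABC job runs the Fortran binary on a grid resource instead of the stand-in used here.

```python
import tarfile
import tempfile
from pathlib import Path


def generate_inputs(workdir: Path, parameters: list[float]) -> list[Path]:
    """Generator job: write one small input file per parameter value."""
    inputs = []
    for index, value in enumerate(parameters):
        path = workdir / f"abc_input_{index}.txt"
        path.write_text(f"energy = {value}\n")  # hypothetical input format
        inputs.append(path)
    return inputs


def run_abc_job(input_file: Path) -> Path:
    """Stand-in for one ABC execution (the real job runs the Fortran binary)."""
    name = input_file.stem.replace("input", "output") + ".txt"
    output = input_file.with_name(name)
    output.write_text("result for " + input_file.read_text())
    return output


def collect_outputs(outputs: list[Path], archive: Path) -> Path:
    """Collector job: archive every output file into a single TAR file."""
    with tarfile.open(archive, "w") as tar:
        for output in outputs:
            tar.add(output, arcname=output.name)
    return archive


def run_workflow(parameters: list[float]) -> list[str]:
    """Run the generator, one ABC job per parameter, then the collector."""
    with tempfile.TemporaryDirectory() as tmp:
        workdir = Path(tmp)
        inputs = generate_inputs(workdir, parameters)
        outputs = [run_abc_job(path) for path in inputs]
        archive = collect_outputs(outputs, workdir / "abc_results.tar")
        with tarfile.open(archive) as tar:
            return sorted(tar.getnames())
```

Calling `run_workflow([0.1, 0.2, 0.3])` runs three stand-in jobs and returns the names of the files stored in the archive. On the grid, the jobs in the middle stage are independent of each other, which is what makes the parameter study parallel.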

ABC user input interfaces
- Changeable parameters of the ABC grid application
- In this form, end users can define actual values for the ABC application

Fine tuning the ABC application
- Execution of 4 ABC parameter study jobs, compared:
  - on a local machine (P4, 3.4 GHz, 1 GByte)
  - on 4 broker-selected clusters of EGEE
- Better speed-up can be achieved with more parameter jobs.

User manual of the ported application in PPT format

End user portlet for ported application

Conference paper from the porting of the ABC grid application

Training exercise from ported ABC code
- Demonstrating the parameter study concept
- Demonstrating the workflow concept
- Usage of gLite CE, SE, WMS, VOMS
- Tutorial for P-GRADE Portal
- Customized for GILDA

Porting the "CD-HIT" sequence clustering code to EGEE
Case study from UCM

The CD-HIT Application: Application Description
- "Cluster Database at High Identity with Tolerance"
- Protein (and also DNA) clustering
  - Compares protein DB entries
  - Eliminates redundancies
- Examples:
  - Used in UniProt for generating the UniRef data sets. UniProt is the world's most comprehensive catalog of information on proteins. The CD-HIT program is used to generate the UniRef reference data sets, UniRef90 and UniRef50.
  - CD-HIT is also used at the PDB to treat redundant sequences

The CD-HIT Application: Application Description
- Our case: widely used in the Spanish National Oncology Research Center (CNIO)
- Input DB now: 4,186,284 proteins / 1.7 GB
- Infeasible to execute on a single machine
  - Memory requirements
  - Total execution time

The CD-HIT Application: CD-HIT Parallel
- Execute cd-hit in parallel mode
- Idea: divide the input database and compare each division in parallel
  - Divide the input DB
  - Repeat:
    - Cluster the first division (cd-hit)
    - Compare the others against this one (cd-hit-2d)
  - Merge results
- Speeds up the process and deals with larger databases
- Computational characteristics
  - Variable degree of parallelism
  - Grain must be adjusted
[Diagram: DB is divided (Div.) into A, B, C, D; A is clustered into A90 and compared against the others (B-A, C-A, D-A); B-A is clustered into B90 and compared (C-AB, D-AB); C-AB is clustered into C90 and compared (D-ABC), which is clustered into D90; A90, B90, C90 and D90 are merged into DB90.]
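The divide, cluster, compare and merge loop can be sketched on toy data as follows. This is only an illustration of the control flow: `cluster` and `compare` are simplified stand-ins for cd-hit and cd-hit-2d, the similarity test uses Python's `difflib` and not CD-HIT's own identity measure, and all function names and the threshold are assumptions.

```python
from difflib import SequenceMatcher


def similar(a: str, b: str, threshold: float) -> bool:
    """Toy identity test (an assumption, not CD-HIT's actual measure)."""
    return SequenceMatcher(None, a, b).ratio() >= threshold


def cluster(division: list[str], threshold: float) -> list[str]:
    """Stand-in for cd-hit: greedily keep one representative per cluster."""
    representatives: list[str] = []
    for seq in division:
        if not any(similar(seq, rep, threshold) for rep in representatives):
            representatives.append(seq)
    return representatives


def compare(division: list[str], representatives: list[str],
            threshold: float) -> list[str]:
    """Stand-in for cd-hit-2d: drop entries redundant with the representatives."""
    return [seq for seq in division
            if not any(similar(seq, rep, threshold) for rep in representatives)]


def parallel_scheme(database: list[str], n_divisions: int,
                    threshold: float = 0.9) -> list[str]:
    """Divide the database, repeat cluster/compare, and merge the results."""
    size = max(1, -(-len(database) // n_divisions))  # ceiling division
    divisions = [database[i:i + size] for i in range(0, len(database), size)]
    merged: list[str] = []
    while divisions:
        first, rest = divisions[0], divisions[1:]
        reps = cluster(first, threshold)
        merged.extend(reps)
        # Each comparison below is independent of the others, so on a grid
        # every one of them could be submitted as a separate job.
        divisions = [compare(division, reps, threshold) for division in rest]
    return merged
```

The number of independent comparisons shrinks by one in every round, which matches the "variable degree of parallelism" noted above, and `n_divisions` is the knob that sets the grain.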

Grid Infrastructure
- Experiment resources (during the Grid porting process)
  - Research and short scale
  - Production and large scale

The GridWay Metascheduler

The GridWay Metascheduler: GridWay Internals
- User interfaces: DRMAA library, CLI
- GridWay Core: Request Manager, Dispatch Manager, Scheduler, Job Pool, Host Pool
- Execution Manager: job submission, job monitoring, job control, job migration
  - Grid Execution Services: pre-WS GRAM, WS GRAM
- Transfer Manager: job preparation, job termination, job migration
  - Grid File Transfer Services: GridFTP, RFT
- Information Manager: resource discovery, resource monitoring
  - Grid Information Services: MDS2 GLUE, MDS4

The GridWay Metascheduler: Deployment Example
[Diagram labels: Users; GridWay users; DRMAA interface; GridWay; VO schedulers; EGEE Resource Broker (EGEE RB); gLite; gLite services: BDII, GRAM, GridFTP; SGE cluster; PBS cluster; VOs and applications: Biomed, other V.O., other app.]

The GridWay Metascheduler: What is DRMAA?
- Distributed Resource Management Application API
- Open Grid Forum standard
- Homogeneous interface to different Distributed Resource Managers (DRM):
  - SGE
  - Condor
  - PBS/Torque
  - GridWay
- Language bindings: C, Java, Perl, Ruby, Python

The GridWay Metascheduler: Development Process
- Community: open source project, Globus development philosophy
- Development infrastructure (thanks to the Globus Project!)
  - Mailing lists
  - Bugzilla
  - CVS
- You are very welcome to contribute:
  - Reporting bugs
  - Making feature requests for the next GridWay release
  - Contributing your own developments (bug fixes, new features, documentation)
- Detailed roadmap: GridWay Campaigns at
  - bugzilla.mcs.anl.gov/globus/query.cgi
  - www-unix.mcs.anl.gov/~bacon/cgi-bin/big-roadmap.cgi#Gridway

Grid Porting
- Merge sequential tasks to reduce overhead
- Provide a uniform interface (DRMAA) to interact with different DRMS
- Some file manipulation still needed
[Diagram labels: cd-hit-div and merge tasks; intermediate divisions C-A, D-A, C-AB, D-AB, D-ABC; results B90, C90, D90; execution on a local cluster through a DRMS (PBS) and on the Grid through GridWay.]

Grid Porting: Optimization Heuristics
- More information on Globus GridWay

Porting of Planck Applications to EGEE
Case study from CSIC-IFCA Santander

Concept of porting: Compact Source Detection (Point Sources and SZ Clusters)
Marcos López-Caniego, IFCA, Santander, Spain
Scheme:
1. Prepare an Intel-compatible static executable of the code
2. Run a script to generate the necessary macros to be submitted
3. Submit the job to the node:
   1. Retrieve maps/patches from the storage element
   2. Move/rename files
   3. Execute the application
   4. Compress the outputs and copy-register them in the storage element
4. Retrieve the results from the storage element to the local disk
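Steps 3 and 4 of the scheme can be mimicked locally with a short sketch. Plain directories stand in for the storage element, the worker node and the local disk, and `detect_sources` is a hypothetical placeholder for the static executable. The file names are assumptions, and no gLite commands are involved.

```python
import shutil
import tarfile
import tempfile
from pathlib import Path


def detect_sources(patch: Path, output: Path) -> None:
    """Hypothetical stand-in for the statically linked detection code."""
    output.write_text(f"sources detected in {patch.name}\n")


def run_job(storage: Path, node: Path, patch_name: str) -> str:
    """Worker-node part of the scheme (step 3); returns the archive name."""
    # 3.1 Retrieve the map/patch from the (simulated) storage element.
    fetched = Path(shutil.copy(storage / patch_name, node))
    # 3.2 Move/rename it to the name the application expects (assumed name).
    patch = fetched.rename(node / "input_patch.fits")
    # 3.3 Execute the application.
    result = node / "detections.txt"
    detect_sources(patch, result)
    # 3.4 Compress the outputs and copy them back to the storage element.
    archive = node / f"{Path(patch_name).stem}_results.tar.gz"
    with tarfile.open(archive, "w:gz") as tar:
        tar.add(result, arcname=result.name)
    shutil.copy(archive, storage)
    return archive.name


def demo() -> list[str]:
    """Run one job, then retrieve its results to a local directory (step 4)."""
    with tempfile.TemporaryDirectory() as tmp:
        storage, node, local = (Path(tmp) / name
                                for name in ("storage", "node", "local"))
        for directory in (storage, node, local):
            directory.mkdir()
        (storage / "patch_001.fits").write_text("toy patch data")
        archive_name = run_job(storage, node, "patch_001.fits")
        shutil.copy(storage / archive_name, local)
        with tarfile.open(local / archive_name) as tar:
            return tar.getnames()
```

In the real setup each copy to or from the storage element would be a grid data-management operation, and step 3.4 would also register the new file in the catalogue; the sketch only shows the order of the operations.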

Demo libraries
These are the libraries used by the application:
- HEALPix: pixelization scheme from Gorski et al. that produces a subdivision of a spherical surface in which each pixel covers the same surface area as every other pixel.

Demo libraries
- CFITSIO: simple high-level routines for reading and writing FITS files, developed and maintained by NASA HEASARC (High Energy Astrophysics Science Archive Research Center).
- CPACK: a package of routines produced at CPAC (Cambridge Planck Analysis Software), used in particular for the tessellation and projection of regions of the sphere into flat patches.

Demo: Detection of Point Sources
- In this approach to detecting point sources, we work on flat patches of the sky.

Demo: Detection of Point Sources
- Filtering: MHW2

Application Porting Support Group
Contact and further information