Cindy Zheng, Pragma Grid, 5/30/2006. The PRAGMA Testbed: Building a Multi-Application International Grid. Cindy Zheng, Pacific Rim Application and Grid Middleware Assembly.

Presentation transcript:

Cindy Zheng, Pragma Grid, 5/30/2006
The PRAGMA Testbed: Building a Multi-Application International Grid
Cindy Zheng
Pacific Rim Application and Grid Middleware Assembly
University of California, San Diego
San Diego Supercomputer Center

Overview
PRAGMA
PRAGMA Grid testbed
Routine-basis experiments
  –Applications
  –Grid middleware
  –Grid infrastructure software
Grid interoperation
Lessons Learned

PRAGMA
PRAGMA
  –Open international organization
  –Grid applications, practical issues
  –Build international scientific collaborations
Characteristics
  –No central funding, but mutual interests
  –Friendship, trust, help among people
  –Doers
Working groups
  –Bio, telescience, data, Geon, resources
Meetings

Resources working group
Improve
  –Middleware interoperability
  –Global grid usability and productivity
  –Grid interoperability
How to make a global grid easy to use?
  –For applications. Let applications drive
  –More organized testbed operation
  –Full-scale and integrated testing/research
  –Long daily application runs
  –Find problems, develop/research/test solutions

Routine-basis Experiments
Run applications while building testbed
  –Started 2004
  –Grass-roots, PRAGMA membership not necessary
  –Voluntary contribution of resources/work
  –Long term, persistent
  –General grid
Coordinator
Site supporters
Application drivers
Developers

How We Operate
Heterogeneity
  –Funding, policies, environments
Motivation
  –Learn, develop, test, interop
Communication
  –VTC, Skype, workshop, timezone, language
Create operation procedures
  –Joining testbed
  –Running applications
  –Resources, contacts, requirements, instructions, monitoring, status, tools, etc.

How We Operate

PRAGMA Grid Testbed
AIST, Japan; CNIC, China; KISTI, Korea; ASCC, Taiwan; NCHC, Taiwan; UoHyd, India; MU, Australia; BII, Singapore; KU, Thailand; USM, Malaysia; NCSA, USA; SDSC, USA; CICESE, Mexico; UNAM, Mexico; UChile, Chile; TITECH, Japan; QUT, Australia; UZurich, Switzerland; JLU, China; NGO, Singapore; MIMOS, Malaysia; OSAKAU, Japan; IOIT-HCM, Vietnam

PRAGMA Grid resources

Software Layers
Globus 2, 3, 4
GT4 pre-WS, 9 sites
GT4 WS, 1
Moving requirements

Trust
Trust all site CAs
  –Tarball
Experimental -> production
Set up PRAGMA CA
  –GAMA/Naregi-CA
APGRID PMA, IGTF (5 accr.)

Applications
Real science, multiple applications
  –Resource sharing
    Mpich-g2
    Reservation and meta-scheduling
  –TDDFT: quantum chemistry, AIST, Japan
  –Savannah: climate model, MU, Australia
  –QM-MD: quantum mechanics, AIST, Japan
  –iGAP: bioinformatics, UCSD, USA
  –Gamess-APBS: organic chemistry, UZurich, Switzerland
  –Siesta: molecular simulation, UZurich, Switzerland
  –Amber: molecular simulation, USM, Malaysia
  –FMO: quantum mechanics, AIST, Japan
  –HPM: genomics, IOIT-HCM, Vietnam
  –(GEON, Sensor, …)

Middleware
Application middleware
  –Enable application to run in grid
  –Ninf-G (AIST, Japan): TDDFT, QM/MD, FMO
  –Nimrod/G (MU, Australia): Savannah, Siesta, Gamess
  –Mpich-Gx (KISTI, Korea): MM5, CICESE, Mexico
Infrastructure middleware
  –Provide grid services
  –Gfarm (AIST, Japan): iGAP, testbed, 6 sites
  –SCMSWeb (KU, Thailand): testbed, 20 sites
  –MOGAS (NTU, Singapore): testbed, 14 sites

GridRPC: A Programming Model based on RPC
[Diagram labels: Server, Client Computer, Func. Handle, Client Component, Info. Manager, Remote Executables]
GridRPC API is a proposed recommendation at the GGF
Three components
  –Information Manager: manages and provides interface info
  –Client Component: manages remote executables via function handles
  –Remote Executables: dynamically generated on remote servers
Built on top of Globus Toolkit (MDS, GRAM, GSI)
Simple and easy-to-use programming interface
  –Hides the complicated mechanisms of the grid
  –Provides RPC semantics
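The three components named on this slide can be illustrated with a small in-process sketch. It is not the GridRPC C API or Ninf-G: `InformationManager`, `Server`, `FunctionHandle`, the function name `dot2`, and the host name are hypothetical stand-ins, and the "remote executable" is only a local Python function.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass(frozen=True)
class InterfaceInfo:
    """Interface description of one remote function (hypothetical shape)."""
    name: str
    arg_count: int


class InformationManager:
    """Manages and provides interface information."""

    def __init__(self) -> None:
        self._entries: Dict[str, InterfaceInfo] = {}

    def register(self, info: InterfaceInfo) -> None:
        self._entries[info.name] = info

    def lookup(self, name: str) -> InterfaceInfo:
        return self._entries[name]


class Server:
    """Stands in for a remote server that holds remote executables."""

    def __init__(self, host: str) -> None:
        self.host = host
        self._executables: Dict[str, Callable[..., float]] = {}

    def install(self, name: str, fn: Callable[..., float]) -> None:
        self._executables[name] = fn

    def run(self, name: str, args: Tuple[float, ...]) -> float:
        return self._executables[name](*args)


class FunctionHandle:
    """Client-side handle that binds one function name to one server."""

    def __init__(self, server: Server, info: InterfaceInfo) -> None:
        self.server = server
        self.info = info

    def call(self, *args: float) -> float:
        # Check the call against the interface info before "going remote".
        if len(args) != self.info.arg_count:
            raise TypeError(f"{self.info.name} expects {self.info.arg_count} arguments")
        return self.server.run(self.info.name, args)


# Usage: register the interface, install the executable, call through a handle.
manager = InformationManager()
manager.register(InterfaceInfo(name="dot2", arg_count=4))
server = Server("server.example.org")  # hypothetical host name
server.install("dot2", lambda a, b, c, d: a * c + b * d)
handle = FunctionHandle(server, manager.lookup("dot2"))
result = handle.call(1.0, 2.0, 3.0, 4.0)
```

The client only ever touches the handle, which is the point of the model: where and how the executable runs stays hidden behind an ordinary call.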

Nimrod Development Cycle
[Diagram labels: Prepare jobs using portal; Jobs scheduled, executed dynamically; Results displayed & interpreted; Sent to available machines]

Application Middleware
Ninf-G
  –Supports the GridRPC model, which will be a GGF standard
  –Integrated into NMI release 8 (first non-US software in NMI)
  –Ninf roll for Rocks 4.x is also available
  –On the PRAGMA testbed, the TDDFT and QM/MD applications achieved long executions (runs of 1 week to 50 days)
Nimrod
  –Supports large-scale parameter sweeps on grid infrastructure
    Study the behaviour of some of the output variables against a range of different input scenarios
    Compute parameters that optimize model output
    Computations are uncoupled (file transfer)
    Allows robust analysis and more realistic simulations
    Very wide range of applications, from quantum chemistry to public health policy
  –Climate experiment ran some 90 different scenarios of 6 weeks each
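A parameter sweep of the kind described for Nimrod can be sketched as the cross product of the input ranges, with each combination becoming one uncoupled job. This is a minimal illustration, not Nimrod's plan-file syntax or API; `expand_sweep`, `toy_model`, and the parameter names `x` and `y` are assumptions made for the example.

```python
import itertools
from typing import Dict, List, Sequence


def expand_sweep(parameters: Dict[str, Sequence[float]]) -> List[Dict[str, float]]:
    """Return one independent job description per parameter combination."""
    names = sorted(parameters)
    value_lists = [parameters[name] for name in names]
    return [dict(zip(names, combo)) for combo in itertools.product(*value_lists)]


def toy_model(job: Dict[str, float]) -> float:
    """Hypothetical model output; a real sweep would run the application."""
    return (job["x"] - 2.0) ** 2 + (job["y"] + 1.0) ** 2


# 4 values of x times 3 values of y gives 12 independent jobs.
jobs = expand_sweep({"x": [0.0, 1.0, 2.0, 3.0], "y": [-2.0, -1.0, 0.0]})

# "Compute parameters that optimize model output": pick the best combination.
best = min(jobs, key=toy_model)
```

Because the jobs share no state, they can be sent to whichever machines are available and rerun individually if one fails.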

Fault-Tolerance Enhanced
Ninf-G monitors each RPC call
  –Return error code for failures
    Explicit faults: server down, disconnection of network
    Implicit faults: jobs not activated, unknown faults
    Timeout: grpc_wait*()
  –Retry/restart
Nimrod/G monitors remote services and restarts failed jobs
  –Long jobs are split into many sequentially dependent jobs, which can be restarted using sequential parameters called seqameters
Improvement in the routine-basis experiment
  –Developers test code on a heterogeneous global grid
  –Results guide developers to improve fault detection and handling
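The retry/restart idea, combined with splitting a long job into sequentially dependent steps, can be sketched as follows. This is an assumption-laden illustration rather than Ninf-G or Nimrod/G code: `RemoteFault`, `run_with_retry`, and `run_chain` are hypothetical names, and faults are simulated in-process instead of being detected through error codes or timeouts.

```python
from typing import Callable, List


class RemoteFault(Exception):
    """Hypothetical error raised when a remote call or job fails."""


def run_with_retry(call: Callable[[], int], max_attempts: int) -> int:
    """Run call(), retrying on RemoteFault up to max_attempts times."""
    for attempt in range(1, max_attempts + 1):
        try:
            return call()
        except RemoteFault:
            if attempt == max_attempts:
                raise
    raise ValueError("max_attempts must be at least 1")


def run_chain(steps: List[Callable[[int], int]], state: int, max_attempts: int = 3) -> int:
    """Run sequentially dependent steps; a failure retries only the failed step."""
    for step in steps:
        state = run_with_retry(lambda: step(state), max_attempts)
    return state


calls = {"first": 0, "second": 0}


def first(state: int) -> int:
    calls["first"] += 1
    return state + 1


def second(state: int) -> int:
    calls["second"] += 1
    if calls["second"] < 3:  # simulate two failures before success
        raise RemoteFault("server down")
    return state * 10


final_state = run_chain([first, second], state=0)
```

The second step fails twice and is retried, while the first step runs only once: work completed before the fault is not repeated, which is the benefit of splitting a long job into restartable pieces.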

Application Setup and Resource Management
Heterogeneous platforms
  –Manual build, deploy applications, manage resources
    Labor intensive, time consuming, tedious
Middleware solutions
  –For deployment
    Automatic distribution of executables using staging functions
  –For resource management
    Ninf-G client configuration allows description of server attributes
      –Port number of the Globus gatekeeper
      –Local scheduler type
      –Queue name for submitting jobs
      –Protocol for data transfer
      –Library path for dynamic linking
    Nimrod/G portal allows a user to generate a testbed and helps maintain information about resources, including use of different certificates
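The server attributes listed for the Ninf-G client configuration can be pictured as one record per server. The sketch below only models that list; the field names, the example values, and the `key=value` rendering are hypothetical and do not reproduce the actual Ninf-G configuration file syntax.

```python
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class ServerAttributes:
    """Per-server attributes named on the slide; field names are hypothetical."""
    hostname: str
    gatekeeper_port: int
    local_scheduler: str
    queue: str
    transfer_protocol: str
    library_path: str


def render(attrs: ServerAttributes) -> str:
    """Render attributes as key=value lines (illustrative, not Ninf-G syntax)."""
    return "\n".join(f"{key}={value}" for key, value in asdict(attrs).items())


example = ServerAttributes(
    hostname="cluster.example.org",  # hypothetical host
    gatekeeper_port=2119,            # example value
    local_scheduler="pbs",           # example value
    queue="default",                 # example value
    transfer_protocol="gridftp",     # example value
    library_path="/opt/app/lib",     # example value
)
config_text = render(example)
```

Keeping these details in client-side configuration means the application code does not change when a site uses a different scheduler, queue, or port.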

Gfarm – Grid Virtual File System
-High transfer rate (parallel transfer, localization)
-Scalable
-File replication: user/application setup, fault tolerance
-Supports Linux, Solaris; also scp, gridftp, SMB
-POSIX compliant
-Gfarm-FUSE
-6 sites, 3786 GBytes, 1527 MB/sec (70 I/O nodes)

Application Benefit
No modification required
  –Existing legacy applications can access files in the Gfarm file system without any modification
Easy application deployment
  –Install the application in the Gfarm file system, run everywhere
    It supports binary execution and shared library loading
    Different kinds of binaries can be stored at the same pathname; the right one is selected automatically depending on client architecture
Fault tolerance
  –Automatic selection of file replicas at access time tolerates disk and network failure
File sharing – Community Software Area
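Selecting a replica at access time can be sketched as trying each known copy in turn until one read succeeds. This is a simplified stand-in and not how Gfarm is implemented; `read_with_failover`, `fake_fetch`, and the host names are hypothetical, and failures are simulated with `OSError`.

```python
from typing import Callable, Dict, List


def read_with_failover(replica_hosts: List[str], fetch: Callable[[str], bytes]) -> bytes:
    """Try each replica in order and return the first successful read."""
    failures = []
    for host in replica_hosts:
        try:
            return fetch(host)
        except OSError as exc:
            failures.append(f"{host}: {exc}")
    raise OSError("all replicas failed: " + "; ".join(failures))


# Simulated storage: node-a is down, node-b holds the data.
stored: Dict[str, bytes] = {"node-b.example.org": b"payload"}


def fake_fetch(host: str) -> bytes:
    if host not in stored:
        raise OSError("host unreachable")
    return stored[host]


data = read_with_failover(["node-a.example.org", "node-b.example.org"], fake_fetch)
```

The caller sees a single successful read even though the first replica was unreachable, which is the behaviour the slide attributes to replica selection.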

Performance Enhancements
Performance for small files
  –Improve meta-cache management
  –Add meta-cache server
[Chart: directory listing of 16,393 files; series: Original, Improved metadata management, W/ metadata cache server]

SCMSWeb
Web-based monitoring system for clusters and grid
  –System usage
  –Performance metrics
Reliability
  –Grid service monitoring
  –Spot problems at a glance

PRAGMA-Driven Development
Heterogeneity
  –Add platform support
    Solaris (CICESE, Mexico)
    IA64 (CNIC, China)
Software deployment
  –NPACI Rocks Roll
    Support ROCKS 4.1
  –Native Linux RPM for various Linux platforms
Enhancement
  –Hierarchical monitoring on large-scale grid
  –Compress data exchange between grid sites
    For sites with slow networks
  –Better and cleaner graphical user interfaces
Standardize & more collaboration
  –GRMAP (Grid Resource Management & Account Project): collaboration between NTU and TNGC
  –GIN (Grid Interoperation Now) Monitoring: standardize data exchange between monitoring software

Multi-organisation Grid Accounting System

MOGAS Web information
Information for grid resource managers/administrators:
  –Resource usage based on organization
  –Daily, weekly, monthly, yearly records
  –Resource usage based on project/individual/organisation
  –Individual log of jobs
  –Metering and charging tool; can decide a pricing system, e.g. Price = f(hardware specifications, software license, usage measurement)
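The pricing idea, Price = f(hardware specifications, software license, usage measurement), can be made concrete with a toy rule and a per-organisation roll-up. The linear formula, the rates, and the record layout below are assumptions chosen for illustration; the slide does not state which function MOGAS uses.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical job record: (organisation, CPU hours, uses licensed software)
JobRecord = Tuple[str, float, bool]


def price(cpu_hours: float, hardware_rate: float, license_fee: float, licensed: bool) -> float:
    """Toy pricing rule: usage times a hardware rate, plus a flat license fee."""
    return cpu_hours * hardware_rate + (license_fee if licensed else 0.0)


def charges_by_organisation(
    jobs: List[JobRecord], hardware_rate: float, license_fee: float
) -> Dict[str, float]:
    """Sum the price of every job under the organisation that ran it."""
    totals: Dict[str, float] = defaultdict(float)
    for organisation, cpu_hours, licensed in jobs:
        totals[organisation] += price(cpu_hours, hardware_rate, license_fee, licensed)
    return dict(totals)


jobs = [("org-a", 10.0, False), ("org-b", 4.0, True), ("org-a", 2.0, True)]
charges = charges_by_organisation(jobs, hardware_rate=0.5, license_fee=3.0)
```

Here `org-a` is charged for two jobs and `org-b` for one; swapping in a different `price` function changes the pricing system without touching the aggregation.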

PRAGMA MOGAS status (27/3/2006)
[Map labels: AIST, Japan; CNIC, China; KISTI, Korea; ASCC, Taiwan; NCHC, Taiwan; UoHyd, India; MU, Australia; BII, Singapore; KU, Thailand; USM, Malaysia; NCSA, USA; SDSC, USA; CICESE, Mexico; UNAM, Mexico; UChile, Chile; TITECH, Japan; MIMOS; IOIT-HCM; NGO, Singapore; QUT. Legend: GT4, GT2]
Map by Cindy Zheng, GGF13, 3/14/05, modified by A/Prof. Bu-Sung Lee

Integrations and Collaborations
Naregi-CA (AIST, Japan) and GAMA (SDSC, USA) integration
Rocks (SDSC, USA) and SCE (KU, Thailand), Ninf-G (AIST), Gfarm (AIST), KISTI etc.
PRAGMA and NLANR
PRAGMA and GEON
  –PRAGMA grid testbed
  –UMC, SDSC (USA)
  –GSCAS, CNIC (China)
  –UoHyd (India)
  –AIST (Japan)
PRAGMA and sensor networks
  –PRAGMA grid testbed
  –NCHC, Taiwan
  –Binghamton University, NY, USA

Grid Interoperation Now (GIN)
GIN testbed (started Feb. 2006)
  –PRAGMA
  –TeraGrid
  –EGEE
First application: TDDFT/Ninf-G
  –Lead: Yoshio Tanaka, Yusuke Tanimura (AIST)
  –Deployed and run
    PRAGMA: AIST, NCSA, SDSC
    TeraGrid: ANL
  –Working on deployment to EGEE: LCG
Middleware interoperability problem
  –Assumptions by middleware about local architecture
  –Standard protocol

Lessons Learned, Issues and Work (1)
Authentication
  –Users obtain initial access
    Process documented by Cindy Zheng, http://pragma-goc.rocksclusters.org/gin/gin-egee.htm
    Not easy, not simple
    Need documentation to guide users
    Develop software to simplify the process
  –DN incompatibility
    Summarized by Oscar Koeroo, http://goc.pragma-grid.net/gin/Cert-probs-GIN.pdf
    Commented by Charles Bacon (Globus)
    Need both standard and flexibility
    VOMS server is modified to handle both styles of DN strings

Lessons Learned, Issues and Work (2)
Software stack and Community Software Area (CSA)
  –Software stack is different among grids. Problems with conflicting requirements. CSA as a solution for users to deploy their sub-stack and share installed software
  –Near term: work on CSA within each grid
    Gfarm-FUSE
  –Need focused discussion on solution for GIN

Lessons Learned, Issues and Work (3)
Cross-grid monitoring
  –Summary by Somsak Sriprayoonsakul, monitor.htm
    Get some monitoring software together, develop a common schema
  –Wiki: http://wiki.pragma-grid.net/index.php?title=GIN_%28Grid_Inter-operation_Now%29_Monitoring

Lessons Summary
Problems and solutions
  –Information sharing (pragma-goc)
  –Trust and access (Naregi-CA, GAMA, myproxy)
  –Resource requirements (INCA)
  –User/application environment (Gfarm)
  –Job submission (Portal/service/middleware)
  –System/job monitoring (SCMSWeb, +)
  –Network monitoring (APAN, NLANR)
  –Resource/job accounting (SCMSWeb, NTU)
  –Fault tolerance (Ninf-G, Nimrod)
Publications
  –Infrastructure, applications, software integration, organization

Thank You
Pointers
PRAGMA:
PRAGMA Testbed:
PRAGMA: Example of Grass-Roots Grid Promoting Collaborative e-science Teams. CTWatch, Vol 2, No. 1, Feb 2006
The PRAGMA testbed – Building a Multi-application International Grid, CCGrid2006
Deploying Scientific Applications to the PRAGMA Grid Testbed: Strategies and Lessons, CCGrid2006
MOGAS: Analysis of Job in a Multi-Organizational Grid Test-bed, CCGrid2006