Cactus-G: Experiments with a Grid-Enabled Computational Framework Dave Angulo, Ian Foster Chuang Liu, Matei Ripeanu, Michael Russell Distributed Systems.

Slides:



Advertisements
Similar presentations
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
Advertisements

1 A GPU Accelerated Storage System NetSysLab The University of British Columbia Abdullah Gharaibeh with: Samer Al-Kiswany Sathish Gopalakrishnan Matei.
Gabrielle Allen*, Thomas Dramlitsch*, Ian Foster †, Nicolas Karonis ‡, Matei Ripeanu #, Ed Seidel*, Brian Toonen † * Max-Planck-Institut für Gravitationsphysik.
Resource Management of Grid Computing
Cactus in GrADS Dave Angulo, Ian Foster Matei Ripeanu, Michael Russell Distributed Systems Laboratory The University of Chicago With: Gabrielle Allen,
Cactus in GrADS (HFA) Ian Foster Dave Angulo, Matei Ripeanu, Michael Russell.
Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus Gabrielle Allen, Thomas Dramlitsch, Ian Foster,
Tools for Engineering Analysis of High Performance Parallel Programs David Culler, Frederick Wong, Alan Mainwaring Computer Science Division U.C.Berkeley.
The Cactus Portal A Case Study in Grid Portal Development Michael Paul Russell Dept of Computer Science The University of Chicago
Cactus Code and Grid Programming Here at GGF1: Gabrielle Allen, Gerd Lanfermann, Thomas Radke, Ed Seidel Max Planck Institute for Gravitational Physics,
Grid Application Development Software Project Outline l Resource Selection: Current Directions l Contracts: Current Directions l Current Status –Resource.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
GridLab & Cactus Joni Kivi Maarit Lintunen. GridLab  A project funded by the European Commission  The project was started in January 2002  Software.
Centralized vs. Decentralized Design for Internet Applications Adriana Iamnitchi Department of Computer Science The University of Chicago I.
GridSphere for GridLab A Grid Application Server Development Framework By Michael Paul Russell Dept Computer Science University.
The Globus Toolkit: Description and Applications Review Steve Tuecke & Ian Foster Argonne National Laboratory The University of Chicago Globus Co-PI: Carl.
Loads Balanced with CQoS Nicole Lemaster, Damian Rouson, Jaideep Ray Sandia National Laboratories Sponsor: DOE CCA Meeting – January 22, 2009.
Alok 1Northwestern University Access Patterns, Metadata, and Performance Alok Choudhary and Wei-Keng Liao Department of ECE,
Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra.
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
DISTRIBUTED COMPUTING
Ocean Observatories Initiative Common Execution Infrastructure (CEI) Overview Michael Meisinger September 29, 2009.
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
Cactus Project & Collaborative Working Gabrielle Allen Max Planck Institute for Gravitational Physics, (Albert Einstein Institute)
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
An Integrated Instrumentation Architecture for NGI Applications Ian Foster, Darcy Quesnel, Steven Tuecke Argonne National Laboratory The University of.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
Programming Models & Runtime Systems Breakout Report MICS PI Meeting, June 27, 2002.
1 Cactus in a nutshell... n Cactus facilitates parallel code design, it enables platform independent computations and encourages collaborative code development.
Applications for the Grid Here at GGF1: Gabrielle Allen, Thomas, Dramlitsch, Gerd Lanfermann, Thomas Radke, Ed Seidel Max Planck Institute for Gravitational.
Challenges towards Elastic Power Management in Internet Data Center.
Development Timelines Ken Kennedy Andrew Chien Keith Cooper Ian Foster John Mellor-Curmmey Dan Reed.
Nomadic Grid Applications: The Cactus WORM G.Lanfermann Max Planck Institute for Gravitational Physics Albert-Einstein-Institute, Golm Dave Angulo University.
The Globus Project: A Status Report Ian Foster Carl Kesselman
© 2012 xtUML.org Bill Chown – Mentor Graphics Model Driven Engineering.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
NIH Resource for Biomolecular Modeling and Bioinformatics Beckman Institute, UIUC NAMD Development Goals L.V. (Sanjay) Kale Professor.
Introduction to Grid Computing Ed Seidel Max Planck Institute for Gravitational Physics
Mark Rast Laboratory for Atmospheric and Space Physics Department of Astrophysical and Planetary Sciences University of Colorado, Boulder Kiepenheuer-Institut.
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
Authors: Ronnie Julio Cole David
State Key Laboratory of Resources and Environmental Information System China Integration of Grid Service and Web Processing Service Gao Ang State Key Laboratory.
ANKITHA CHOWDARY GARAPATI
Evolution of the GrADS Software Architecture and Lessons Learned Fran Berman UCSD CSE and SDSC/NPACI.
Cactus/TIKSL/KDI/Portal Synch Day. Agenda n Main Goals:  Overview of Cactus, TIKSL, KDI, and Portal efforts  present plans for each project  make sure.
GridLab WP-2 Cactus GAT (CGAT) Ed Seidel, AEI & LSU Co-chair, GGF Apps RG, Gridstart Apps TWG Gabrielle Allen, Robert Engel, Tom Goodale, *Thomas Radke.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
Connections to Other Packages The Cactus Team Albert Einstein Institute
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Globus Grid Tutorial Part 2: Running Programs Across Multiple Resources.
2/22/2001Greenbook 2001/OASCR1 Greenbook/OASCR Activities Focus on technology to enable SCIENCE to be conducted, i.e. Software tools Software libraries.
GraDS MacroGrid Carl Kesselman USC/Information Sciences Institute.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Albert-Einstein-Institut Exploring Distributed Computing Techniques with Ccactus and Globus Solving Einstein’s Equations, Black.
Dynamic Grid Computing: The Cactus Worm The Egrid Collaboration Represented by: Ed Seidel Albert Einstein Institute
Motivation: dynamic apps Rocket center applications: –exhibit irregular structure, dynamic behavior, and need adaptive control strategies. Geometries are.
Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.
Collaborative Tools for the Grid V.N Alexandrov S. Mehmood Hasan.
Parallel Computing Globus Toolkit – Grid Ayaka Ohira.
Cactus Project & Collaborative Working
Kai Li, Allen D. Malony, Sameer Shende, Robert Bell
Self Healing and Dynamic Construction Framework:
The Cactus Team Albert Einstein Institute
Exploring Distributed Computing Techniques with Ccactus and Globus
University of Technology
Dynamic Grid Computing: The Cactus Worm
Adaptive Grid Computing
Presentation transcript:

Cactus-G: Experiments with a Grid-Enabled Computational Framework Dave Angulo, Ian Foster Chuang Liu, Matei Ripeanu, Michael Russell Distributed Systems Laboratory University of Chicago & Argonne National Laboratory Gabrielle Allen, Thomas Dramlitsch, Ed Seidel, Thomas Radke Max-Planck-Institut für Gravitationsphysik UCSD, UIUC, U.Tenn GrADS Groups

Grid Application Development Software Project Overview l Research goals: Why Cactus-G l Context: Numerical relativity, Cactus, dynamic Grid computing l The Cactus-G Grid-enabled framework l Cactus and GrADS –The Cactus Worm model problem –Dynamic resource selection & code migration –Experimental results l Future directions & lessons learned

Grid Application Development Software Project Research Goals l Investigate methods and structures for efficient Grid execution via in-depth study of a demanding application, including –Constructs for adapting to heterogeneity –Constructs for dynamic resource acquisition l Create testbed for GrADSoft components, as they emerge l Investigate utility of computational frameworks as facilitator of Grid computing

Grid Application Development Software Project Context (1): Numerical Relativity l Numerical simulation of extreme astrophysical events: colliding black holes, neutron stars, etc. –Understand physics –Predict gravitational wave forms l Relativistic effects => Einstein eqns –Computationally intensive (can be 1000s flops/grid point) –3-D simulations only recently possible: demanding users LIGO gravitational wave observatory Colliding black holes

Grid Application Development Software Project Context (2): Cactus ( Allen, Dramlitsch, Seidel, Shalf, Radke) l Modular, portable framework for parallel, multidimensional simulations l Construct codes by linking –Small core (flesh): mgmt services –Selected modules (thorns): Numerical methods, grids & domain decomps, visualization and steering, etc. l Custom linking/configuration tools l Developed for astrophysics, but not astrophysics-specific Cactus “flesh” Thorns

Grid Application Development Software Project Context (3): Dynamic Grid Computing l Application behaviors in a Grid environment: –Identify fastest/cheapest/biggest resources –Configure for efficient execution –Detect need for new resources or behaviors (e.g., due to resource slowdown, new subtasks, new appln regime, user steering, new resource available) –Adapt, and/or discover new resources; invoke subtasks on new resources and/or migrate l We have users who want these behaviors; we also have the enabling machinery

Grid Application Development Software Project Cactus-G: An Application Framework for Dynamic Grid Computing l Cactus thorns for active management of application behavior and resource use l Heterogeneous resources, e.g.: –Irregular decompositions –Variable halo for managing message size –Msg compression (comp/comm tradeoff) –Comms scheduling for comp/comm overlap l Dynamic resource behaviors/demands, e.g.: –Perf monitoring, contract violation detection –Dynamic resource discovery & migration –User notification and steering

Grid Application Development Software Project Cactus-G Example: Terascale Computing Gig-E 100MB/sec SDSC IBM SP 1024 procs 5x12x17 =1020 NCSA Origin Array x12x(4+2+2) =480 OC-12 line But only 2.5MB/sec) l Solved EEs for gravitational waves (real code) –Tightly coupled, communications required through derivatives –Must communicate 30MB/step between machines –Time step take 1.6 sec l Used 10 ghost zones along direction of machines: communicate every 10 steps l Compression/decomp. on all data passed in this direction l Achieved 70-80% scaling, ~200GF (only 14% scaling without tricks)

Grid Application Development Software Project Cactus-G Model Problem: The Cactus Worm l Migrate to “faster/ cheaper” system –When better system discovered –When requirements change –When characteristics change (e.g., competition) l Tests most elements of Cactus-G & GrADS

Grid Application Development Software Project Cactus Worm Architecture Cactus “flesh” “ Tequila” Thorn Appln & other thorns Globus Toolkit substrate: resource discovery, allocation, management GrADS Mechanisms Resource selector Application manager

Grid Application Development Software Project Tequila Thorn Functions l Initiate adaptation on any one of –User request (e.g., HTTP thorn) –Notification of new resources –Application monitoring: contract violation l Request resources (ClassAd protocol) –E.g., GrADS ResourceSelector l Checkpoint application l Contact App Manager to request restart –Security, robustness advantages vs. direct restart

Grid Application Development Software Project Cactus Worm Detailed Architecture & Operation Cactus “flesh” “ Tequila” Thorn Compute resource Compute resource … Code repository … Code repository Storage resource Storage resource … Grid Information Service GrADS Resource Selector Application Manager Appln & other thorns (1) Adapt. request (2) Resource request (3) Write checkpoint (4) Migration request (5) Cactus startup (7) Read checkpoint (0) Possible user input (6) Load code (1’) Resource notification Store models, etc. Query

Grid Application Development Software Project Contract Monitor l Driven by three user-controllable parameters –Time quantum for “time per iteration” –% degradation in time per iteration (relative to prior average) before noting violation –Number of violations before migration l Potential causes of violation –Competing load on CPU –Computation requires more processing power: e.g., mesh refinement, new subcomputation –Hardware problems

Grid Application Development Software Project Current Status l We have developed –Tequila thorn: monitoring, selection, control –ResourceSelector (via ClassAd protocol) –Cactus performance model l We have demonstrated on GrADS Macro Grid –Contract monitoring for multiprocessor runs –Dynamic resource selection –Migration

Grid Application Development Software Project Migration in Action Load applied 3 successive contract violations Running At UIUC (migration time not to scale) Resource discovery & migration Running At UC

Grid Application Development Software Project Ongoing and Future Work: New and Improved Capabilities l Optimize migration process l Use performance models during selection –And include cost of migration & information about future computation in the model l Matchmaker-based ResourceSelector –Separation of concerns between resource characterization and selection –Study resource characterization process, use NWS-based prediction techniques l Dynamic notification of availability of “better” resources

Grid Application Development Software Project Ongoing and Future Work: Further Integration with GrADSoft l Contract monitoring –Pablo –Issues: determining which thorns are monitored, or if flesh is monitored l Program Preparation System l Configurable Object Program and Application Launcher –Cactus has its own launcher and compiles its own code

Grid Application Development Software Project Lessons Learned and Outcomes l Lessons learned –A real & demanding application can exploit adaptive techniques to execute efficiently in Grid environments –Even a relatively regular application can incorporate a range of useful mechanisms for adaptive behaviors & resource demands l Outcomes –Prototype Cactus-G framework: wonderful experimental platform