Cactus in GrADS (HFA) Ian Foster Dave Angulo, Matei Ripeanu, Michael Russell.

Slides:



Advertisements
Similar presentations
Linkage Editors Difference between a linkage editor and a linking loader: Linking loader performs all linking and relocation operations, including automatic.
Advertisements

GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
GridFTP: File Transfer Protocol in Grid Computing Networks
Cactus in GrADS Dave Angulo, Ian Foster Matei Ripeanu, Michael Russell Distributed Systems Laboratory The University of Chicago With: Gabrielle Allen,
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Supporting Efficient Execution in Heterogeneous Distributed Computing Environments with Cactus and Globus Gabrielle Allen, Thomas Dramlitsch, Ian Foster,
The Cactus Portal A Case Study in Grid Portal Development Michael Paul Russell Dept of Computer Science The University of Chicago
CHEP04 - Interlaken - Sep. 27th - Oct. 1st 2004T. M. Steinbeck for the Alice Collaboration1/27 A Control Software for the ALICE High Level Trigger Timm.
Cactus Code and Grid Programming Here at GGF1: Gabrielle Allen, Gerd Lanfermann, Thomas Radke, Ed Seidel Max Planck Institute for Gravitational Physics,
Grid Application Development Software Project Outline l Resource Selection: Current Directions l Contracts: Current Directions l Current Status –Resource.
Application architectures
GridScape Ding Choon Hoong Grid Computing and Distributed Systems (GRIDS) Lab. The University of Melbourne Melbourne, Australia WW Grid.
OS Spring’03 Introduction Operating Systems Spring 2003.
Scuola Superiore Sant’Anna Project Assignments Operating Systems.
Figure 1.1 Interaction between applications and the operating system.
GridLab & Cactus Joni Kivi Maarit Lintunen. GridLab  A project funded by the European Commission  The project was started in January 2002  Software.
Cross Cluster Migration Remote access support Adianto Wibisono supervised by : Dr. Dick van Albada Kamil Iskra, M. Sc.
Cactus-G: Experiments with a Grid-Enabled Computational Framework Dave Angulo, Ian Foster Chuang Liu, Matei Ripeanu, Michael Russell Distributed Systems.
Slide 3-1 Copyright © 2004 Pearson Education, Inc. Operating Systems: A Modern Perspective, Chapter 3 Operating System Organization.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
Charm++ Load Balancing Framework Gengbin Zheng Parallel Programming Laboratory Department of Computer Science University of Illinois at.
Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.
Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra.
Understanding the CORBA Model. What is CORBA?  The Common Object Request Broker Architecture (CORBA) allows distributed applications to interoperate.
1 Lecture 4: Threads Operating System Fall Contents Overview: Processes & Threads Benefits of Threads Thread State and Operations User Thread.
ICOM 5995: Performance Instrumentation and Visualization for High Performance Computer Systems Lecture 7 October 16, 2002 Nayda G. Santiago.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
Capture and Replay Often used for regression test development –Tool used to capture interactions with the system under test. –Inputs must be captured;
Eric Keller, Evan Green Princeton University PRESTO /22/08 Virtualizing the Data Plane Through Source Code Merging.
Grads Meeting - San Diego Feb 2000 The Cactus Code Gabrielle Allen Albert Einstein Institute Max Planck Institute for Gravitational Physics
CS 390- Unix Programming Environment CS 390 Unix Programming Environment Topics to be covered: Distributed Computing Fundamentals.
Cactus Project & Collaborative Working Gabrielle Allen Max Planck Institute for Gravitational Physics, (Albert Einstein Institute)
NeSC Apps Workshop July 20 th, 2002 Customizable command line tools for Grids Ian Kelley + Gabrielle Allen Max Planck Institute for Gravitational Physics.
ESMF Code Generation Rocky Dunlap Spencer Rugaber Leo Mark Georgia Tech College of Computing.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
Chapter 34 Java Technology for Active Web Documents methods used to provide continuous Web updates to browser – Server push – Active documents.
1 Cactus in a nutshell... n Cactus facilitates parallel code design, it enables platform independent computations and encourages collaborative code development.
Applications for the Grid Here at GGF1: Gabrielle Allen, Thomas, Dramlitsch, Gerd Lanfermann, Thomas Radke, Ed Seidel Max Planck Institute for Gravitational.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
Nomadic Grid Applications: The Cactus WORM G.Lanfermann Max Planck Institute for Gravitational Physics Albert-Einstein-Institute, Golm Dave Angulo University.
CE Operating Systems Lecture 3 Overview of OS functions and structure.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Copyright © cs-tutorial.com. Overview Introduction Architecture Implementation Evaluation.
Framework for MDO Studies Amitay Isaacs Center for Aerospace System Design and Engineering IIT Bombay.
Processes Introduction to Operating Systems: Module 3.
Cactus/TIKSL/KDI/Portal Synch Day. Agenda n Main Goals:  Overview of Cactus, TIKSL, KDI, and Portal efforts  present plans for each project  make sure.
GridLab WP-2 Cactus GAT (CGAT) Ed Seidel, AEI & LSU Co-chair, GGF Apps RG, Gridstart Apps TWG Gabrielle Allen, Robert Engel, Tom Goodale, *Thomas Radke.
ATmospheric, Meteorological, and Environmental Technologies RAMS Parallel Processing Techniques.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Connections to Other Packages The Cactus Team Albert Einstein Institute
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Source Level Debugging of Parallel Programs Roland Wismüller LRR-TUM, TU München Germany.
2.1 Silberschatz, Galvin and Gagne ©2009 Operating System Concepts – 8 th Edition System Programs (p73) System programs provide a convenient environment.
Dynamic Grid Computing: The Cactus Worm The Egrid Collaboration Represented by: Ed Seidel Albert Einstein Institute
Motivation: dynamic apps Rocket center applications: –exhibit irregular structure, dynamic behavior, and need adaptive control strategies. Geometries are.
Cactus Workshop - NCSA Sep 27 - Oct Generic Cactus Workshop: Summary and Future Ed Seidel Albert Einstein Institute
Parallel Computing Presented by Justin Reschke
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
Integrating and Extending Workflow 8 AA301 Carl Sykes Ed Heaney.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Computer System Structures
Simulation Production System
The Cactus Team Albert Einstein Institute
University of Technology
Machine Independent Features
Adaptive Grid Computing
Outline Chapter 2 (cont) OS Design OS structure
System calls….. C-program->POSIX call
Instructor: Xiuwen Liu Department of Computer Science
Presentation transcript:

Cactus in GrADS (HFA) Ian Foster Dave Angulo, Matei Ripeanu, Michael Russell

Presentation Outline  Introduction to Cactus  Cactus Applications  Cactus Architecture  Cactus Worm Thorn  Tequila Thorn (Cactus in GrADs)  Tequila Architectur  Issues

What is Cactus? Cactus is a freely available, modular, portable and manageable environment for collaboratively developing parallel, high-performance multidimensional simulations

Example Cactus Output Example output from Numerical Relativity Simulations

Cactus Applications  Application Thorns are Astrophysics applications  Calculate Schwartzchild Event Horizons for colliding black holes

Cactus Applications (cont.)  Candidate apps are Elliptical Solver or BenchADM  Abstract Topologies are simple 3D Grid

Cactus Applications (cont.)  Applications can easily be “linked” in with the other thorns used as tools.  Application Thorns are just selected and run with the other selected thorns

Cactus Architecture Configure CST Flesh Computational Toolkit Operating Systems AIX NT Linux Unicos Solaris HP-UX Thorns Cactus SuperUX Irix OSF Make

Cactus Model (cont.) Building an executable Cactus Source Flesh IOBasic IOASCII WaveToy LDAP Worm … Thorns Configuration Compiler options Tool options MPI options HDF5 options

Running Cactus Parameter File Specify which thorns to activate Specify global parameters Specify restricted parameters Specify private parameters

Cactus model  This is the currently working Cactus application framework that we will modify Cactus “flesh” internals Cactus Application Thorn(s) Other Cactus Thorn(s) Cactus “Worm” Thorn

Worm Thorn Functions  Initiates moving to new resource when scheduled time is exhausted  Contacts IS to get a new node to run on  Checkpoints application  Restarts application on the new node  Runs on single node only

GrADS Cactus Model  We will start with “Worm” thorn code to make new “Tequila” thorn (Apotheosized Cactus Worm). Cactus “flesh” internals Cactus Application Thorn(s) Other Cactus Thorn(s) Cactus “Tequila” Thorn

Tequila thorn functions  Receives event (generated by user) to initiate adapting resources.  Contacts ResourceSelector to get new bag of resources  Checkpoints application  Restarts application on the new resources

Events  Events that cause the user to want to adapt resources: –User changes parameters during runtime that requires additional resources Example: starting an analysis routine Example: running an event horizon finder –User specifies that performance is not meeting expectations

Future Events  Possible Future Plans for automatic resource adapting: –User changes parameters during runtime that requires additional resources –Contract violations fire similar events we were wrong first time resources get overloaded more (or fewer) (or different) processors appear distribution changed resolution changed

Tequila thorn contacts ResourceSelector  ResourceSelector must be set up as service  Tequila thorn sends request for new bag of resources  ResourceSelector responds with the new bag

Request and Response  The Request to the ResourceSelector will be stored in the InformationService  Only the pointer to the data in the IS will be passed to the ResourceSelector  The Response from the ResourceSelector will also be stored in the IS  Only the pointer to the data in the IS will be passed back.

Tequila communication overview Cactus Tequila Thorn Resource Selector Information Service

Cactus Architecture in GrADS Configure CST Flesh Computational Toolkit Operating Systems AIX NT Linux Unicos Solaris HP-UX Thorns Cactus SuperUX Irix OSF Make Toolkit Grads Communi- cation library

Open Issues  How does Contract Monitor fit into architecture?  How does PPS fit into architecture?  How does COP and Aplication Launcher fit into architecture (Cactus has its own launcher and compiles its own code)?  How does Pablo fit into architecture (Which thorns are monitored, is flesh monitored)?

End of Presentation

Slides Explaining Communication Details  ********************

Communication Details step 1  Event sent to Tequila thorn requesting restart Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 2  Tequila store AART in IS Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 3  Tequila sends request to ResourceSelector passing pointer to data in IS Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 4  ResourceSelector retrieves AART from IS Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 5  ResourceSelector stores bag of resources (in AART) in IS Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 6  ResourceSelector responds to Tequila passing pointer to data in IS Cactus Tqeuila Thorn Resource Selector Information Service

Communication Details step 7  Tequila retrieves AART with new bag of resources from IS Cactus Tqeuila Thorn Resource Selector Information Service

Requirements  Using the IS for communication adds overhead.  Why do this?  GrADS requirement 1: do some things (e.g. compile) at one time and have the results stored in a persistent storage area. Pick these stored results up later and complete other phases.

Requirements (cont.)  GrADS requirement 2: Application people want to be able to allow users to manually interact in any of the "module interfaces." Tequila allows this to be done with a web client.

Slide Explaining Parallelism in Cactus  ***************

Parallelism in Cactus  Cactus is designed around a distributed memory model. Each thorn is passed a section of the global grid.  The actual parallel driver (implemented in a thorn) can use whatever method it likes to decompose the grid across processors and exchange ghost zone information - each thorn is presented with a standard interface, independent of the driver.  Standard driver distributed with Cactus (PUGH) is for a parallel unigrid and uses MPI for the communication layer  PUGH can do custom processor decomposition and static load balancing

Slide with Alternate Tequila Architecture  ***************

Sample Tequila Scenario  User asks to run an ADM simulation 400x400x400 for 1000 timesteps in 10s.  Resource selector contacted to obtain virtual machines  Best virtual machine selected based on performance model.  AM starts Cactus on that virtual machine (and monitors execution Contracts?)  User (or application manager) decides that computation advances too slow and decides to search for a better virtual machine  AM finds a better machine, commands the Cactus run to Checkpoint, transfers files and restart Cactus

Slides Explaining Different Tequila Architectures  *********************

Tequila Architecture Choices  Main presentation explained the short term Tequila Architecture  Open issues covered not-yet-resolved architectural choices for longer term integration

Worm Spawning Cactus Flesh Worm Thorn GIIS 1. Resources Obtained Cactus Flesh Worm Thorn Application Manager 2. Application Manager instructed to spawn new instance 3. New instance spawned

Tequila Spawning  The short-term plan is to simply replace the GIIS with the UCSD Resource Selector  Tequila would make the request for new resources to the RS instead of the GIIS

Tequila Spawning 1. Resources Obtained Cactus Flesh Tequila Thorn UCSD Resource Selector Cactus Flesh Tequila Thorn Application Manager 2. Application Manager instructed to spawn new instance 3. New instance spawned

Tequila Spawning  Longer term plan is not yet resolved.  One possibility is to put all grads pieces into Application Manager

Application Manager Cactus Flesh Tequila Thorn UCSD Resource Selector Cactus Flesh Tequila Thorn Application Manager 1. Application Manager instructed to spawn new instance 3. New instance spawned 2. Resources Obtained