MSA’2000 Metacomputing Systems and Applications

Organizing Committee
F. Desprez, INRIA Rhône-Alpes
E. Fleury, INRIA Lorraine
J.-F. Méhaut, INRIA Rhône-Alpes
Y. Robert, ENS Lyon

Program Committee
Organizing Committee, plus:
H. Bal, Vrije Universiteit, Amsterdam
F. Berman, UCSD San Diego
J. Dongarra, UT Knoxville & ORNL
G. von Laszewski, Argonne
T. Ludwig, TUM München
T. Priol, INRIA Rennes
M. Resch, Stuttgart

The Grid: Blueprint for a New Computing Infrastructure
I. Foster, C. Kesselman (Eds), Morgan Kaufmann, 1999
Chapters by expert authors including Andrew Chien, Jack Dongarra, Tom DeFanti, Andrew Grimshaw, Roch Guerin, Ken Kennedy, Paul Messina, Cliff Neuman, Jon Postel, Larry Smarr, Rick Stevens, and many others

Bibliography
Web
– NPACI (National Partnership for Advanced Computational Infrastructure)
– GrADS (Grid Application Development Software Project) hipersoft.cs.rice.edu/grads
– “An Overview of Computational Grids and Survey of a Few Research Projects”, Jack Dongarra
LIP Report
– “Algorithms and Tools for (Distributed) Heterogeneous Computing: A Prospective Report”

Framework

Metacomputing
Future of parallel computing: distributed and heterogeneous
Metacomputing = making use of distributed collections of heterogeneous platforms
Target = tightly-coupled high-performance distributed applications (rather than loosely-coupled cooperative applications)

Metacomputing Platforms (1)
Low end of the field: cluster computing with heterogeneous networks of workstations or PCs
– Ubiquitous in university departments and companies
– Typical poor man’s parallel computer
– Running large PVM or MPI experiments
– Make use of all available resources: slower machines in addition to more recent ones

Metacomputing Platforms (2)
High end of the field: computational grid linking the most powerful supercomputers of the largest supercomputing centers through dedicated high-speed networks
Middle of the field: connecting medium-size parallel servers (equipped with application-specific databases and application-oriented software) through fast but non-dedicated links, thus creating a “meta-system”

High end: Gusto

Low end (1)
Distributed ASCI Supercomputer (DAS)
– Common platform for research
– (Wide-area) parallel computing and distributed applications
– November 1998, 4 universities, 200 nodes
– Node
    200 MHz Pentium Pro
    128 MB memory, 2.5 GB disk
    Myrinet 1.28 Gbit/s (full duplex)
    Operating System: BSD/OS
– ATM Network

Low end (2)

Administrative Issues
Intensive computations on a set of processors across several countries and institutions
– Strict rules to define the (good) usage of shared resources
– These rules must be guaranteed by the runtime, together with methods to migrate computations to other sites whenever some local request is raised
A major difficulty is to avoid a large increase in the administrative overhead
– Challenge = find a tradeoff that does not increase the administrative load while preserving the users’ security

Tomorrow’s Virtual Super-Computer
Metacomputing applications will execute on a hierarchical grid
– Interconnection of clusters scattered all around the world
A fundamental characteristic of the virtual super-computer:
– A set of strongly heterogeneous and geographically scattered resources

Algorithmic and Software Issues (1)
Whereas the architectural vision is clear, the software developments are not so well understood

Algorithmic and Software Issues (2)
Low end of the field:
– Cope with heterogeneity
– Major algorithmic effort to be undertaken
High end of the field:
– Logically assemble the distributed computers: extensions to PVM and MPI to handle distributed collections of clusters
– Configuration and performance optimization
    Inherent complexity of networked and heterogeneous systems
    Resources often identified at runtime
    Dynamic nature of resource characteristics

Algorithmic and Software Issues (3)
High-performance computing applications must:
– Configure themselves to fit the execution environment
– Adapt their behavior to subsequent changes in resource characteristics
Parallel environments have focused on strongly homogeneous architectures (processor, memory, network)
– Array and loop distribution, parallelizing compilers, HPF constructs, gang scheduling, MPI
However… metacomputing platforms are strongly heterogeneous!

Programming environments

Programming models (1)
Extensions of MPI:
– MPI_Connect, Nexus, PACX-MPI, MPI-Plus, Data-Exchange, VCM, MagPIe, …
Globus: a layered approach
– Fundamental layer = a set of core services, including resource management, security, and communications, that enable the linking and interoperation of distributed computer systems

Programming models (2)
Object-oriented technologies to cope with heterogeneity:
– Encapsulate technical “details” such as protocols, data representations, migration policies
– Legion builds on Mentat, an object-oriented parallel processing system
– Albatross relies on a high-performance Java system, with a very efficient implementation of Java Remote Method Invocation

Programming models (3)
Far from achieving the holy grail:
– Using computing resources remotely and transparently, just as we do with electricity, without knowing where it comes from

References
Globus
Legion
Albatross
AppLeS www-cse.ucsd.edu/groups/hpcl/apples/apples.html
NetSolve

Algorithmic issues

Data Decomposition Techniques for Cluster Computing
Block-cyclic distribution paradigm = preferred layout for data-parallel programs (HPF, ScaLAPACK)
Evenly balances the total workload only if all processors have the same speed
Extending ScaLAPACK to heterogeneous clusters turns out to be surprisingly difficult
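
The load-balancing claim on this slide can be illustrated with a small sketch. It is not ScaLAPACK code: it models a one-dimensional block-cyclic layout only, and the function names, the block size, and the relative speeds are assumptions chosen for the example.

```python
from collections import Counter


def block_cyclic_owner(index, block_size, num_procs):
    """Processor that owns element `index` in a 1D block-cyclic layout."""
    return (index // block_size) % num_procs


def finish_times(num_elements, block_size, speeds):
    """Time each processor needs for its share.

    `speeds` holds hypothetical relative speeds (elements per time unit).
    """
    num_procs = len(speeds)
    counts = Counter(
        block_cyclic_owner(i, block_size, num_procs) for i in range(num_elements)
    )
    return [counts[p] / speeds[p] for p in range(num_procs)]


# Same speeds: every processor finishes at the same time.
homogeneous = finish_times(24, 2, [1.0, 1.0, 1.0])

# Different speeds: the layout still hands out equal shares,
# so the slowest processor dictates the overall completion time.
heterogeneous = finish_times(24, 2, [1.0, 2.0, 4.0])

print(homogeneous)    # [8.0, 8.0, 8.0]
print(heterogeneous)  # [8.0, 4.0, 2.0]
```

With speeds 1, 2 and 4 the fast processors sit idle for most of the run, which is the imbalance the slide refers to.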

Algorithmic challenge
Bad news: designing a matrix-matrix product or a dense linear solver proves a hard task on a heterogeneous cluster!
Next problems:
– Simple linear algebra kernels on a collection of clusters (extending the platform)
– More ambitious routines, composed of a variety of elementary kernels, on a heterogeneous cluster (extending the application)
– Implementing more ambitious routines on more ambitious platforms (extending both)
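
As a point of comparison, the simplest heterogeneous problem, distributing independent and equal-sized work units, is easy to solve. The sketch below uses a standard greedy rule (give the next unit to the processor that would finish it earliest); it is not taken from the talk, and `allocate_units` and the cycle times are hypothetical. The hard cases named on the slide arise because matrix kernels also impose a two-dimensional data layout and communication, which this model ignores.

```python
def allocate_units(num_units, cycle_times):
    """Greedy static allocation of independent, equal-sized work units.

    `cycle_times[p]` is the (assumed) time processor p needs per unit.
    Each unit goes to the processor that would complete it earliest.
    """
    counts = [0] * len(cycle_times)
    for _ in range(num_units):
        best = min(
            range(len(cycle_times)),
            key=lambda p: (counts[p] + 1) * cycle_times[p],
        )
        counts[best] += 1
    return counts


cycle_times = [1.0, 0.5, 0.25]          # relative speeds 1, 2 and 4
counts = allocate_units(24, cycle_times)
makespan = max(c * t for c, t in zip(counts, cycle_times))

print(counts)    # [3, 7, 14]
print(makespan)  # 3.5
```

Under these assumptions the 24 units finish in 3.5 time units, against 8 time units when each processor receives an equal share.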

Collections of clusters (1)
[Figure: collection of clusters; link labels: slower link, fast link]
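
Why the two link classes matter can be seen in a rough message-count model of a broadcast. This is a sketch under stated assumptions, not the algorithm of any library named in the talk: both functions are hypothetical, cluster sizes are given as a list of process counts (each at least 1), and only the number of messages per link class is counted, not latency or bandwidth.

```python
def flat_broadcast(cluster_sizes, root_cluster):
    """Root sends one message directly to every other process.

    Returns (messages over slow inter-cluster links,
             messages over fast intra-cluster links).
    """
    slow = sum(
        size for i, size in enumerate(cluster_sizes) if i != root_cluster
    )
    fast = cluster_sizes[root_cluster] - 1
    return slow, fast


def cluster_aware_broadcast(cluster_sizes, root_cluster):
    """Root sends one message per remote cluster; a local coordinator relays it.

    The message count does not depend on which cluster holds the root,
    so `root_cluster` is kept only for a uniform signature.
    """
    slow = len(cluster_sizes) - 1
    fast = sum(size - 1 for size in cluster_sizes)
    return slow, fast


sizes = [8, 16, 4]  # three clusters, root in the first one
print(flat_broadcast(sizes, 0))           # (20, 7)
print(cluster_aware_broadcast(sizes, 0))  # (2, 25)
```

Both schemes send 27 messages in total, but the cluster-aware one moves all but two of them onto the fast links.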

Conclusion

(A) Algorithmic issues
Difficulties seem largely underestimated
Data decomposition, scheduling heuristics, and load balancing become extremely difficult in the context of metacomputing platforms
The research community focuses on low-level communication protocols and distributed system issues (light-weight process invocation, migration, ...)

(B) Programming level
Which is the right level?
– Data-parallelism unrealistic, due to heterogeneity
– Explicit message passing too low-level
– Object-oriented approaches still require the user to have a deep knowledge of both the application’s behavior and the underlying resources
– Remote computing systems (NetSolve) face severe limitations in efficiently load-balancing the work
– Relying on specialized but highly-tuned libraries of all kinds may prove a good trade-off

(C) Applications
Key applications (from scientific computing to databases) have dictated the way classical parallel machines are used, programmed, and even updated into more efficient platforms
Key applications will strongly influence, or even guide, the development of metacomputing environments

(C) Applications (cont’d)
Which applications will be worth the abundant but hard-to-access resources of the grid?
– Tightly-coupled grand challenges?
– Mobile computing applications?
– Micro-transactions on the Web?
All these applications require new programming paradigms to enable inexperienced users to access the magic grid!

Today’s program

Session 1: Communication and Metacomputing Infrastructures
9:00-10:00, Metacomputing in a High Performance Computing Center (invited talk), M. Resch
10:30-11:00, Scheduling Algorithms for Efficient Gather Operation in Distributed Heterogeneous Systems, Juin-ichi Hatta & Susumu Shibusawa
11:00-11:30, Applying and Monitoring Latency Based Metacomputing Infrastructures, Philipp Drum & Günther Rackl
11:30-12:00, MPC: A New Message Passing Library in Corba, T. Es-sqally, J. Guyard & E. Fleury

Session 2: Scientific Applications and Distributed Computing
14:00-15:00, The NetSolve Environment: Progressing Towards a Seamless Grid (invited talk), D. Arnold & J. Dongarra
15:30-16:00, Specification of a Scilab Meta-Computing Extension, S. Contassot-Vivier, F. Lombard, J-M. Nicod & L. Philippe
16:00-16:30, Extending WebCom: A Proposed Framework for Web based Distributed Computing, J. P. Morrison, J. J. Kennedy & D. A. Power
16:30-17:30, Panel discussion