Grid Computing Sudhindra Rao.

Grid Computing Sudhindra Rao

Outline
  History of Distributed Computing
  Grid: definition, architecture details
  P2P versus Grid
  Web services
  Java: anywhere computing paradigm
  Middleware
  Grid models and recent research
  Research directions
  Tools and grids available
  References

History
  Shift from centralized computing to distributed computing (powerful processors, faster networks)
  Parallel computing based on the MPI and PVM models
  Cluster computing
  Peer-to-peer computing
  Grid computing

Application and Infrastructure technology trends
  [Figure: application and infrastructure trends plotted against time, through the stages Monolithic, Open, Distributed, Virtualized.]
  Application trends: serial applications; parallel applications (multi-threaded, MPI/PVM, OpenMP); client-server (CORBA, COM/DCOM, .NET, J2EE); custom distributed systems; P2P; application integration (reliable messaging, reliable execution); service virtualization (Web services, service registration, service discovery, location-independent service invocation); lifting apps off the servers.
  Infrastructure trends: mainframes; storage (direct-attached storage); open systems (Unix, Windows, Linux); clusters; DRM; infrastructure virtualization (Grid, OGSA, Data Grid, service provisioning).

Technology Evolution: Cluster, Grid, P2P
  [Figure: timeline from 1960 to 2000 with a NETWORKING track and a COMPUTING track.]
  Networking: Sputnik, ARPANET, Email, Ethernet, TCP/IP, IETF, Internet Era, WWW Era, Mosaic, HTML, XML, W3C, Web Services
  Computing: Mainframes, Minicomputers, Crays, MPPs, XEROX PARC worm, PCs, Workstations, WS Clusters, PC Clusters, HTC, PDAs, P2P, Grids

What is a Cluster/Grid?
  A type of parallel and distributed system that enables the sharing, selection, and aggregation of resources distributed across administrative domains, depending on their availability, capability, performance, cost, and users' quality-of-service requirements.
  [Figure: a Grid shown next to a single Cluster.]

Approaches for Parallel Programming
  Implicit parallelism: supported by parallel languages and parallelizing compilers that take care of identifying parallelism, scheduling calculations, and placing data.
  Explicit parallelism: the programmer is responsible for most of the parallelization effort, such as task decomposition, mapping tasks to processors, and the communication structure. This approach assumes that the user is often the best judge of how parallelism can be exploited for a particular application.
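The explicit approach can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (decompose, partial_sum_of_squares, parallel_sum_of_squares) are hypothetical, and a thread pool stands in for the processors so the example stays self-contained. For CPU-bound work one would more likely use a process pool or MPI.

```python
from concurrent.futures import ThreadPoolExecutor


def decompose(data, n_tasks):
    """Task decomposition: split the input into n_tasks contiguous chunks."""
    size, extra = divmod(len(data), n_tasks)
    chunks, start = [], 0
    for i in range(n_tasks):
        end = start + size + (1 if i < extra else 0)
        chunks.append(data[start:end])
        start = end
    return chunks


def partial_sum_of_squares(chunk):
    """The work each task performs independently on its own chunk."""
    return sum(x * x for x in chunk)


def parallel_sum_of_squares(data, n_workers=4):
    chunks = decompose(data, n_workers)
    # Mapping: the programmer decides that each chunk goes to one worker.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partials = list(pool.map(partial_sum_of_squares, chunks))
    # Communication structure: a single gather-and-reduce step.
    return sum(partials)
```

All three responsibilities named on the slide (decomposition, mapping, communication) are written by hand here. With implicit parallelism the programmer would write only the sequential loop.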

Parallel Programming Models and Tools
  Shared memory model: DSM; Threads/OpenMP (enabled for clusters); Java threads (HKU JESSICA, IBM cJVM)
  Message passing model: PVM; MPI
  Hybrid model: mixing the shared and distributed memory models; using OpenMP and MPI together
  Object and service oriented models: wide-area distributed computing technologies; OO: CORBA, DCOM, etc.; Services: Web Services-based service composition
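The difference between the first two models can be shown with a small Python sketch. It is an approximation, not a use of the tools listed: both versions run as threads in one process, and an in-process queue stands in for the messages that PVM or MPI would send between nodes. The function names are hypothetical.

```python
import queue
import threading


def shared_memory_sum(values, n_threads=4):
    """Shared memory model: threads update one shared variable under a lock."""
    total = 0
    lock = threading.Lock()

    def worker(chunk):
        nonlocal total
        local = sum(chunk)
        with lock:  # protect the shared variable from concurrent updates
            total += local

    threads = [threading.Thread(target=worker, args=(values[i::n_threads],))
               for i in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return total


def message_passing_sum(values, n_workers=4):
    """Message passing model: workers share no state and send results as messages."""
    outbox = queue.Queue()

    def worker(chunk):
        outbox.put(sum(chunk))  # the only interaction is an explicit send

    threads = [threading.Thread(target=worker, args=(values[i::n_workers],))
               for i in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    # The coordinator receives exactly one message per worker.
    return sum(outbox.get() for _ in range(n_workers))
```

A hybrid program would combine the two: message passing between nodes and shared memory among the threads within each node.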

Levels of Parallelism
  Code granularity: code item; exploited by
  Large grain (task level): program; PVM/MPI
  Medium grain (control level): function (thread); threads
  Fine grain (data level): loop; compilers
  Very fine grain (multiple issue): with hardware; CPU
  [Figure: tasks i-1, i, i+1 at the top; functions func1(), func2(), func3() below them; array statements a(0)=.., b(0)=.., a(1)=.., b(1)=.., a(2)=.., b(2)=..; and the operations +, x, Load at the hardware level.]

Cluster Architecture
  [Figure: layered cluster architecture, top to bottom.]
  Sequential applications and parallel applications
  Parallel programming environment
  Cluster middleware (single system image and availability infrastructure)
  Nodes: PC/workstation, each with communications software and network interface hardware
  Cluster interconnection network/switch

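A Typical P2P Computing Environment
  [Figure: peers (P1, P2, P3, p4, p5, ..., pM, pN), each running a peer agent and an application, connected to a Peer Discovery Service.]
  A peer asks the discovery service: "Who can help?"
  The discovery service answers: "Peer P2, P7 can help!"
  The peer sends a request to P2, which replies: "Sorry, I am busy."
  The peer agent sends a request to another peer and receives a response.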
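The exchange in this figure (ask who can help, get a candidate list, try each candidate until one accepts) can be sketched as follows. The class and method names are hypothetical and everything runs in one process. A real deployment would use a P2P framework such as Jxta and network messages.

```python
class Peer:
    def __init__(self, name, busy=False):
        self.name = name
        self.busy = busy

    def handle_request(self, job):
        """Return a result, or None to signal 'Sorry, I am busy.'"""
        if self.busy:
            return None
        return f"{self.name} ran {job}"


class DiscoveryService:
    def __init__(self):
        self._peers = {}

    def register(self, peer):
        self._peers[peer.name] = peer

    def who_can_help(self, requester):
        """Answer 'Who can help?' with every registered peer except the requester."""
        return [p for name, p in self._peers.items() if name != requester.name]


def submit(job, requester, discovery):
    """Try the candidates in order and return the first response, or None."""
    for candidate in discovery.who_can_help(requester):
        response = candidate.handle_request(job)
        if response is not None:
            return response
    return None
```

Note that the discovery service only names candidates. Whether a peer accepts the job is decided by that peer, which is why the requester must be prepared for a refusal.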

CPM: DC Economy-based P2P Computing (Jxta-based implementation)
  [Figure: components of the CPM architecture.]
  Market Server, Market Repository, Discovery and Membership, CPM Agent, User (Consumer), Bill, Trader, Job Management, Resources (Provider), Accounting

Definition of a Grid
  A Grid is a type of parallel and distributed system that enables the sharing, selection, and aggregation of geographically distributed "autonomous" resources dynamically at runtime, depending on their availability, capability, performance, cost, and users' quality-of-service requirements.
  Coordinated resource sharing and problem solving in dynamic, multi-institutional Virtual Organizations (VOs).
  Most current distributed technologies facilitate this in a local environment; J2EE, CORBA, and VPN are a few examples.
  Nomadic users and applications provide new avenues for providing such a service.
  Mechanisms are required to coordinate trusted and untrusted access to resources.

Grid Architecture

A Typical Grid Computing Environment
  [Figure: an application hands work to a Grid Resource Broker, which consults the Grid Information Service and a database and dispatches to resources R1, R2, R3, R4, R5, R6, ..., RN.]

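Virtual Drug Design: a Virtual Lab for "Molecular Modeling for Drug Design" on a P2P Grid
  [Figure: message flow among the user, the Resource Broker, the Data Replica Catalogue, the Grid Market Directory, the Grid Info. Service, Grid nodes running a GTS (Grid Trade Server), and the Protein Data Banks PDB1 and PDB2.]
  User to Resource Broker: "Screen 2K molecules in 30min. for $10"
  Broker to Data Replica Catalogue: "Give me list PDBs sources of type aldrich_300?"
  Broker to Grid Market Directory: "service cost?"
  Broker to Grid Info. Service: "service providers?"
  The Resource Broker maps suitable Grid nodes and Protein DataBank.
  Broker to Grid node: "get mol.10 from pdb1 & screen it."
  Grid nodes to the PDBs: "mol.10 please?", "mol.5 please?"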
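A request such as "Screen 2K molecules in 30min. for $10" combines a deadline and a budget. One way a broker could check such a request is sketched below. This is a simple greedy heuristic written for illustration, not the algorithm of any broker named in these slides. The Resource fields, the prices, and the throughput figures are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    name: str
    jobs_per_minute: float  # hypothetical throughput figure
    cost_per_job: float     # hypothetical price in dollars


def plan(resources, n_jobs, deadline_min, budget):
    """Fill the cheapest resources first, each up to what it can finish by the deadline.

    Returns {resource name: number of jobs}, or None if the deadline
    or the budget cannot be met.
    """
    allocation, remaining, spent = {}, n_jobs, 0.0
    for r in sorted(resources, key=lambda r: r.cost_per_job):
        if remaining == 0:
            break
        capacity = int(r.jobs_per_minute * deadline_min)
        jobs = min(capacity, remaining)
        if jobs == 0:
            continue
        allocation[r.name] = jobs
        spent += jobs * r.cost_per_job
        remaining -= jobs
    if remaining > 0 or spent > budget:
        return None
    return allocation
```

With two made-up providers, `plan([Resource("A", 40, 0.003), Resource("B", 50, 0.007)], 2000, 30, 10.0)` assigns 1200 jobs to A and 800 to B. Tightening the budget to $5 or the deadline to 10 minutes makes the request infeasible, and the function returns None.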

Scalable Seamless Computing: Breaking Administrative Barriers
  [Figure: performance rises as administrative barriers are crossed.]
  Administrative barriers: Individual, Group, Department, Campus, State, National, Globe, Inter Planet, Galaxy
  Platforms: Desktop, SMPs or SuperComputers, Local Cluster, Enterprise Cluster/Grid, Global Cluster/Grid, Inter Planetary Grid!

Basic Elements
  Application Development Tools
  Security
  Uniform Access
  System Management
  Computational Economy
  Resource Discovery
  Resource Allocation & Scheduling
  Data locality
  Network Management

Cluster, Grid, P2P: Characteristics
  Values are listed in the order Cluster / Grid / P2P.
  Population: commodity computers / high-end computers / edge of network (desktop PC)
  Ownership: single / multiple
  Discovery: membership services / centralised index & decentralised info / decentralized
  User management: centralised / decentralised
  Resource mgmt: centralized / distributed
  Allocation/Scheduling
  Inter-operability: VIA based? / no standards yet / no standards
  Single system image: yes / no
  Scalability: 100s / 1000? / Millions? [@Home]
  Capacity: guaranteed / varies, but high / varies
  Throughput: medium / high / very high
  Speed (lat., bandwidth): low, high / high, low

Issues in Grid computing
  Protocols are required for interoperability.
  Standard services must be defined for access to computation, data, resource discovery, etc.
  APIs and SDKs are needed to assist such protocol and service deployment.
  Current distributed computing: resource sharing within a single organization, limited to certain resource types.
  Services are needed to support a common set of applications: middleware.

Projects
  Globus: a toolkit for grid computing infrastructure development
  Gridbus
  Legion
  OGSA: standard for developing Grid application infrastructure (derived from Globus)

Grid Computing Approaches
  Mix-and-match
  Object-oriented
  Internet/partial-P2P
  Network-enabled solvers: NetSolve
  Economic-based utility / service-oriented computing: Nimrod-G

Some Global Initiatives
  USA: AppLeS, Globus, Legion, Sun Grid Engine, NASA IPG, Condor-G, Jxta, NetSolve, AccessGrid, and many more
  Cycle stealing & .com initiatives: Distributed.net, SETI@Home, Entropia, UD, SCS, ...
  Public forums: Global Grid Forum, Australian Grid Forum, IEEE TFCC, CCGrid conference, P2P conference
  Australia: Nimrod-G, Gridbus, GridSim, Virtual Lab, DISCWorld, GrangeNet, etc.
  Europe: UK eScience, EU Data Grid, Cactus, XtremeWeb, etc.
  India: I-Grid
  Japan: Ninf, DataFarm
  Korea: N*Grid
  Singapore: NGP

Globus Approach
  A toolkit and collection of services addressing key technical problems
  Modular "bag of services" model
  Not a vertically integrated solution
  General infrastructure tools (aka middleware) that can be applied to many application domains
  Inter-domain issues, rather than clustering
  Integration of intra-domain solutions
  Distinguish between local and global services

Grid computing: SuperScalar model (IBM)
  Goal: ease the programming of Grid applications.
  Basic idea: treat the Grid like a superscalar processor, with the time scale of each operation moving from ns to seconds/minutes/hours.

Automatic code generation
  [Figure: gsstubgen takes app.idl as input and produces files for the client and the server.]
  Client: app.c, app-stubs.c, app.h
  Server: app-worker.c, app-functions.c

Automatic code generation
  [Figure: the client (app.c, app-stubs.c) runs on top of the GRID superscalar runtime and GT2, which connect it to each server i (app-functions.c, app-worker.c).]
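The slides show only file names, so the following Python sketch is a guess at the general stub/worker pattern those names suggest, not a reproduction of what gsstubgen generates. The assumed roles are: a client-side stub looks like a local function but packages the call as a request, and a server-side worker decodes the request and calls the user's function. The JSON encoding, the function names, and the in-process "transport" are all hypothetical.

```python
import json


# Server side: a user-written function (the role app-functions.c appears to play).
def add(a, b):
    return a + b


FUNCTIONS = {"add": add}


def worker(message):
    """Worker role (cf. app-worker.c): decode a request, call the function, encode the reply."""
    request = json.loads(message)
    result = FUNCTIONS[request["op"]](*request["args"])
    return json.dumps({"result": result})


def make_stub(op, transport):
    """Stub role (cf. app-stubs.c): looks like a local function but ships the call elsewhere."""
    def stub(*args):
        reply = transport(json.dumps({"op": op, "args": list(args)}))
        return json.loads(reply)["result"]
    return stub


# Assumption: a direct call to worker() stands in for the runtime and the Grid middleware.
remote_add = make_stub("add", transport=worker)
```

The main program (the role of app.c) then calls `remote_add(2, 3)` as if it were an ordinary function. Generating the stub and the worker from one interface description keeps the two sides consistent.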

Production Grids & Testbeds
  NASA's Information Power Grid
  The Alliance National Technology Grid
  GUSTO Testbed

Testbed Statistics
  Grid nodes: 218, distributed across 62 sites in 21 countries (laptops, desktop PCs, WS, SMPs, clusters, supercomputers)
  Total CPUs: 3000+ (~3 TeraFlops)
  CPU architecture: Intel x86, IA64, AMD, PowerPC, Alpha, MIPS
  Operating systems: Windows or Unix variants (Linux, Solaris, AIX, OSF, Irix, HP-UX)
  Intranode network: Ethernet, Fast Ethernet, Gigabit, Myrinet, QsNet, PARAMNet
  Internet/wide area networks: GrangeNet, AARNet, ERNet, APAN, TransPAC, and so on

Grid Technologies and Applications
  [Figure: layered stack, top to bottom.]
  Grid applications: Natural Language Engineering, High Energy Physics, Brain Activity Analysis, Molecular Docking, Portfolio Analysis, GAMESS Chemistry
  User-level middleware (Grid tools), high-level services and tools: G-Monitor, Programming Framework, Gridscape; Grid brokers & schedulers: Nimrod-G, Gridbus Data Broker
  Core Grid middleware: Alchemi (.NET Grid services + clustering of desktop PCs), Globus, Data Management Services, Grid Bank, GMD, MDS, GRAM, GASS, PKI-based Grid Security Interface (GSI)
  Grid fabric: .NET, JVM, Condor, PBS, SGE, LSF, Tomcat; Windows, Solaris, Linux, AIX, IRIX, OSF1, HP UX

Classes of Applications that can be powered by Grids
  Distributed HPC (supercomputing): computational science.
  High-capacity/throughput computing: large-scale simulation, chip design, and parameter studies.
  Content sharing (free or paid): sharing digital content among peers (e.g., Napster).
  Remote software access/renting services: application service providers (ASPs) and Web services.
  Data-intensive computing: drug design, particle physics, stock prediction...
  On-demand, real-time computing: medical instrumentation and mission-critical applications.
  Collaborative computing: collaborative design, data exploration, education.
  Service-oriented computing (SOC), towards economic-based utility computing: new paradigm, new applications, new industries, and new business.

Analysis Summary
  Columns: application; data size; processing time; nodes
  Belle Analysis (HEP); 300 MB input (100 jobs, 3 MB each); 30 min.; Australia, Japan
  Financial Portfolio Analysis; 50 MB output (50 jobs, 1 MB each); 20 min.; Global
  Newswire Indexing; 80 MB input (12 jobs, 7 MB each job); (no processing time given); GrangeNet, Australia
  GAMESS; 4 KB for each job, total output 860 MB compressed; each job took 5-78 minutes, total 15 hours; 130 nodes, 15 sites

What is Grid computing?
  Grid is the next-generation internet
  Grid requires a distributed operating system
  Grid requires new programming models
  Grid does not need high performance computers

Research directions
  Publisher/subscriber systems on the Grid: how can the Grid be used to manage such applications, and what are the issues?
  What levels of selectivity and regionalism are expected from VOs?
  How to handle the dynamics of the topology and nodes?
  Addressing QoS on the Grid: best effort?
  Efficient discovery and retrieval
  Replication techniques

References
  List of available resources on grid computing: http://www.gridcomputing.com
  Foster, I., Kesselman, C., and Tuecke, S., "The Anatomy of the Grid: Enabling Scalable Virtual Organizations", Intl J. SuperComputer Applications, 2001
  Casanova, H., "Distributed Computing Research Issues in Grid Computing", ACM SIGACT News Distributed Computing Column 8, July 2002
  Lau, F., Ho, R., and Wang, C., "Grid Computing: Challenges and Design Approaches"
  Foster, I., and Kesselman, C. (eds.), "The Grid: Blueprint for a New Computing Infrastructure", Elsevier, 2004