Download presentation
Presentation is loading. Please wait.
Published byCynthia Miller Modified over 9 years ago
1
CS 591x Grid Computing
2
Observe that - Today’s processors are tremendously powerful, even compared to a few years ago Millions of computers in the world Most are not busy at any one time
3
…Observe that - Large percentage of computers are interconnected via the Internet Networking technology has made tremendous progress Millions of computers have access to relatively high performance networking Networking performance progressing rapidly Internet-2 Lambda Rail – DWDM 10 Gs/fiber
4
…Observe that - Large number of computing problems have become increasingly complex Computational demands of computing programs have outstripped the computational capability of any one computer Yet, world-wide there appears to be a surplus of computational capacity (idle machines)
5
Recall that… Clusters came about by tying together a group of desktop computers… … to harness the computational power of these computers as a collective whole… physically in one place… …with a single common interconnect…
6
But what if…
7
Grid Computing Why not tie computational resources (desktop computers, supercomputers, etc.) together … … and harness their collective computational power. … thus Grid Computing
8
Grid Computing “A computational grid is a hardware and software infrastructure that provides dependable, consistent, pervasive and inexpensive access to high-end computational capabilities”( Foster and Kesselman, 1998)
9
A Grid is… …A collection of computational processing elements… …possibly organized dynamically… …utilizing relatively high performance networking… … to provide computational resources beyond those normally available
10
Grid Computing Primarily accomplished through middleware --software layers that tie discrete computers together into a grid must be based on standards – why? *** participating elements are administratively autonomous ***
11
the Virtual Organization Important concept in grid computing enabled by and part of a grid dynamically “convening” expertise around a problem dynamically “constructing” resources to support the approach to a problem may go away when problem is solved or project is completed
12
Middleware Issues Security transaction/data security authentication Resource Management authorization resource allocation Information services resource monitoring job monitoring Data Management data access data caching
13
Grids --Come in many “flavors”- Cluster of clusters, grids of high performance systems well known, stable resources under administrative management Dynamic grids Cycle “stealing” not so stable resources not always well known little or no communications among processes - sometimes
14
Standards OGSA – Open Grid Services Architecture OGSI – Open Grid Services Infrastructure Infrastructure around which OGSA is built Core grid service specification On-going development through the Global Grid Forum www.ggf.org
15
Globus Implementation of OGSA/OGSI Middleware for deploying a grid
16
Teragrid from: http://www.teragrid.org/userinfo/guide_hardware_table.html
17
TeraGrid Extensible Terascale Computational Facility Ties together HPCs from major national supercomputing centers in the U.S. Massive computational resources Well known, controlled computing environment see http://www.teragrid.org
18
The Sabre Grid Overall managed by PSC composed of clusters from PSC … and WVU (Energy)… … and the Department of Energy (NETL).. … and a Condor flock Early stages
19
Einstein@home
20
Cycle stealing searches for gravitation objects – pulsars in astronomy data runs as a screen saver – when computer is not used Berkeley Open Infrastructure for Network Computing – BOINC BOINC – “An open-source software platform for computing using volunteered resources. “ from:http://boinc.berkeley.edu/
21
Other BOINC based projects SETI@homeSETI@home – search for extraterrestrial intelligence Climateprediction.net – study climate change Predictor@homePredictor@home - investigate protein related diseases
22
Global Grid Exchange Uses central server deploys tasks to “common” computers from a large pool of available computer potentially massive pool of computers primarily Java based no inter-task communications has process fail-over capability
23
Global Grid Exchange Operated by the WV High Technology Consortium Foundation potentially thousands of computers Can run non-Java code requires special “intervention” to get by- pass security
24
Condor Developed and maintained by the University of Wisconsin – Madison Originally – a cycle-stealing approach to gathering high performance computational resources Can function like a cluster or like a grid (flocking)… can be part of a Globus based grid (Condor–G) Supports message passing
25
Others United Devices Unicore
26
Grid Computing further thoughts
27
Types of Grids Desktop Grids collections of computers office grids volunteer compute elements Can be heterogeneous Unreliable
28
Types of Grids Cluster Grids Cluster of Clusters Single system image “completely compiled” code Stable resources Known environment Sabre
29
Types of Grids HPC Grids Grid of “Big Iron” supercomputers Very high performance Stable platform reliable known environment not so many organizational/human issues TeraGrid
30
Types of Grids Data Grids access to distributed data resource global and local resource management common access protocol resources can be very large National Virtual Observatory
31
Requirements for a Grid Interface should provide the user community with a familiar, understandable interface command-line command (like qsub) and tools the user community is familiar with Job Scheduling Should be done in a manner similar to other parallel paradigms Known queuing algorithms
32
Requirements for a Grid Data Management Access to data by distributed processes Grid Global file system does not scale beyond a point Staging/Caching data Consistent namespace Remote Execution Environment User should have control of the execution environment environment variables/parameters
33
Grid Requirements Security Authentication – positively identify users, devices, other resources Confidentiality – information is not disclosed to unauthorized people, systems,… Data integrity – data not modified accidentally, maliciously Non-repudiation – trusted confirmation – “return receipt”
34
Grid Requirements Gang Scheduling process/thread scheduling must be managed grid wide all processes/threads must start/stop at the same time if a process/thread fails, grid must manage the entire job stop job, restart job
35
Grid Requirements Checkpointing and Job Migration Fault-tolerence – Failure recovery Load balancing Checkpointing – automatic, user-induced, none Management tools to manage grid as a system must respect rights, autonomy, authority of components
36
Some Barriers Resource Sharing call for sharing corporate resources things that have cost to companies/organizations System Integrity once someone has code running your computer….? Data Integrity confidence in results – are they correct architecture software environment tampering
37
Some Barriers Availability Critical Grid App vs. Critical Corporate App who gets priority how to assert that priority Ownership who owns the discovery if it was discovered on my computer Intellectual Property – does the U of X own a piece of my work Licensing calls for new licensing models (no named seats)
38
Some Barriers Culpability/Liability if its wrong – who’s to blame Propriety Commericial code running on a state- owned computer inappropriate code
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.