Fundamentals of Grid Computing IBM Redbooks paper Viktors Berstis Presented by: Saeed Ghanbari Saeed Ghanbari.

Slides:



Advertisements
Similar presentations
ITEC474 INTRODUCTION.
Advertisements

Distributed Data Processing
Operating System.
Distributed Computing
Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
High Performance Computing Course Notes Grid Computing.
Introduction to DBA.
Setting up of condor scheduler on computing cluster Raman Sehgal NPD-BARC.
High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
Dinker Batra CLUSTERING Categories of Clusters. Dinker Batra Introduction A computer cluster is a group of linked computers, working together closely.
A Computation Management Agent for Multi-Institutional Grids
Workload Management Workpackage Massimo Sgaravatto INFN Padova.
1 ITC242 – Introduction to Data Communications Week 12 Topic 18 Chapter 19 Network Management.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
EEC-681/781 Distributed Computing Systems Lecture 3 Wenbing Zhao Department of Electrical and Computer Engineering Cleveland State University
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Workload Management Massimo Sgaravatto INFN Padova.
Silberschatz, Galvin and Gagne  Operating System Concepts Common System Components Process Management Main Memory Management File Management.
Grid Computing Net-535 Fall 2013.
Grid Computing Net 535.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment, Enhanced Chapter 1: Introduction to Windows Server 2003.
ATIF MEHMOOD MALIK KASHIF SIDDIQUE Improving dependability of Cloud Computing with Fault Tolerance and High Availability.
Module 13: Configuring Availability of Network Resources and Content.
Chapter 3: Operating-System Structures System Components Operating System Services System Calls System Programs System Structure Virtual Machines System.
PCGRID ‘08 Workshop, Miami, FL April 18, 2008 Preston Smith Implementing an Industrial-Strength Academic Cyberinfrastructure at Purdue University.
Chapter 8 Implementing Disaster Recovery and High Availability Hands-On Virtual Computing.
DISTRIBUTED COMPUTING
Module 7: Fundamentals of Administering Windows Server 2008.
Rensselaer Polytechnic Institute CSCI-4210 – Operating Systems CSCI-6140 – Computer Operating Systems David Goldschmidt, Ph.D.
Military Technical Academy Bucharest, 2004 GRID CONSTRUCTION SOFTWARE COMPONENTS ADINA RIPOSAN Applied Information Technology Department of Computer Engineering.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 3: Operating-System Structures System Components Operating System Services.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
1 Week #10Business Continuity Backing Up Data Configuring Shadow Copies Providing Server and Service Availability.
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Authors: Ronnie Julio Cole David
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
INTRUSION DETECTION SYSYTEM. CONTENT Basically this presentation contains, What is TripWire? How does TripWire work? Where is TripWire used? Tripwire.
VMware vSphere Configuration and Management v6
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
INFSO-RI Enabling Grids for E-sciencE EGEE is a project funded by the European Union under contract INFSO-RI Grid Accounting.
7. Grid Computing Systems and Resource Management
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Status of Globus activities Massimo Sgaravatto INFN Padova for the INFN Globus group
Distributed File System. Outline Basic Concepts Current project Hadoop Distributed File System Future work Reference.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Building Preservation Environments with Data Grid Technology Reagan W. Moore Presenter: Praveen Namburi.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
G. Russo, D. Del Prete, S. Pardi Kick Off Meeting - Isola d'Elba, 2011 May 29th–June 01th A proposal for distributed computing monitoring for SuperB G.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
OpenPBS – Distributed Workload Management System
Clouds , Grids and Clusters
Grid Computing.
Introduction to Operating System (OS)
THE STEPS TO MANAGE THE GRID
Introduction to Cloud Computing
Real IBM C exam questions and answers
An Introduction to Computer Networking
Fault Tolerance Distributed Web-based Systems
Chapter 2: Operating-System Structures
Introduction to Operating Systems
Chapter 2: Operating-System Structures
Presentation transcript:

Fundamentals of Grid Computing IBM Redbooks paper Viktors Berstis Presented by: Saeed Ghanbari Saeed Ghanbari

What is Grid Computing? The term Grid computing originated in the early 1990s as a metaphor for making computer power as easy to access as an electric power grid. –The definitive definition of a Grid is provided by Ian Foster in his article "What is the Grid? Computing resources are not administered centrally. Open standards are used. Non-trivial quality of service is achieved. –Plaszczak/Wellner define Grid technology as "the technology that enables resource virtualization, on-demand provisioning, and service (resource) sharing between organizations." –IBM : "A Grid is a type of parallel and distributed system that enables the sharing, selection, and aggregation of resources distributed across multiple administrative domains based on the resources availability, capacity, performance, cost and users' quality-of-service requirements"

Topics to be covered What grid computing can do Grid concepts and components Grid construction Using a grid –A user’s perspective –An administrator’s perspective –An application developer’s perspective

What grid computing can do(1) Exploiting underutilized resources –Computing: Desktop: less than %5 Even servers in many organizations –Unused disk capacity –Implications: without undue overhead. remote machine must meet any special hardware, software, or resource requirements Parallel CPU capacity –Subjobs on different machines –Barriers often exist to perfect scalability.

What grid computing can do(2) Applications –Grid-enabled applications –no practical tools for transforming arbitrary applications to exploit the parallel capabilities of a grid.

What grid computing can do(3) Virtual resources and virtual organizations for collaboration –More capable than distributed computing Wider audience Open standards, hence highly heterogeneous systems –Data, equipment, software, services, licenses,… –Several real and virtual organizations

What grid computing can do(3) Access to additional resources –special equipment, software, licenses, and other services Resource balancing

What grid computing can do(4) Reliability –Now: redundancy in hardware –Future: Software –Utilize “autonomic computing” Management –More disperse IT infrastructure –Priority among projects

Grid concepts and components(1) Types of resources –Computation –Storage Primary/secondary storage Mountable networked filed system –AFS, NFS, DFS, GPFS Capacity increase Uniform name space Data Stripping

Grid concepts and components(2) Types of resources (cont) Communications –Redundant communication paths Software and licenses –License management software Special equipment, capacities, architectures, and policies –different architectures, operating systems, devices, capacities, and equipment. Jobs and applications –Application is a collection of jobs –Specific dependencies

Grid concepts and components(3) Types of resources (cont) Scheduling, reservation, and scavenging –scheduler automatically finds the most appropriate machine on which to run any given job –scavenging report its idle status to the grid management node. Search for Extraterrestrial Intelligence at Home –Reserved dedicated resources

Grid concepts and components(4) Intragrid to Intergrid –cluster same hardware/software –Intragrid heterogeneous machines/software multiple department/same organization –Intergrid heterogeneous machines/software multiple department/multiple organization

Grid construction(1) Grid software components Management components –resource accounting load sensors –resource evaluation overall usage patterns –autonomic computing Donor software –each machine needs to enroll as a member of the grid and install some software that manages the grid’s use of its resources –authentication –monitoring –check pointing / resuming Submission software

Grid construction(2) Grid software components (cont.) Distributed grid management –hierarchy of clusters Schedulers –job priority system –react to immediate load –monitor the progress of scheduled jobs & re- submisson –reservation system –meta-scheduler Communications –jobs communicate with each other. The open standard Message Passing Interface (MPI)

Using a grid: A user’s perspective(1) Enrolling and installing grid software –authentication for security purposes –certificate authority –decide which resources to donate to the grid Logging onto the grid –grid login ID

Using a grid: A user’s perspective(2) Queries and submitting jobs –staging the input data –different architectures : multiple versions of the program –job execution sandbox –collect results Data configuration –data replication –networked file system caching feature enabled

Using a grid: A user’s perspective(3) Monitoring progress and recovery –Degree of recovery for subjobs that fail –Failures Programming error Hardware or power failure Communications interruption Excessive slowness –Recovery Scheduler User

Using a grid: An administrator’s perspective(1) Planning Installation Managing enrollment of donors and users Certificate authority –It is critical to ensure the highest levels of security in a grid because the grid is designed to execute code and not just share data Positively identify entities requesting certificates Issuing, removing, and archiving certificates Protecting the certificate authority server Maintaining a namespace of unique names for certificate owners Serve signed certificates to those needing to authenticate entities Logging activity

Using a grid: An administrator’s perspective(2) Resource management –setting permissions –Tracking resource usage –Implementing a billing system –policies to achieve better utilization

Using a grid: An application developer’s perspective(1) Applications that are not enabled for using multiple processors but can be executed on different machines. Applications that are already designed to use the multiple processors of a grid setting. Applications that need to be modified or rewritten to better exploit a grid –Tools for debugging and measuring the behavior of grid applications

Using a grid: An application developer’s perspective(2) Globus –developer’s toolkit Manage grid operations Measurement Repair Debug grid applications Open Grid Services Architecture (OGSA)

A brief survey

A quick survey

Enabling Grids for E-sciencE (EGEE) CERN's new particle accelerator –15 petabytes(15 million gigabytes) a year stack of CDs more than 20 km high!!! –200 sites around the globe –Over computers –Runing up to jobs per day Has already served for: – chemical compounds in search of potential drugs for Flu –Simulations of over 40 million potential drug molecules against malaria

Questions ?