Grid Computing. What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 –“A computational grid is a hardware.

Slides:



Advertisements
Similar presentations
Database Architectures and the Web
Advertisements

What is Grid Computing? Cevat Şener Dept. of Computer Engineering, METU.
High Performance Computing Course Notes Grid Computing.
A CSEL presentation based on I. Foster, Z. Yong, I. Raicu, and S. Lu, "Cloud Computing and Grid Computing 360-Degree Compared," in Grid Computing Environments.
Data Grids Darshan R. Kapadia Gregor von Laszewski
Introduction to Grids and Grid applications Gergely Sipos MTA SZTAKI
Chapter 9 Designing Systems for Diverse Environments.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
Introduction  What is an Operating System  What Operating Systems Do  How is it filling our life 1-1 Lecture 1.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Grid Services at NERSC Shreyas Cholia Open Software and Programming Group, NERSC NERSC User Group Meeting September 17, 2007.
GRID COMPUTING & GRID SCHEDULERS - Neeraj Shah. Definition A ‘Grid’ is a collection of different machines where in all of them contribute any combination.
Grid Computing Net 535.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
1 Exploring Data Reliability Tradeoffs in Replicated Storage Systems NetSysLab The University of British Columbia Abdullah Gharaibeh Advisor: Professor.
QCDgrid Technology James Perry, George Beckett, Lorna Smith EPCC, The University Of Edinburgh.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 12 Slide 1 Distributed Systems Architectures.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
DISTRIBUTED COMPUTING
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
Lecture 3: Sun: 16/4/1435 Distributed Computing Technologies and Middleware Lecturer/ Kawther Abas CS- 492 : Distributed system.
1 Introduction to Grid Computing. 2 What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 “A computational.
1 School of Computer, National University of Defense Technology A Profile on the Grid Data Engine (GridDaEn) Xiao Nong
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Chapter 4 Realtime Widely Distributed Instrumention System.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Interoperability Grids, Clouds and Collaboratories Ruth Pordes Executive Director Open Science Grid, Fermilab.
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
Issues Autonomic operation (fault tolerance) Minimize interference to applications Hardware support for new operating systems Resource management (global.
Tools for collaboration How to share your duck tales…
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
Authors: Ronnie Julio Cole David
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
August 3, March, The AC3 GRID An investment in the future of Atlantic Canadian R&D Infrastructure Dr. Virendra C. Bhavsar UNB, Fredericton.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
7. Grid Computing Systems and Resource Management
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
2. WP9 – Earth Observation Applications ESA DataGrid Review Frascati, 10 June Welcome and introduction (15m) 2.WP9 – Earth Observation Applications.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
1 TCS Confidential. 2 Objective : In this session we will be able to learn:  What is Cloud Computing?  Characteristics  Cloud Flavors  Cloud Deployment.
Distributed Geospatial Information Processing (DGIP) Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
Parallel Computing Globus Toolkit – Grid Ayaka Ohira.
Grid and Cloud Computing
Clouds , Grids and Clusters
Joseph JaJa, Mike Smorul, and Sangchul Song
Grid Computing.
GRID COMPUTING PRESENTED BY : Richa Chaudhary.
Milestone 2 Include the names of the papers
Presentation transcript:

Grid Computing

What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 –“A computational grid is a hardware and software infrastructure that provides dependable, consistent, pervasive, and inexpensive access to high-end computational facilities” Kleinrock 1969: – “We will probably see the spread of ‘computer utilities’, which, like present electric and telephone utilities, will service individual homes and offices across the country.”

3-point checklist (Foster 2002) Coordinates resources not subject to centralized control Uses standard, open, general purpose protocols and interfaces Deliver nontrivial qualities of service –e.g., response time, throughput, availability, security

Grid Architecture Autonomous, globally distributed computers/clusters

Why do we need Grids? Many large-scale problems cannot be solved by a single computer Globally distributed data and resources

Related Technologies Cluster computing Peer-to-peer computing Internet computing

Grid Computing Versus Cluster Computing Grids tend to be more loosely coupled, heterogeneous, and geographically dispersed. a grid can be dedicated to a specialized application, it is more common that –a single grid will be used for a variety of different purposes. middleware middlewareGrids are often constructed with the aid of general-purpose grid software libraries known as middleware.middleware

Middleware software consists of a set of services that allows multiple processes running on one or more machines to interact. –provides interoperability to support and simplify complex distributed applications. includes web servers, application servers, and similar tools that support application development and delivery. Middleware is especially integral to modern information technology based on XML, SOAP, Web services, and service- oriented architecture.

Use of Middleware Middleware services provide a functional set of application programming interfaces to allow an application to: 1. Locate transparently across the network, thus providing interaction with another service or application 2. Filter data to make them friendly usable or public 3. Be independent from network services 4. Be reliable and always available 5. Add complementary attributes like semantics

Peer-to-Peer Computing All computers have the same status Connect to each other Files can be accessed from any computer on the network Allows data sharing without going through central server Decentralized approach also useful for Grid Using P2P as infrastructure

Peer to Peer Architecture

Internet Computing Idea Idea : many idle PCs on the Internet can perform other computations while not being used “Cycle scavenging” – rely on getting free time on other people’s computers that would be wasted at night, during lunch, or even in the scattered seconds throughout the day when the computer is waiting for user input or slow devices. Example: Participating computers also donate some supporting amount of disk storage space, RAM, and network bandwidth Cycle scavenging advantages Good Utilization of resources Cycle scavenging disadvantages Heat produced by CPU power

Some Grid Applications Distributed supercomputing High-throughput computing On-demand computing Data-intensive computing Collaborative computing

Distributed Supercomputing Idea Idea: aggregate computational resources to tackle problems that cannot be solved by a single system Examples: climate modeling, computational chemistry Challenges include:  Scheduling scarce (rare) and expensive resources  Scalability of protocols and algorithms  Maintaining high levels of performance across heterogeneous systems

High-Throughput Computing Schedule large numbers of independent tasks Goal: exploit unused CPU cycles (e.g., from idle workstations) Utilize unused CPU cycles (Cycle scavenging) Unlike distributed computing, tasks loosely coupled Examples: parameter studies, cryptographic problems

On-Demand Computing Use Grid capabilities to meet short-term requirements for resources that cannot conveniently be located locally Unlike distributed computing, driven by: cost-performance rather than absolute performance Dispatch expensive or specialized computations to remote servers

Data-Intensive Computing Synthesize (integrate) data in geographically distributed repositories Synthesize may be computationally and communication intensive Examples:  High energy physics generate terabytes of distributed data, need complex queries to detect “interesting” events

Collaborative Computing Enable shared use of data archives and simulations Examples:  Collaborative exploration of large geophysical data sets Challenges:  Real-time demands of interactive applications  Rich variety of interactions

Grid Communities Who will use Grids? Broad view  Benefits of sharing outweigh costs  Universal, like a power Grid Narrow view  Cost of sharing across institutional boundaries  Resources only shared when incentive to do so  Grid will be specialized to support specific communities with specific goals

Grid Communities-cont… Government Small number of users Couple small numbers of high-end resources Goals:  Provide “strategic computing reserve” for crisis management  Support collaborative investigations of scientific and engineering problems Need to integrate diverse resources and balance diversity of competing interests

Health Maintenance Organization Share high-end computers, workstations, administrative databases, medical image archives, instruments, etc. across hospitals in a metropolitan area Enable new computationally enhanced applications Private grid  Small scale, central management, common purpose  Diversity of applications and complexity of integration Grid Communities-cont…

Materials Science Collaboratory Scientists operating a variety of instruments (electron microscopes, particle accelerators, X- ray sources) for characterization of materials Highly distributed and fluid community Sharing of instruments, archives, software, computers Virtual Grid  strong focus and narrow goals  Dynamic membership, decentralized, sharing resources Grid Communities-cont…

Computational Market Economy Combine:  Consumers with diverse needs and interests  Providers of specialized services  Providers of compute resources and network providers Public Grid  Need applications that can exploit loosely coupled resources  Need contributors of resources Grid Communities-cont…

Grid Users Many levels of users  Grid developers  Tool developers  Application developers  End users  System administrators

Some Grid challenges Data movement Data replication Resource management Job submission

Some Grid-Related Projects Globus Condor Nimrod-G

Globus Grid Toolkit Open source toolkit for building Grid systems and applications Enabling technology for the Grid Share computing power, databases, and other tools securely online Facilities for:  Resource monitoring  Resource discovery  Resource management  Security  File management

(Self reading) Data Management in Globus Toolkit Data movement  GridFTP  Reliable File Transfer (RFT) Data replication  Replica Location Service (RLS)  Data Replication Service (DRS)

(Self reading) GridFTP High performance, secure, reliable data transfer protocol Optimized for wide area networks Superset of Internet FTP protocol Features:  Multiple data channels for parallel transfers  Partial file transfers  Third party transfers  Reusable data channels  Command pipelining

Condor Original goal: high-throughput computing Harvest wasted CPU power from other machines Can also be used on a dedicated cluster Condor-G – Condor interface to Globus resources

Condor Provides many features of batch systems:  job queuing  scheduling policy  priority scheme  resource monitoring  resource management Users submit their serial or parallel jobs Condor places them into a queue Scheduling and monitoring Informs the user upon completion

Nimrod-G Tool to manage execution of parametric studies across distributed computers Manages experiment  Distributing files to remote systems  Performing the remote computation  Gathering results User submits declarative plan file  Parameters, default values, and commands necessary for performing the work Nimrod-G takes advantage of Globus toolkit features

Grid Case Studies Earth System Grid LIGO TeraGrid

Earth System Grid (ESG) Provide climate studies scientists with access to large datasets Data generated by computational models – requires massive computational power Most scientists work with subsets of the data Requires access to local copies of data

ESG Infrastructure Archival storage systems and disk storage systems at several sites Storage resource managers and GridFTP servers to provide access to storage systems Metadata catalog services Replica location services Web portal user interface

Earth System Grid

Earth System Grid Interface

Laser Interferometer Gravitational Wave Observatory (LIGO) Instruments at two sites to detect gravitational waves Each experiment run produces millions of files Scientists at other sites want these datasets on local storage LIGO deploys (Resources List Server) RLS servers at each site to register local mappings and collect information about mappings at other sites

Large Scale Data Replication for LIGO Goal Goal: detection of gravitational waves Three interferometers at two sites Generate 1 TB of data daily Need to replicate this data across 9 sites to make it available to scientists Scientists need to learn where data items are, and how to access them

LIGO

TeraGrid NSF (National Science Faciities)  high-performance computing facility Nine distributed sites, each with different capability, e.g., computation power, archiving facilities, visualization software Applications may require more than one site Data sizes on the order of gigabytes or terabytes

TeraGrid

Solution: Use GridFTP and RFT with front end command line tool Benefits of system:  Simple user interface  High performance data transfer capability  Ability to recover from both client and server software failures  Extensible configuration

The Reality Many types of Grids exist Private vs. public Regional vs. Global All-purpose vs. particular scientific problem