Milestone 2 Include the names of the papers

Slides:



Advertisements
Similar presentations
Cross-site data transfer on TeraGrid using GridFTP TeraGrid06 Institute User Introduction to TeraGrid June 12 th by Krishna Muriki
Advertisements

Distributed Data Processing
Database Architectures and the Web
High Performance Computing Course Notes Grid Computing.
Data Grids Darshan R. Kapadia Gregor von Laszewski
GridFTP: File Transfer Protocol in Grid Computing Networks
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Office of Science U.S. Department of Energy Grids and Portals at NERSC Presented by Steve Chan.
70-290: MCSE Guide to Managing a Microsoft Windows Server 2003 Environment Chapter 1: Introduction to Windows Server 2003.
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Grid Computing. What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 –“A computational grid is a hardware.
Data Management Kelly Clynes Caitlin Minteer. Agenda Globus Toolkit Basic Data Management Systems Overview of Data Management Data Movement Grid FTP Reliable.
Presenter: Dipesh Gautam.  Introduction  Why Data Grid?  High Level View  Design Considerations  Data Grid Services  Topology  Grids and Cloud.
DISTRIBUTED COMPUTING
1 Introduction to Grid Computing. 2 What is a Grid? Many definitions exist in the literature Early definitions: Foster and Kesselman, 1998 “A computational.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
The Data Grid: Towards an Architecture for the Distributed Management and Analysis of Large Scientific Dataset Caitlin Minteer & Kelly Clynes.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
Chapter 4 Realtime Widely Distributed Instrumention System.
CYBERINFRASTRUCTURE FOR THE GEOSCIENCES Data Replication Service Sandeep Chandra GEON Systems Group San Diego Supercomputer Center.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Chapter 4:- Introduction to Grid and its Evolution Prepared By:- NITIN PANDYA Assistant Professor SVBIT.
Grid Architecture William E. Johnston Lawrence Berkeley National Lab and NASA Ames Research Center (These slides are available at grid.lbl.gov/~wej/Grids)
9 Systems Analysis and Design in a Changing World, Fourth Edition.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
Authors: Ronnie Julio Cole David
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Ruth Pordes November 2004TeraGrid GIG Site Review1 TeraGrid and Open Science Grid Ruth Pordes, Fermilab representing the Open Science.
CEOS Working Group on Information Systems and Services - 1 Data Services Task Team Discussions on GRID and GRIDftp Stuart Doescher, USGS WGISS-15 May 2003.
11 CLUSTERING AND AVAILABILITY Chapter 11. Chapter 11: CLUSTERING AND AVAILABILITY2 OVERVIEW  Describe the clustering capabilities of Microsoft Windows.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
7. Grid Computing Systems and Resource Management
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Super Computing 2000 DOE SCIENCE ON THE GRID Storage Resource Management For the Earth Science Grid Scientific Data Management Research Group NERSC, LBNL.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
INTRODUCTION TO GRID & CLOUD COMPUTING U. Jhashuva 1 Asst. Professor Dept. of CSE.
Data Infrastructure in the TeraGrid Chris Jordan Campus Champions Presentation May 6, 2009.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Parallel Computing Globus Toolkit – Grid Ayaka Ohira.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Accessing the VI-SEEM infrastructure
Grid and Cloud Computing
Clouds , Grids and Clusters
Introduction to Distributed Platforms
The Data Grid: Towards an architecture for Distributed Management
Introduction to Load Balancing:
GWE Core Grid Wizard Enterprise (
Access Grid and USAID November 14, 2007
GGF OGSA-WG, Data Use Cases Peter Kunszt Middleware Activity, Data Management Cluster EGEE is a project funded by the European.
Joseph JaJa, Mike Smorul, and Sangchul Song
Chapter 2: System Structures
A. Rama Bharathi Regd. No: 08931F0040 III M.C.A
Grid Computing.
Database Architectures and the Web
GRID COMPUTING PRESENTED BY : Richa Chaudhary.
Cloud Computing.
Chapter 1: Introduction
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
Ch 4. The Evolution of Analytic Scalability
Grid Computing Done by: Shamsa Amur Al-Matani.
Introduction to Grid Technology
Subject Name: Operating System Concepts Subject Number:
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Distributed Systems Bina Ramamurthy 4/22/2019 B.Ramamurthy.
Database System Architectures
Data Management Components for a Research Data Archive
What is a Grid? Grid - describes many different models
Presentation transcript:

Milestone 2 Include the names of the papers You only have a page – be selective about what you include Be specific; summarize the authors’ contributions, not just what the paper is “about”. You might be able to reuse this text in the final paper if you’re specific and thorough.

Introduction to Grid Computing

Overview Background: What is the Grid? Related technologies Grid applications Communities Grid Tools Case Studies

What is a Grid? Many definitions exist in the literature Early defs: Foster and Kesselman, 1998 “A computational grid is a hardware and software infrastructure that provides dependable, consistent, pervasive, and inexpensive access to high-end computational facilities” Kleinrock 1969: “We will probably see the spread of ‘computer utilities’, which, like present electric and telephone utilities, will service individual homes and offices across the country.”

3-point checklist (Foster 2002) Coordinates resources not subject to centralized control Uses standard, open, general purpose protocols and interfaces Deliver nontrivial qualities of service e.g., response time, throughput, availability, security

Autonomous, globally distributed computers/clusters Grid Architecture Autonomous, globally distributed computers/clusters

Why do we need Grids? Many large-scale problems cannot be solved by a single computer Globally distributed data and resources

Background: Related technologies Cluster computing Peer-to-peer computing Internet computing

Cluster computing Idea: put some PCs together and get them to communicate Cheaper to build than a mainframe supercomputer Different sizes of clusters Scalable – can grow a cluster by adding more PCs

Cluster Architecture

Peer-to-Peer computing Connect to other computers Can access files from any computer on the network Allows data sharing without going through central server Decentralized approach also useful for Grid

Peer to Peer architecture

Internet computing Idea: many idle PCs on the Internet Can perform other computations while not being used “Cycle scavenging” – rely on getting free time on other people’s computers Example: SETI@home What are advantages/disadvantages of cycle scavenging?

Some Grid Applications Distributed supercomputing High-throughput computing On-demand computing Data-intensive computing Collaborative computing

Distributed Supercomputing Idea: aggregate computational resources to tackle problems that cannot be solved by a single system Examples: climate modeling, computational chemistry Challenges include: Scheduling scarce and expensive resources Scalability of protocols and algorithms Maintaining high levels of performance across heterogeneous systems

High-throughput computing Schedule large numbers of independent tasks Goal: exploit unused CPU cycles (e.g., from idle workstations) Unlike distributed computing, tasks loosely coupled Examples: parameter studies, cryptographic problems

On-demand computing Use Grid capabilities to meet short-term requirements for resources that cannot conveniently be located locally Unlike distributed computing, driven by cost-performance concerns rather than absolute performance Dispatch expensive or specialized computations to remote servers

Data-intensive computing Synthesize data in geographically distributed repositories Synthesis may be computationally and communication intensive Examples: High energy physics generate terabytes of distributed data, need complex queries to detect “interesting” events Distributed analysis of Sloan Digital Sky Survey data

Collaborative computing Enable shared use of data archives and simulations Examples: Collaborative exploration of large geophysical data sets Challenges: Real-time demands of interactive applications Rich variety of interactions

Grid Communities Who will use Grids? Broad view Narrow view Benefits of sharing outweigh costs Universal, like a power Grid Narrow view Cost of sharing across institutional boundaries is too high Resources only shared when incentive to do so Grid will be specialized to support specific communities with specific goals

Government Small number of users Couple small numbers of high-end resources Goals: Provide “strategic computing reserve” for crisis management Support collaborative investigations of scientific and engineering problems Need to integrate diverse resources and balance diversity of competing interests

Health Maintenance Organization Share high-end computers, workstations, administrative databases, medical image archives, instruments, etc. across hospitals in a metropolitan area Enable new computationally enhanced applications Private grid Small scale, central management, common purpose Diversity of applications and complexity of integration

Materials Science Collaboratory Scientists operating a variety of instruments (electron microscopes, particle accelerators, X-ray sources) for characterization of materials Highly distributed and fluid community Sharing of instruments, archives, software, computers Virtual Grid strong focus and narrow goals Dynamic membership, decentralized, sharing resources

Computational Market Economy Combine: Consumers with diverse needs and interests Providers of specialized services Providers of compute resources and network providers Public Grid Need applications that can exploit loosely coupled resources Need contributors of resources

Grid Users Many levels of users Grid developers Tool developers Application developers End users System administrators

Some Grid challenges Data movement Data replication Resource management Job submission

Some Grid-Related Projects Globus Condor Nimrod-G

Globus Grid Toolkit Open source toolkit for building Grid systems and applications Enabling technology for the Grid Share computing power, databases, and other tools securely online Facilities for: Resource monitoring Resource discovery Resource management Security File management

Data Management in Globus Toolkit Data movement GridFTP Reliable File Transfer (RFT) Data replication Replica Location Service (RLS) Data Replication Service (DRS)

GridFTP High performance, secure, reliable data transfer protocol Optimized for wide area networks Superset of Internet FTP protocol Features: Multiple data channels for parallel transfers Partial file transfers Third party transfers Reusable data channels Command pipelining

More GridFTP features Auto tuning of parameters Striping Transfer data in parallel among multiple senders and receivers instead of just one Extended block mode Send data in blocks Know block size and offset Data can arrive out of order Allows multiple streams

Striping Architecture Use “Striped” servers

Limitations of GridFTP Not a web service protocol (does not employ SOAP, WSDL, etc.) Requires client to maintain open socket connection throughout transfer Inconvenient for long transfers Cannot recover from client failures

GridFTP

Reliable File Transfer (RFT) Web service with “job-scheduler” functionality for data movement User provides source and destination URLs Service writes job description to a database and moves files Service methods for querying transfer status

RFT

Replica Location Service (RLS) Registry to keep track of where replicas exist on physical storage system Users or services register files in RLS when files created Distributed registry May consist of multiple servers at different sites Increase scale Fault tolerance

Replica Location Service (RLS) Logical file name – unique identifier for contents of file Physical file name – location of copy of file on storage system User can provide logical name and ask for replicas Or query to find logical name associated with physical file location

Data Replication Service (DRS) Pull-based replication capability Implemented as a web service Higher-level data management service built on top of RFT and RLS Goal: ensure that a specified set of files exists on a storage site First, query RLS to locate desired files Next, creates transfer request using RFT Finally, new replicas are registered with RLS

Condor Original goal: high-throughput computing Harvest wasted CPU power from other machines Can also be used on a dedicated cluster Condor-G – Condor interface to Globus resources

Condor Provides many features of batch systems: job queueing scheduling policy priority scheme resource monitoring resource management Users submit their serial or parallel jobs Condor places them into a queue Scheduling and monitoring Informs the user upon completion

Nimrod-G Tool to manage execution of parametric studies across distributed computers Manages experiment Distributing files to remote systems Performing the remote computation Gathering results User submits declarative plan file Parameters, default values, and commands necessary for performing the work Nimrod-G takes advantage of Globus toolkit features

Nimrod-G Architecture

Grid Case Studies Earth System Grid LIGO TeraGrid

Earth System Grid Provide climate studies scientists with access to large datasets Data generated by computational models – requires massive computational power Most scientists work with subsets of the data Requires access to local copies of data

ESG Infrastructure Archival storage systems and disk storage systems at several sites Storage resource managers and GridFTP servers to provide access to storage systems Metadata catalog services Replica location services Web portal user interface

Earth System Grid

Earth System Grid Interface

Laser Interferometer Gravitational Wave Observatory (LIGO) Instruments at two sites to detect gravitational waves Each experiment run produces millions of files Scientists at other sites want these datasets on local storage LIGO deploys RLS servers at each site to register local mappings and collect info about mappings at other sites

Large Scale Data Replication for LIGO Goal: detection of gravitational waves Three interferometers at two sites Generate 1 TB of data daily Need to replicate this data across 9 sites to make it available to scientists Scientists need to learn where data items are, and how to access them

LIGO

LIGO Solution Lightweight data replicator (LDR) Uses parallel data streams, tunable TCP windows, and tunable write/read buffers Tracks where copies of specific files can be found Stores descriptive information (metadata) in a database Can select files based on description rather than filename

TeraGrid NSF high-performance computing facility Nine distributed sites, each with different capability , e.g., computation power, archiving facilities, visualization software Applications may require more than one site Data sizes on the order of gigabytes or terabytes

TeraGrid

TeraGrid Solution: Use GridFTP and RFT with front end command line tool (tgcp) Benefits of system: Simple user interface High performance data transfer capability Ability to recover from both client and server software failures Extensible configuration

TGCP Details Idea: hide low level GridFTP commands from users Copy file smallfile.dat in a working directory to another system: tgcp smallfile.dat tg-login.sdsc.teragrid.org:/users/ux454332 GridFTP command: globus-url-copy -p 8 -tcp-bs 1198372 \ gsiftp://tg-gridftprr.uc.teragrid.org:2811/home/navarro/smallfile.dat \ gsiftp://tg-login.sdsc.teragrid.org:2811/users/ux454332/smallfile.dat

The reality We have spent a lot of time talking about “The Grid” There is “the Web” and “the Internet” Is there a single Grid?

The reality Many types of Grids exist Private vs. public Regional vs. Global All-purpose vs. particular scientific problem